NOVEL HUMAN GENES AND GENE EXPRESSION PRODUCTS I



Cross-References to Related Applications 

This application is a continuation-in-part of U.S. provisional patent application serial
no. 60/068,755, filed December 23, 1997, and of U.S. provisional patent application serial
no. 60/080,664, filed April 3, 1998, and of U.S. provisional patent application serial no.
60/105,234, filed October 21, 1998, each of which is incorporated herein by reference.



Field of the Invention

The present invention relates to novel polynucleotides, particularly to novel
polynucleotides of human origin that are expressed in a selected cell type, are differentially
expressed in one cell type relative to another cell type (e.g., in cancerous cells, or in cells of a
specific tissue origin) and/or share homology to polynucleotides encoding a gene product
having an identified functional domain and/or activity.



Background of the Invention 

Identification of novel polynucleotides, particularly those that encode an expressed
gene product, is important in the advancement of drug discovery, diagnostic technologies,
and the understanding of the progression and nature of complex diseases such as cancer.
Identification of genes expressed in different cell types isolated from sources that differ in
disease state or stage, developmental stage, exposure to various environmental factors, the
tissue of origin, the species from which the tissue was isolated, and the like is key to
identifying the genetic factors that are responsible for the phenotypes associated with these
various differences.

This invention provides novel human polynucleotides, the polypeptides encoded by 
these polynucleotides, and the genes and proteins corresponding to these novel 
polynucleotides. 



Summary of the Invention

This invention relates to novel human polynucleotides and variants thereof, their
encoded polypeptides and variants thereof, to genes corresponding to these polynucleotides
and to proteins expressed by the genes. The invention also relates to diagnostic and
therapeutic agents employing such novel human polynucleotides, their corresponding genes
or gene products, e.g., these genes and proteins, including probes, antisense constructs, and
antibodies.

Accordingly, in one embodiment, the present invention features a library of
polynucleotides, the library comprising the sequence information of at least one of SEQ ID
NOS: 1-844. In related aspects, the invention features a library provided on a nucleic acid
array, or in a computer-readable format.

In one embodiment, the library comprises a differentially expressed polynucleotide
comprising a sequence selected from the group consisting of SEQ ID NOS: 9, 39, 42, 52, 62,
74, 119, 172, 317, and 379. In specific related embodiments, the library comprises: 1) a
polynucleotide that is differentially expressed in a human breast cancer cell, where the
polynucleotide comprises a sequence selected from the group consisting of SEQ ID NOS: 4,
9, 39, 42, 52, 62, 65, 66, 68, 74, 81, 114, 123, 144, 130, 157, 162, 172, 178, 183, 202, 214,
219, 223, 258, 298, 317, 338, 379, 384, 386, and 388; 2) a polynucleotide differentially
expressed in a human colon cancer cell, where the polynucleotide comprises a sequence
selected from the group consisting of SEQ ID NOS: 1, 39, 52, 97, 119, 134, 172, 176, 241,
288, 317, 357, 362, and 374; or 3) a polynucleotide differentially expressed in a human lung
cancer cell, where the polynucleotide comprises a sequence selected from the group
consisting of SEQ ID NOS: 9, 34, 42, 62, 74, 106, 119, 135, 154, 160, 260, 308, 323, 349,
361, 369, 371, 379, 395, 381, and 400.

In another aspect, the invention features an isolated polynucleotide
comprising a nucleotide sequence having at least 90% sequence identity to an identifying
sequence of SEQ ID NOS: 1-844 or a degenerate variant thereof. In related aspects, the
invention features recombinant host cells and vectors comprising the polynucleotides of the
invention, as well as isolated polypeptides encoded by the polynucleotides of the invention
and antibodies that specifically bind such polypeptides.

In one embodiment, the invention features an isolated polynucleotide comprising a
sequence encoding a polypeptide of a protein family selected from the group consisting of:
4 transmembrane segments integral membrane proteins, 7 transmembrane receptors,
ATPases associated with various cellular activities (AAA), eukaryotic aspartyl proteases,






WO 99/33982 PCT/US98/27610 
GATA family of transcription factors, G-protein alpha subunit, phorbol esters/diacylglycerol
binding proteins, protein kinase, protein phosphatase 2C, protein tyrosine phosphatase,
trypsin, wnt family of developmental signaling proteins, and WW/rsp5/WWP domain
containing proteins. In a specific related embodiment, the invention features a
polynucleotide comprising a sequence of one of SEQ ID NOS: 24, 41, 101, 157, 291, 305,
315, 341, 63, 116, 134, 136, 151, 384, 404, 308, 213, 367, 188, 251, 202, 315, 367, 397, 256,
382, 169, 23, 291, 324, 330, 341, 353, 188, 379, and 395.

In another embodiment, the invention features a polynucleotide comprising a
sequence encoding a polypeptide having a functional domain selected from the group
consisting of: Ank repeat, basic region plus leucine zipper transcription factors,
bromodomain, EF-hand, SH3 domain, WD domain/G-beta repeats, zinc finger (C2H2 type),
zinc finger (CCHC class), and zinc-binding metalloprotease domain. In a specific related
embodiment, the invention features a polynucleotide comprising a sequence of one of SEQ
ID NOS: 116, 251, 374, 97, 136, 242, 379, 306, 386, 18, 335, 61, 306, 386, 322, 306, and
395.

In another aspect, the invention features a method of detecting differentially
expressed genes correlated with a cancerous state of a mammalian cell, where the method
comprises the step of detecting at least one differentially expressed gene product in a test
sample derived from a cell suspected of being cancerous, where the gene product is encoded
by a gene corresponding to a sequence of at least one of SEQ ID NOS: 4, 9, 39, 42, 52, 62,
65, 66, 68, 74, 81, 114, 123, 144, 130, 157, 162, 172, 178, 183, 202, 214, 219, 223, 258, 298,
317, 338, 379, 384, 386, 388, 1, 39, 52, 97, 119, 134, 172, 176, 241, 288, 317, 357, 362, 374,
9, 34, 42, 62, 74, 106, 119, 135, 154, 160, 260, 308, 323, 349, 361, 369, 371, 379, 395, 381,
and 400. Detection of the differentially expressed gene product is correlated with a
cancerous state of the cell from which the test sample was derived. In one embodiment, the
detecting is by hybridization of the test sample to a reference array, wherein the reference
array comprises an identifying sequence of at least one of SEQ ID NOS: 1-844.

In one embodiment of the method of the invention, the cell is a breast tissue derived
cell, and the differentially expressed gene product is encoded by a gene corresponding to a
sequence of at least one of SEQ ID NOS: 4, 9, 39, 42, 52, 62, 65, 66, 68, 74, 81, 114, 123,
144, 130, 157, 162, 172, 178, 183, 202, 214, 219, 223, 258, 298, 317, 338, 379, 384, 386,
and 388.

In another embodiment of the method of the invention, the cell is a colon tissue
derived cell, and the differentially expressed gene product is encoded by a gene corresponding
to a sequence of at least one of SEQ ID NOS: 1, 39, 52, 97, 119, 134, 172, 176, 241, 288, 317,
357, 362, and 374.

In yet another embodiment of the method of the invention, the cell is a lung tissue
derived cell, and the differentially expressed gene product is encoded by a gene corresponding
to a sequence of at least one of SEQ ID NOS: 9, 34, 42, 62, 74, 106, 119, 135, 154, 160, 260,
308, 323, 349, 361, 369, 371, 379, 395, 381, and 400.

Other aspects and embodiments of the invention will be readily apparent to the 
ordinarily skilled artisan upon reading the description provided herein. 

Detailed Description of the Invention 

The invention relates to polynucleotides comprising the disclosed nucleotide 
sequences, to full length cDNA, mRNA and genes corresponding to these sequences, and to 
polypeptides and proteins encoded by these polynucleotides and genes. 

Also included are polynucleotides that encode polypeptides and proteins encoded by 
the polynucleotides of the Sequence Listing. The various polynucleotides that can encode 
these polypeptides and proteins differ because of the degeneracy of the genetic code, in that 
most amino acids are encoded by more than one triplet codon. The identity of such codons 
is well-known in this art, and this information can be used for the construction of the 
polynucleotides within the scope of the invention. 
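
The degeneracy described above can be illustrated with a short sketch that enumerates every
DNA sequence encoding a given peptide under the standard genetic code. The function and
variable names are hypothetical and chosen only for illustration; this is not a procedure
taken from the present disclosure.

```python
from itertools import product

# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Invert the table: amino acid -> list of synonymous codons.
SYNONYMS = {}
for codon, aa in CODON_TABLE.items():
    SYNONYMS.setdefault(aa, []).append(codon)


def translate(dna):
    """Translate a DNA coding sequence codon by codon (trailing partial codon ignored)."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def degenerate_variants(peptide):
    """Yield every DNA sequence that encodes `peptide` (one-letter amino acid codes)."""
    for codons in product(*(SYNONYMS[aa] for aa in peptide)):
        yield "".join(codons)


if __name__ == "__main__":
    # Met has one codon and Leu has six, so "ML" has six degenerate variants.
    for variant in degenerate_variants("ML"):
        print(variant, translate(variant))
```

The number of variants grows multiplicatively with peptide length, so for real polypeptides
one would normally sample or constrain codon choices rather than enumerate all of them.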

Polynucleotides encoding polypeptides and proteins that are variants of the 
polypeptides and proteins encoded by the polynucleotides and related cDNA and genes are 
also within the scope of the invention. The variants differ from wild type protein in having
one or more amino acid substitutions that enhance, add, or diminish a biological
activity of the wild type protein. Once the amino acid change is selected, a polynucleotide
encoding that variant is constructed according to the invention. 

The following detailed description describes the polynucleotide compositions
encompassed by the invention, methods for obtaining cDNA or genomic DNA encoding a
full-length gene product, expression of these polynucleotides and genes, identification of
structural motifs of the polynucleotides and genes, identification of the function of a gene
product encoded by a gene corresponding to a polynucleotide of the invention, use of the
provided polynucleotides as probes and in mapping and in tissue profiling, use of the
corresponding polypeptides and other gene products to raise antibodies, and use of the
polynucleotides and their encoded gene products for therapeutic and diagnostic purposes.

I. Polynucleotide Compositions 

The scope of the invention with respect to polynucleotide compositions includes, but is not
necessarily limited to, polynucleotides having a sequence set forth in any one of SEQ ID
NOS: 1-844; polynucleotides obtained from the biological materials described herein or other
biological sources (particularly human sources) by hybridization under stringent conditions
(particularly conditions of high stringency); genes corresponding to the provided
polynucleotides; variants of the provided polynucleotides and their corresponding genes,
particularly those variants that retain a biological activity of the encoded gene product (e.g.,
a biological activity ascribed to a gene product corresponding to the provided
polynucleotides as a result of the assignment of the gene product to a protein family(ies)
and/or identification of a functional domain present in the gene product). Other nucleic acid
compositions contemplated by and within the scope of the present invention will be readily
apparent to one of ordinary skill in the art when provided with the disclosure here.

The invention features polynucleotides that are expressed in cells of human tissue,
specifically human colon, breast, and/or lung tissue. Novel nucleic acid compositions of the
invention of particular interest comprise a sequence set forth in any one of SEQ ID NOS: 1-844
or an identifying sequence thereof. An "identifying sequence" is a contiguous sequence
of residues at least about 10 nt to about 20 nt in length, usually at least about 50 nt to about
100 nt in length, that uniquely identifies a polynucleotide sequence, e.g., exhibits less than
90%, usually less than about 80% to about 85% sequence identity to any contiguous
nucleotide sequence of more than about 20 nt. Thus, the subject novel nucleic acid
compositions include full length cDNAs or mRNAs that encompass an identifying sequence
of contiguous nucleotides from any one of SEQ ID NOS: 1-844.

The polynucleotides of the invention also include polynucleotides having sequence
similarity or sequence identity. Nucleic acids having sequence similarity are detected by
hybridization under low stringency conditions, for example, at 50°C and 10XSSC (0.9 M
saline/0.09 M sodium citrate) and remain bound when subjected to washing at 55°C in
1XSSC. Sequence identity can be determined by hybridization under stringent conditions,
for example, at 50°C or higher and 0.1XSSC (9 mM saline/0.9 mM sodium citrate).
Hybridization methods and conditions are well known in the art, see, e.g., U.S. Patent No.
5,707,829. Nucleic acids that are substantially identical to the provided polynucleotide
sequences, e.g., allelic variants, genetically altered versions of the gene, etc., bind to the
provided polynucleotide sequences (SEQ ID NOS: 1-844) under stringent hybridization
conditions. By using probes, particularly labeled probes of DNA sequences, one can isolate
homologous or related genes. The source of homologous genes can be any species, e.g.,
primate species, particularly human; rodents, such as rats and mice; canines, felines, bovines,
ovines, equines, yeast, nematodes, etc.

Preferably, hybridization is performed using at least 15 contiguous nucleotides of at
least one of SEQ ID NOS: 1-844. That is, when at least 15 contiguous nucleotides of one of
the disclosed SEQ ID NOS. are used as a probe, the probe will preferentially hybridize with a
gene or mRNA (of the biological material) comprising the complementary sequence,
allowing the identification and retrieval of the nucleic acids of the biological material that
uniquely hybridize to the selected probe. Probes from more than one SEQ ID NO. will
hybridize with the same gene or mRNA if the cDNA from which they were derived
corresponds to one mRNA. Probes of more than 15 nucleotides can be used, but 15
nucleotides represents enough sequence for unique identification.
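
The relationship between a probe and its complementary target can be sketched in silico as
follows. This is a simplified model that reports only perfectly complementary sites, whereas
actual hybridization tolerates some mismatches depending on stringency. The function names
and the 15-nucleotide default are illustrative assumptions based on the preference stated above.

```python
# Translation table for Watson-Crick complements of unambiguous DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def probe_sites(probe, target, min_len=15):
    """Return 0-based start positions in `target` (one strand, 5'->3') that are
    perfectly complementary to `probe`."""
    if len(probe) < min_len:
        raise ValueError(f"probe shorter than {min_len} nucleotides")
    site = reverse_complement(probe)  # sequence the probe would pair with
    target = target.upper()
    hits = []
    start = target.find(site)
    while start != -1:
        hits.append(start)
        start = target.find(site, start + 1)  # allow overlapping sites
    return hits


if __name__ == "__main__":
    # Hypothetical 15-nt region embedded in a made-up target strand.
    region = "ATGGCCATTGTAATG"
    target = "GGGG" + region + "CCCC"
    probe = reverse_complement(region)
    print(probe_sites(probe, target))
```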

The polynucleotides of the invention also include naturally occurring variants of the
nucleotide sequences (e.g., degenerate variants, allelic variants, etc.). Variants of the
polynucleotides of the invention are identified by hybridization of putative variants with
nucleotide sequences disclosed herein, preferably by hybridization under stringent conditions.
For example, by using appropriate wash conditions, variants of the polynucleotides of the
invention can be identified where the allelic variant exhibits at most about 25-30% base pair
mismatches relative to the selected polynucleotide probe. In general, allelic variants contain
15-25% base pair mismatches, and can contain as few as 5-15%, or 2-5%, or 1-2%
base pair mismatches, or even a single base-pair mismatch.

The invention also encompasses homologs corresponding to the polynucleotides of
SEQ ID NOS: 1-844, where the source of homologous genes can be any mammalian species,
e.g., primate species, particularly human; rodents, such as rats; canines, felines, bovines,
ovines, equines, yeast, nematodes, etc. Between mammalian species, e.g., human and
mouse, homologs have substantial sequence similarity, e.g., at least 75% sequence identity,
usually at least 90%, more usually at least 95% between nucleotide sequences. Sequence
similarity is calculated based on a reference sequence, which may be a subset of a larger
sequence, such as a conserved motif, coding region, flanking region, etc. A reference
sequence will usually be at least about 18 contiguous nt long, more usually at least about 30
nt long, and may extend to the complete sequence that is being compared. Algorithms for
sequence analysis are known in the art, such as BLAST, described in Altschul et al., J. Mol.
Biol. (1990) 215:403-10.

In general, variants of the invention have a sequence identity of at least
about 65%, preferably at least about 75%, more preferably at least about 85%, and can be
at least about 90% or more, as determined by the Smith-Waterman homology
search algorithm as implemented in MPSRCH program (Oxford Molecular). For the
purposes of this invention, a preferred method of calculating percent identity is the
Smith-Waterman algorithm, applied as follows. Global DNA sequence identity must be greater
than 65% as determined by the Smith-Waterman homology search algorithm as implemented
in MPSRCH program (Oxford Molecular) using an affine gap search with the following
search parameters: gap open penalty, 12; and gap extension penalty, 1.
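
The effect of the stated gap parameters can be illustrated with a minimal sketch of
Smith-Waterman local alignment with affine gaps. This is not the MPSRCH implementation.
The match and mismatch scores (+5 and -4), the function name, and the convention that the
first gapped position costs the gap open penalty while each further position costs the gap
extension penalty are all assumptions made for illustration.

```python
def smith_waterman_affine(a, b, match=5, mismatch=-4, gap_open=12, gap_extend=1):
    """Return the best local alignment score of sequences a and b.

    Uses three dynamic-programming matrices (Gotoh-style recurrences):
      H[i][j] best score of an alignment ending at a[i-1], b[j-1]
      E[i][j] best score ending with a gap in a (b[j-1] is unpaired)
      F[i][j] best score ending with a gap in b (a[i-1] is unpaired)
    """
    neg_inf = float("-inf")
    n, m = len(a), len(b)
    H = [[0] * (m + 1) for _ in range(n + 1)]
    E = [[neg_inf] * (m + 1) for _ in range(n + 1)]
    F = [[neg_inf] * (m + 1) for _ in range(n + 1)]
    best = 0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            # Open a new gap from H, or extend an existing gap.
            E[i][j] = max(H[i][j - 1] - gap_open, E[i][j - 1] - gap_extend)
            F[i][j] = max(H[i - 1][j] - gap_open, F[i - 1][j] - gap_extend)
            score = match if a[i - 1] == b[j - 1] else mismatch
            # Local alignment: scores never drop below zero.
            H[i][j] = max(0, H[i - 1][j - 1] + score, E[i][j], F[i][j])
            best = max(best, H[i][j])
    return best


if __name__ == "__main__":
    # Two made-up sequences that differ by a two-base insertion.
    print(smith_waterman_affine("ACGTACGTGGCCGGCC", "ACGTACGTTTGGCCGGCC"))
```

With these assumed scores, a two-base gap costs 13 (12 to open plus 1 to extend), so the
alignment bridges the insertion only when the flanking matches outweigh that cost. Percent
identity would then be computed from a traceback of the optimal alignment, which this
sketch omits.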

The subject nucleic acids can be cDNAs or genomic DNAs, as well as fragments
thereof, particularly fragments that encode a biologically active gene product and/or are
useful in the methods disclosed herein (e.g., in diagnosis, as a unique identifier of a
differentially expressed gene of interest, etc.). The term "cDNA" as used herein is intended
to include all nucleic acids that share the arrangement of sequence elements found in native
mature mRNA species, where sequence elements are exons and 3' and 5' non-coding
regions. Normally mRNA species have contiguous exons, with the intervening introns,
when present, being removed by nuclear RNA splicing, to create a continuous open reading
frame encoding a polypeptide of the invention.

A genomic sequence of interest comprises the nucleic acid present between the
initiation codon and the stop codon, as defined in the listed sequences, including all of the
introns that are normally present in a native chromosome. It can further include the 3' and 5'
untranslated regions found in the mature mRNA. It can further include specific
transcriptional and translational regulatory sequences, such as promoters, enhancers, etc.,
including about 1 kb, but possibly more, of flanking genomic DNA at either the 5' or 3' end
of the transcribed region. The genomic DNA can be isolated as a fragment of 100 kbp or
smaller, and substantially free of flanking chromosomal sequence. The genomic DNA
flanking the coding region, either 3' or 5', or internal regulatory sequences as sometimes
found in introns, contains sequences required for proper tissue, stage-specific, or
disease-state specific expression.

The nucleic acid compositions of the subject invention can encode all or a part of the
subject differentially expressed polypeptides. Double or single stranded fragments can be
obtained from the DNA sequence by chemically synthesizing oligonucleotides in accordance
with conventional methods, by restriction enzyme digestion, by PCR amplification, etc.
Isolated polynucleotides and polynucleotide fragments of the invention comprise at least
about 10, about 15, about 20, about 35, about 50, about 100, about 150 to about 200, about
250 to about 300, or about 350 contiguous nucleotides selected from the polynucleotide
sequences as shown in SEQ ID NOS: 1-844. For the most part, fragments will be of at least
15 nt, usually at least 18 nt or 25 nt, and up to at least about 50 contiguous nt in length or
more. In a preferred embodiment, the polynucleotide molecules comprise a contiguous
sequence of at least twelve nucleotides selected from the group consisting of the
polynucleotides shown in SEQ ID NOS: 1-844.

Probes specific to the polynucleotides of the invention can be generated using the
polynucleotide sequences disclosed in SEQ ID NOS: 1-844. The probes are preferably
fragments of at least about 12, 15, 16, 18, 20, 22, 24, or 25 nucleotides of a corresponding
contiguous sequence of SEQ ID NOS: 1-844, and can be less than 2, 1, 0.5, 0.1, or 0.05 kb in
length. The probes can be synthesized chemically or can be generated from longer
polynucleotides using restriction enzymes. The probes can be labeled, for example, with a
radioactive, biotinylated, or fluorescent tag. Preferably, probes are designed based upon an
identifying sequence of a polynucleotide of one of SEQ ID NOS: 1-844. More preferably,
probes are designed based on a contiguous sequence of one of the subject polynucleotides
that remains unmasked following application of a masking program for masking low
complexity (e.g., XBLAST) to the sequence, i.e., one would select an unmasked region, as
indicated by the polynucleotides outside the poly-n stretches of the masked sequence
produced by the masking program.
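
Selecting an unmasked region for probe design can be sketched as follows, assuming the
masking program has replaced low-complexity stretches with runs of "N" (or "n"). The
function names, the default lengths, and the rule of taking the probe from the start of the
longest unmasked run are illustrative assumptions, not part of any masking program's
interface.

```python
import re


def unmasked_regions(masked_seq, min_len=12):
    """Return (start, end, subsequence) for each run without N/n of at least min_len."""
    return [
        (m.start(), m.end(), m.group())
        for m in re.finditer(r"[^Nn]+", masked_seq)
        if len(m.group()) >= min_len
    ]


def pick_probe(masked_seq, probe_len=25):
    """Return a candidate probe of probe_len nt from the longest unmasked run,
    or None when no unmasked run is long enough."""
    regions = unmasked_regions(masked_seq, min_len=probe_len)
    if not regions:
        return None
    _start, _end, seq = max(regions, key=lambda r: r[1] - r[0])
    return seq[:probe_len]


if __name__ == "__main__":
    # Made-up masked sequence: two unmasked runs separated by poly-N stretches.
    masked = "ACGTACGTACGTAC" + "N" * 10 + "GATTACA" * 4 + "NNNN" + "ACG"
    print(unmasked_regions(masked))
    print(pick_probe(masked))
```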

The polynucleotides of the subject invention are isolated and obtained in substantial
purity, generally as other than an intact chromosome. Usually, the polynucleotides, either as
DNA or RNA, will be obtained substantially free of other naturally-occurring nucleic acid
sequences, generally being at least about 50%, usually at least about 90% pure and are
typically "recombinant", e.g., flanked by one or more nucleotides with which it is not
normally associated on a naturally occurring chromosome.

The polynucleotides of the invention can be provided as a linear molecule or within a
circular molecule. They can be provided within autonomously replicating molecules
(vectors) or within molecules without replication sequences. They can be regulated by their
own or by other regulatory sequences, as is known in the art. The polynucleotides of the
invention can be introduced into suitable host cells using a variety of techniques which are
available in the art, such as transferrin polycation-mediated DNA transfer, transfection with
naked or encapsulated nucleic acids, liposome-mediated DNA transfer, intracellular
transportation of DNA-coated latex beads, protoplast fusion, viral infection, electroporation,
gene gun, calcium phosphate-mediated transfection, and the like.

The subject nucleic acid compositions can be used to, for example, produce
polypeptides, as probes for the detection of mRNA of the invention in biological samples
(e.g., extracts of human cells), to generate additional copies of the polynucleotides, to
generate ribozymes or antisense oligonucleotides, and as single stranded DNA probes or as
triple-strand forming oligonucleotides. The probes described herein can be used to, for
example, determine the presence or absence of the polynucleotide sequences as shown in
SEQ ID NOS: 1-844 or variants thereof in a sample. These and other uses are described in
more detail below.

Use of Polynucleotides to Obtain Full-Length cDNA and Full-Length Human Gene and
Promoter Region

Full-length cDNA molecules comprising the disclosed polynucleotides are obtained
as follows. A polynucleotide having a sequence of one of SEQ ID NOS: 1-844, or a portion
thereof comprising at least 12, 15, 18, or 20 nucleotides, is used as a hybridization probe to
detect hybridizing members of a cDNA library using probe design methods, cloning
methods, and clone selection techniques such as those described in U.S. Patent No.
5,654,173. Libraries of cDNA are made from selected tissues, such as normal or tumor
tissue, or from tissues of a mammal treated with, for example, a pharmaceutical agent.
Preferably, the tissue is the same as the tissue from which the polynucleotides of the
invention were isolated, as both the polynucleotides described herein and the cDNA
represent expressed genes. Most preferably, the cDNA library is made from the biological
material described herein in the Examples. Alternatively, many cDNA libraries are available
commercially (Sambrook et al., Molecular Cloning: A Laboratory Manual 2nd Ed., (1989)
Cold Spring Harbor Press, Cold Spring Harbor, NY). The choice of cell type for library
construction can be made after the identity of the protein encoded by the gene corresponding
to the polynucleotide of the invention is known. This will indicate which tissue and cell
types are likely to express the related gene, and thus represent a suitable source for the
mRNA for generating the cDNA. Where the provided polynucleotides are isolated from
cDNA libraries, the libraries are prepared from mRNA of human colon cells, more
preferably, human colon cancer cells, even more preferably, from a highly metastatic colon
cell, Km12L4-A.

Techniques for producing and probing nucleic acid sequence libraries are described,
for example, in Sambrook et al., Molecular Cloning: A Laboratory Manual 2nd Ed., (1989)
Cold Spring Harbor Press, Cold Spring Harbor, NY. The cDNA can be prepared by using
primers based on sequence from SEQ ID NOS: 1-844. In one embodiment, the cDNA library
can be made from only poly-adenylated mRNA. Thus, poly-T primers can be used to
prepare cDNA from the mRNA.

Members of the library that are larger than the provided polynucleotides, and
preferably that encompass the complete coding sequence of the native message, are obtained.
In order to confirm that the entire cDNA has been obtained, RNA protection experiments
are performed as follows. Hybridization of a full-length cDNA to an mRNA will protect the
RNA from RNase degradation. If the cDNA is not full length, then the portions of the
mRNA that are not hybridized will be subject to RNase degradation. This is assayed, as is
known in the art, by changes in electrophoretic mobility on polyacrylamide gels, or by
detection of released monoribonucleotides. Sambrook et al., Molecular Cloning: A
Laboratory Manual 2nd Ed., (1989) Cold Spring Harbor Press, Cold Spring Harbor, NY. In
order to obtain additional sequences 5' to the end of a partial cDNA, 5' RACE (PCR
Protocols: A Guide to Methods and Applications, (1990) Academic Press, Inc.) is performed.

Genomic DNA is isolated using the provided polynucleotides in a manner similar to
the isolation of full-length cDNAs. Briefly, the provided polynucleotides, or portions
thereof, are used as probes to libraries of genomic DNA. Preferably, the library is obtained
from the cell type that was used to generate the polynucleotides of the invention, but this is
not essential. Most preferably, the genomic DNA is obtained from the biological material
described herein in the Examples. Such libraries can be in vectors suitable for carrying large
segments of a genome, such as P1 or YAC, as described in detail in Sambrook et al.,
9.4-9.30. In addition, genomic sequences can be isolated from human BAC libraries, which are
commercially available from Research Genetics, Inc., Huntsville, Alabama, USA, for
example. In order to obtain additional 5' or 3' sequences, chromosome walking is performed,
as described in Sambrook et al., such that adjacent and overlapping fragments of genomic
DNA are isolated. These are mapped and pieced together, as is known in the art, using
restriction digestion enzymes and DNA ligase.

Using the polynucleotide sequences of the invention, corresponding full-length genes 
can be isolated using both classical and PCR methods to construct and probe cDNA libraries. 
Using either method, Northern blots, preferably, are performed on a number of cell types to 
determine which cell lines express the gene of interest at the highest level. Classical 
methods of constructing cDNA libraries are taught in Sambrook et al., supra. With these 
methods, cDNA can be produced from mRNA and inserted into viral or expression vectors. 
Typically, libraries of mRNA comprising poly(A) tails can be produced with poly(T) 
primers. Similarly, cDNA libraries can be produced using the instant sequences as primers. 

PCR methods are used to amplify the members of a cDNA library that comprise the
desired insert. In this case, the desired insert will contain sequence from the full length
cDNA that corresponds to the instant polynucleotides. Such PCR methods include gene
trapping and RACE methods. Gene trapping entails inserting a member of a cDNA library
into a vector. The vector then is denatured to produce single stranded molecules. Next, a
substrate-bound probe, such as a biotinylated oligo, is used to trap cDNA inserts of interest.
Biotinylated probes can be linked to an avidin-bound solid substrate. PCR methods can be
used to amplify the trapped cDNA. To trap sequences corresponding to the full length
genes, the labeled probe sequence is based on the polynucleotide sequences of the invention.
Random primers or primers specific to the library vector can be used to amplify the trapped
cDNA. Such gene trapping techniques are described in Gruber et al., WO 95/04745 and
Gruber et al., U.S. Pat. No. 5,500,356. Kits are commercially available to perform gene
trapping experiments from, for example, Life Technologies, Gaithersburg, Maryland, USA.

"Rapid amplification of cDNA ends", or RACE, is a PCR method of amplifying
cDNAs from a number of different RNAs. The cDNAs are ligated to an oligonucleotide
linker, and amplified by PCR using two primers. One primer is based on sequence from the
instant polynucleotides, for which full length sequence is desired, and a second primer
comprises sequence that hybridizes to the oligonucleotide linker to amplify the cDNA. A
description of this method is reported in WO 97/19110. In preferred embodiments of
RACE, a common primer is designed to anneal to an arbitrary adaptor sequence ligated to
cDNA ends (Apte and Siebert, Biotechniques (1993) 15:890-893; Edwards et al., Nuc. Acids
Res. (1991) 19:5227-5232). When a single gene-specific RACE primer is paired with the
common primer, preferential amplification of sequences between the single gene specific
primer and the common primer occurs. Commercial cDNA pools modified for use in RACE
are available.

Another PCR-based method generates a full-length cDNA library with anchored ends
without needing specific knowledge of the cDNA sequence. The method uses lock-docking
primers (I-VI), where one primer, polyTV (I-III), locks over the polyA tail of eukaryotic
mRNA producing first strand synthesis and a second primer, polyGH (IV-VI), locks onto the
polyC tail added by terminal deoxynucleotidyl transferase (TdT). This method is described
in WO 96/40998.

The promoter region of a gene generally is located 5' to the initiation site for RNA
polymerase II. Hundreds of promoter regions contain the "TATA" box, a sequence such as
TATTA or TATAA, which is sensitive to mutations. The promoter region can be obtained
by performing 5' RACE using a primer from the coding region of the gene. Alternatively,
the cDNA can be used as a probe for the genomic sequence, and the region 5' to the coding
region is identified by "walking up." If the gene is highly expressed or differentially
expressed, the promoter from the gene can be of use in a regulatory construct for a
heterologous gene.
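
Once a candidate upstream region has been obtained, the two TATA-like motifs named above
(TATAA and TATTA) can be located with a simple scan. The sketch below is illustrative
only: the function name is hypothetical, and a motif match alone does not establish that a
region functions as a promoter.

```python
import re

# Lookahead so that overlapping occurrences are all reported.
TATA_LIKE = re.compile(r"(?=(TAT[AT]A))")


def find_tata_like(upstream):
    """Return (0-based position, motif) for each TATAA or TATTA in `upstream`."""
    return [(m.start(), m.group(1)) for m in TATA_LIKE.finditer(upstream.upper())]


if __name__ == "__main__":
    # Made-up upstream sequence for demonstration.
    print(find_tata_like("GGCTATAAAAGGC"))
```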

Once the full-length cDNA or gene is obtained, DNA encoding variants can be
prepared by site-directed mutagenesis, described in detail in Sambrook et al., 15.3-15.63.
The choice of codon or nucleotide to be replaced can be based on disclosure herein on
optional changes in amino acids to achieve altered protein structure and/or function.

As an alternative method to obtaining DNA or RNA from a biological material,
nucleic acid comprising nucleotides having the sequence of one or more polynucleotides of
the invention can be synthesized. Thus, the invention encompasses nucleic acid molecules
ranging in length from 15 nucleotides (corresponding to at least 15 contiguous nucleotides of
one of SEQ ID NOS: 1-844) up to a maximum length suitable for one or more biological
manipulations, including replication and expression, of the nucleic acid molecule. The
invention includes but is not limited to (a) nucleic acid having the size of a full gene, and
comprising at least one of SEQ ID NOS: 1-844; (b) the nucleic acid of (a) also comprising at
least one additional gene, operably linked to permit expression of a fusion protein; (c) an
expression vector comprising (a) or (b); (d) a plasmid comprising (a) or (b); and (e) a
recombinant viral particle comprising (a) or (b). Once provided with the polynucleotides
disclosed herein, construction or preparation of (a)-(e) is well within the skill in the art.

The sequence of a nucleic acid comprising at least 15 contiguous nucleotides of at
least any one of SEQ ID NOS: 1-844, preferably the entire sequence of at least any one of
SEQ ID NOS: 1-844, is not limited and can be any sequence of A, T, G, and/or C (for DNA)
and A, U, G, and/or C (for RNA) or modified bases thereof, including inosine and
pseudouridine. The choice of sequence will depend on the desired function and can be
dictated by coding regions desired, the intron-like regions desired, and the regulatory regions
desired. Where the entire sequence of any one of SEQ ID NOS: 1-844 is within the nucleic
acid, the nucleic acid obtained is referred to herein as a polynucleotide comprising the
sequence of any one of SEQ ID NOS: 1-844.

II. Expression of Polypeptide Encoded by Full-Length cDNA or Full-Length Gene

The provided polynucleotide (e.g., a polynucleotide having a sequence of one of SEQ
ID NOS: 1-844), the corresponding cDNA, or the full-length gene is used to express a partial
or complete gene product.

Constructs of polynucleotides having sequences of SEQ ID NOS: 1-844 can be
generated synthetically. Alternatively, single-step assembly of a gene and entire plasmid
from large numbers of oligodeoxyribonucleotides is described by, e.g., Stemmer et al., Gene
(Amsterdam) (1995) 164(1):49-53. This method, assembly PCR, synthesizes long
DNA sequences from large numbers of oligodeoxyribonucleotides (oligos).
The method is derived from DNA shuffling (Stemmer, Nature (1994) 370:389-391), and
does not rely on DNA ligase, but instead relies on DNA polymerase to build increasingly
longer DNA fragments during the assembly process. For example, a 1.1-kb fragment
containing the TEM-1 beta-lactamase-encoding gene (bla) can be assembled in a single
reaction from a total of 56 oligos, each 40 nucleotides (nt) in length. The synthetic gene can
be PCR amplified and cloned in a vector containing the tetracycline-resistance gene (Tc-R)
as the sole selectable marker. Without relying on ampicillin (Ap) selection, 76% of the Tc-R
colonies were Ap-R, making this approach a general method for the rapid and cost-effective
synthesis of any gene.
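
By way of illustration only, the oligo set for an assembly PCR of this kind can be sketched as follows. The tiling scheme (oligos alternating between strands, each offset from the previous one by half an oligo length) and the helper names design_assembly_oligos and reverse_complement are assumptions made for this sketch; the cited reference should be consulted for the actual oligo design.

```python
# Illustrative sketch only: tile a target sequence with overlapping oligos
# for assembly PCR. The half-length offset scheme is an assumption.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def design_assembly_oligos(target, oligo_len=40):
    """Return oligos that alternate between strands and overlap by half their length.

    Oligos at even indices are taken from the top strand; oligos at odd
    indices are the reverse complement of the corresponding window, so that
    each oligo can anneal to its neighbors and be extended by polymerase.
    The final oligo may be shorter when the target length is not a multiple
    of half the oligo length.
    """
    half = oligo_len // 2
    oligos = []
    for index, start in enumerate(range(0, len(target) - half, half)):
        window = target[start:start + oligo_len]
        oligos.append(window if index % 2 == 0 else reverse_complement(window))
    return oligos
```

Under this hypothetical tiling, a 1,140-nt target yields 56 oligos of 40 nt each, every oligo sharing 20 nt of complementarity with each neighbor.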

Appropriate polynucleotide constructs are purified using standard recombinant DNA
techniques as described in, for example, Sambrook et al., Molecular Cloning: A Laboratory
Manual, 2nd Ed. (1989) Cold Spring Harbor Press, Cold Spring Harbor, NY, and under
current regulations described in United States Dept. of HHS, National Institutes of Health
(NIH) Guidelines for Recombinant DNA Research. The gene product encoded by a
polynucleotide of the invention is expressed in any expression system, including, for
example, bacterial, yeast, insect, amphibian and mammalian systems. Suitable vectors and
host cells are described in U.S. Patent No. 5,654,173.

Bacteria. Expression systems in bacteria include those described in Chang et al.,
Nature (1978) 275:615; Goeddel et al., Nature (1979) 281:544; Goeddel et al., Nucleic Acids
Res. (1980) 8:4057; EP 0 036,776; U.S. Patent No. 4,551,433; DeBoer et al., Proc. Natl.
Acad. Sci. (USA) (1983) 80:21-25; and Siebenlist et al., Cell (1980) 20:269.

Yeast. Expression systems in yeast include those described in Hinnen et al., Proc.
Natl. Acad. Sci. (USA) (1978) 75:1929; Ito et al., J. Bacteriol. (1983) 153:163; Kurtz et al.,
Mol. Cell. Biol. (1986) 6:142; Kunze et al., J. Basic Microbiol. (1985) 25:141; Gleeson et
al., J. Gen. Microbiol. (1986) 132:3459; Roggenkamp et al., Mol. Gen. Genet. (1986)
202:302; Das et al., J. Bacteriol. (1984) 158:1165; De Louvencourt et al., J. Bacteriol.
(1983) 154:737; Van den Berg et al., Bio/Technology (1990) 8:135; Kunze et al., J. Basic
Microbiol. (1985) 25:141; Cregg et al., Mol. Cell. Biol. (1985) 5:3376; U.S. Patent Nos.
4,837,148 and 4,929,555; Beach and Nurse, Nature (1981) 300:106; Davidow et al., Curr.
Genet. (1985) 10:380; Gaillardin et al., Curr. Genet. (1985) 10:49; Ballance et al., Biochem.
Biophys. Res. Commun. (1983) 112:284-289; Tilburn et al., Gene (1983) 26:205-221; Yelton
et al., Proc. Natl. Acad. Sci. (USA) (1984) 81:1470-1474; Kelly and Hynes, EMBO J. (1985)
4:475-479; EP 0 244,234; and WO 91/00357.

Insect Cells. Expression of heterologous genes in insects is accomplished as
described in U.S. Patent No. 4,745,051; Friesen et al., "The Regulation of Baculovirus Gene
Expression", in: The Molecular Biology of Baculoviruses (1986) (W. Doerfler, ed.); EP 0
127,839; EP 0 155,476; and Vlak et al., J. Gen. Virol. (1988) 69:165-116; Miller et al., Ann.
Rev. Microbiol. (1988) 42:111; Carbonell et al., Gene (1988) 73:409; Maeda et al., Nature
(1985) 315:592-594; Lebacq-Verheyden et al., Mol. Cell. Biol. (1988) 8:3129; Smith et al.,
Proc. Natl. Acad. Sci. (USA) (1985) 82:8844; Miyajima et al., Gene (1987) 55:273; and
Martin et al., DNA (1988) 7:99. Numerous baculoviral strains and variants and
corresponding permissive insect host cells from hosts are described in Luckow et al.,
Bio/Technology (1988) 6:47-55, Miller et al., Genetic Engineering (1986) 8:277-279, and
Maeda et al., Nature (1985) 315:592-594.

Mammalian Cells. Mammalian expression is accomplished as described in Dijkema
et al., EMBO J. (1985) 4:761, Gorman et al., Proc. Natl. Acad. Sci. (USA) (1982) 79:6777,
Boshart et al., Cell (1985) 41:521 and U.S. Patent No. 4,399,216. Other features of
mammalian expression are facilitated as described in Ham and Wallace, Meth. Enz. (1979)
58:44, Barnes and Sato, Anal. Biochem. (1980) 102:255, U.S. Patent Nos. 4,767,704,
4,657,866, 4,927,762, 4,560,655, WO 90/103430, WO 87/00195, and U.S. RE 30,985.

Polynucleotide molecules comprising a polynucleotide sequence provided herein
are propagated by placing the molecule in a vector. Viral and non-viral vectors are used,
including plasmids. The choice of plasmid will depend on the type of cell in which
propagation is desired and the purpose of propagation. Certain vectors are useful for
amplifying and making large amounts of the desired DNA sequence. Other vectors are
suitable for expression in cells in culture. Still other vectors are suitable for transfer and
expression in cells in a whole animal or person. The choice of appropriate vector is well
within the skill of the art. Many such vectors are available commercially. The partial or
full-length polynucleotide is inserted into a vector typically by means of DNA ligase
attachment to a cleaved restriction enzyme site in the vector. Alternatively, the desired
nucleotide sequence can be inserted by homologous recombination in vivo. Typically this is
accomplished by attaching regions of homology to the vector on the flanks of the desired
nucleotide sequence. Regions of homology are added by ligation of oligonucleotides, or by
polymerase chain reaction using primers comprising both the region of homology and a
portion of the desired nucleotide sequence, for example.
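
As a non-limiting sketch of the PCR approach just described, primers comprising both a region of homology and a portion of the desired nucleotide sequence might be composed as follows. The function name homology_primers, the 20-nt annealing length, and the convention that both homology arms are given as top-strand sequence are assumptions made for illustration.

```python
# Illustrative sketch only: compose PCR primers that append vector homology
# arms to the flanks of a desired insert. All names and lengths are hypothetical.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def homology_primers(insert, left_arm, right_arm, anneal_len=20):
    """Return (forward, reverse) primers, each written 5' to 3'.

    The forward primer is the left homology arm followed by the first
    anneal_len bases of the insert. The reverse primer is the reverse
    complement of the last anneal_len bases of the insert followed by the
    right homology arm, so the amplified product carries both arms.
    """
    forward = left_arm + insert[:anneal_len]
    reverse = reverse_complement(insert[-anneal_len:] + right_arm)
    return forward, reverse
```

Amplifying the insert with such a primer pair would yield a product flanked by both regions of homology, suitable in principle for recombination into the vector.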

The polynucleotides set forth in SEQ ID NOS: 1-844 or their corresponding
full-length polynucleotides are linked to regulatory sequences as appropriate to obtain the desired
expression properties. These can include promoters (attached either at the 5' end of the sense
strand or at the 3' end of the antisense strand), enhancers, terminators, operators, repressors,
and inducers. The promoters can be regulated or constitutive. In some situations it may be
desirable to use conditionally active promoters, such as tissue-specific or developmental
stage-specific promoters. These are linked to the desired nucleotide sequence using the
techniques described above for linkage to vectors. Any techniques known in the art can be
used.

When any of the above host cells, or other appropriate host cells or organisms, are
used to replicate and/or express the polynucleotides or nucleic acids of the invention, the
resulting replicated nucleic acid, RNA, expressed protein or polypeptide, is within the scope
of the invention as a product of the host cell or organism. The product is recovered by any
appropriate means known in the art.

Once the gene corresponding to a selected polynucleotide is identified, its expression
can be regulated in the cell to which the gene is native. For example, an endogenous gene of
a cell can be regulated by an exogenous regulatory sequence as disclosed in U.S. Patent No.
5,641,670.

III. Identification of Functional and Structural Motifs of Novel Genes

A. Screening Polynucleotide Sequences and Amino Acid Sequences Against
Publicly Available Databases

Translations of the nucleotide sequence of the provided polynucleotides, cDNAs or
full genes can be aligned with individual known sequences. Similarity with individual
sequences can be used to determine the activity of the polypeptides encoded by the
polynucleotides of the invention. For example, sequences that show similarity with a
chemokine sequence can exhibit chemokine activities. Also, sequences exhibiting similarity
with more than one individual sequence can exhibit activities that are characteristic of either
or both individual sequences.

The full length sequences and fragments of the polynucleotide sequences of the
nearest neighbors can be used as probes and primers to identify and isolate the full length
sequence corresponding to provided polynucleotides. The nearest neighbors can indicate a
tissue or cell type to be used to construct a library for the full-length sequences
corresponding to the provided polynucleotides.

Typically, a selected polynucleotide is translated in all six frames to determine the
best alignment with the individual sequences. The sequences disclosed herein in the
Sequence Listing are in a 5' to 3' orientation and translation in three frames can be sufficient
(with a few specific exceptions as described in the Examples). These amino acid sequences
are referred to, generally, as query sequences, which will be aligned with the individual
sequences. Databases with individual sequences are described in "Computer Methods for
Macromolecular Sequence Analysis" Methods in Enzymology (1996) 266, Doolittle,
Academic Press, Inc., a division of Harcourt Brace & Co., San Diego, California, USA.
Databases include Genbank, EMBL, and DNA Database of Japan (DDBJ).
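
For illustration, translation of a polynucleotide in all six frames can be sketched as below. The sketch assumes the standard genetic code and plain uppercase A/C/G/T input; the function names and the "+1" to "-3" frame labels are hypothetical conventions chosen for this example, and codons containing any other character are reported as "X".

```python
# Illustrative sketch only: six-frame translation using the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def translate(seq):
    """Translate complete codons of seq; '*' marks a stop, 'X' an unknown codon."""
    codons = (seq[i:i + 3] for i in range(0, len(seq) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


def six_frame_translations(dna):
    """Return a dict mapping frame labels '+1'..'+3' and '-1'..'-3' to peptides."""
    dna = dna.upper()
    reverse = dna.translate(COMPLEMENT)[::-1]
    frames = {}
    for offset in range(3):
        frames[f"+{offset + 1}"] = translate(dna[offset:])
        frames[f"-{offset + 1}"] = translate(reverse[offset:])
    return frames
```

Where a sequence is known to be in the 5' to 3' orientation, only the three "+" frames of the returned dictionary would need to be carried forward as query sequences.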

Query and individual sequences can be aligned using the methods and computer
programs described above, and include BLAST, available over the world wide web at
http://www.ncbi.nlm.nih.gov/BLAST/. Another alignment algorithm is Fasta, available in the
Genetics Computer Group (GCG) package, Madison, Wisconsin, USA, a wholly owned
subsidiary of Oxford Molecular Group, Inc. Other techniques for alignment are described in
Doolittle, supra. Preferably, an alignment program that permits gaps in the sequence is
utilized to align the sequences. The Smith-Waterman algorithm is one type of algorithm that
permits gaps in sequence alignments. See Meth. Mol. Biol. (1997) 70:173-187. Also, the GAP
program using the Needleman and Wunsch alignment method can be utilized to align
sequences. An alternative search strategy uses MPSRCH software, which runs on a
MASPAR computer. MPSRCH uses a Smith-Waterman algorithm to score sequences on a
massively parallel computer. This approach improves the ability to identify sequences that are
distantly related matches, and is especially tolerant of small gaps and nucleotide sequence
errors. Amino acid sequences encoded by the provided polynucleotides can be used to
search both protein and DNA databases.
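
A minimal sketch of Smith-Waterman local alignment scoring, which permits gaps, is given below for illustration. The scoring values (match +2, mismatch -1, linear gap penalty -2) are arbitrary assumptions; searches of protein databases ordinarily use a substitution matrix and affine gap penalties, neither of which is modeled here.

```python
# Illustrative sketch only: Smith-Waterman local alignment score with a
# simple match/mismatch scheme and a linear gap penalty (assumed values).
def smith_waterman_score(a, b, match=2, mismatch=-1, gap=-2):
    """Return the best local alignment score between sequences a and b."""
    previous = [0] * (len(b) + 1)
    best = 0
    for i in range(1, len(a) + 1):
        current = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            diagonal = previous[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            # A cell never drops below zero, which is what makes the alignment local.
            current[j] = max(0, diagonal, previous[j] + gap, current[j - 1] + gap)
            best = max(best, current[j])
        previous = current
    return best
```

With these assumed scores, two sequences that differ only by a single inserted residue still align across the insertion, at the cost of one gap penalty.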

Results of individual and query sequence alignments can be divided into three
categories: high similarity, weak similarity, and no similarity. Individual alignment results
ranging from high similarity to weak similarity provide a basis for determining polypeptide
activity and/or structure. Parameters for categorizing individual results include: percentage
of the alignment region length where the strongest alignment is found, percent sequence
identity, and p value.

The percentage of the alignment region length is calculated by counting the number
of residues of the individual sequence found in the region of strongest alignment, e.g., the
contiguous region of the individual sequence that contains the greatest number of residues
that are identical to the residues of the corresponding region of the aligned query sequence.
This number is divided by the total residue length of the query sequence to calculate a
percentage. For example, a query sequence of 20 amino acid residues might be aligned with
a 20 amino acid region of an individual sequence. The individual sequence might be
identical to amino acid residues 5, 9-15, and 17-19 of the query sequence. The region of
strongest alignment is thus the region stretching from residue 9-19, an 11 amino acid stretch.
The percentage of the alignment region length is: 11 (length of the region of strongest
alignment) divided by 20 (query sequence length), or 55%.
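
The arithmetic of this example, together with the percent sequence identity defined in the paragraph that follows, can be sketched as below. The function name alignment_metrics is hypothetical, and the sketch assumes that the boundaries of the region of strongest alignment have already been chosen (residues 9-19 in the example) rather than attempting to locate that region.

```python
# Illustrative sketch only: the two alignment parameters for a region of
# strongest alignment whose boundaries are supplied by the caller.
def alignment_metrics(query_length, region_start, region_end, identical_positions):
    """Return (percent of alignment region length, percent sequence identity).

    Positions are 1-based residue numbers of the query sequence, and the
    region boundaries are inclusive.
    """
    region_length = region_end - region_start + 1
    matches_in_region = sum(
        region_start <= position <= region_end for position in identical_positions
    )
    percent_region_length = 100.0 * region_length / query_length
    percent_identity = 100.0 * matches_in_region / region_length
    return percent_region_length, percent_identity


# Worked example from the text: identities at residues 5, 9-15, and 17-19.
identical = [5] + list(range(9, 16)) + list(range(17, 20))
region_pct, identity_pct = alignment_metrics(20, 9, 19, identical)
```

For the example in the text this gives 55% for the alignment region length (11 of 20 residues) and approximately 90.9% sequence identity (10 matches within the 11-residue region); the match at residue 5 lies outside the region and is not counted.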

Percent sequence identity is calculated by counting the number of amino acid
matches between the query and individual sequence and dividing the total number of matches by
the number of residues of the individual sequences found in the region of strongest
alignment. Thus, the percent identity in the example above would be 10 matches divided by
11 amino acids, or approximately 90.9%.

P value is the probability that the alignment was produced by chance. For a single
alignment, the p value can be calculated according to Karlin et al., Proc. Natl. Acad. Sci.
(1990) 87:2264 and Karlin et al., Proc. Natl. Acad. Sci. (1993) 90. The p value of multiple
alignments using the same query sequence can be calculated using a heuristic approach
described in Altschul et al., Nat. Genet. (1994) 6:119. Alignment programs such as BLAST
can calculate the p value.
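
As a sketch of the single-alignment case, Karlin-Altschul statistics give the expected number of chance alignments scoring at least S as E = K·m·n·exp(-λS), and the p value as P = 1 - exp(-E), where m and n are the lengths of the query and of the database. The parameters λ and K depend on the scoring system and residue composition and must be supplied by the caller; the function name and any values passed to it are assumptions made for illustration.

```python
# Illustrative sketch only: p value of a single alignment score under
# Karlin-Altschul statistics. lam and k are caller-supplied assumptions.
import math


def karlin_altschul_p_value(score, query_len, db_len, lam, k):
    """Return the probability of at least one chance alignment scoring >= score."""
    expected = k * query_len * db_len * math.exp(-lam * score)
    # -expm1(-E) equals 1 - exp(-E) but stays accurate when E is very small.
    return -math.expm1(-expected)
```

Under this formula, higher alignment scores give exponentially smaller p values, which is why the thresholds used to categorize alignments are conveniently expressed as powers of ten.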

Another factor to consider for determining identity or similarity is the location of the
similarity or identity. Strong local alignment can indicate similarity even if the length of
alignment is short. Sequence identity scattered throughout the length of the query sequence
also can indicate a similarity between the query and profile sequences. The boundaries of
the region where the sequences align can be determined according to Doolittle, supra;
BLAST or FASTA programs; or by determining the area where sequence identity is highest.

High Similarity. In general, in alignment results considered to be of high similarity,
the percent of the alignment region length is typically at least about 55% of the total length
of the query sequence; more typically, at least about 58%; even more typically, at least about 60%
of the total residue length of the query sequence. Usually, percent length of the alignment
region can be as much as about 62%; more usually, as much as about 64%; even more
usually, as much as about 66%. Further, for high similarity, the region of alignment
typically exhibits at least about 75% sequence identity; more typically, at least about
78%; even more typically, at least about 80% sequence identity. Usually, percent sequence
identity can be as much as about 82%; more usually, as much as about 84%; even more
usually, as much as about 86%.

The p value is used in conjunction with these methods. If high similarity is found,
the query sequence is considered to have high similarity with a profile sequence when the p
value is less than or equal to about 10⁻²; more usually, less than or equal to about 10⁻³; even
more usually, less than or equal to about 10⁻⁴. More typically, the p value is no more than
about 10⁻⁵; more typically, no more than about 10⁻¹⁰; even more typically, no
more than about 10⁻¹⁵ for the query sequence to be considered high similarity.

Weak Similarity. In general, where alignment results are considered to be of weak
similarity, there is no minimum percent length of the alignment region nor minimum length
of alignment. A better showing of weak similarity is considered when the region of
alignment is, typically, at least about 15 amino acid residues in length; more typically, at
least about 20; even more typically, at least about 25 amino acid residues in length. Usually,
length of the alignment region can be as much as about 30 amino acid residues; more
usually, as much as about 40; even more usually, as much as about 60 amino acid residues.
Further, for weak similarity, the region of alignment typically exhibits at least about 35%
sequence identity; more typically, at least about 40%; even more typically, at least about
45% sequence identity. Usually, percent sequence identity can be as much as about 50%;
more usually, as much as about 55%; even more usually, as much as about 60%.

If weak similarity is found, the query sequence is considered to have weak similarity
with a profile sequence when the p value is usually less than or equal to about 10⁻²; more
usually, less than or equal to about 10⁻³; even more usually, less than or equal to about 10⁻⁴.
More typically, the p value is no more than about 10⁻⁵; more usually, no more than
about 10⁻¹⁰; even more usually, no more than about 10⁻¹⁵ for the query sequence
to be considered weak similarity.
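
One way to apply the three categories is sketched below. Treating the least stringent "typical" values from the text (55% region length and 75% identity for high similarity; 15 residues and 35% identity for weak similarity; p value of 10⁻² or less) as hard cutoffs is an assumption of this sketch, since the text presents them as typical ranges rather than fixed limits. The function name classify_alignment is hypothetical.

```python
# Illustrative sketch only: categorize an alignment using the least stringent
# "typical" values from the text as hard cutoffs (an assumption of this sketch).
def classify_alignment(region_pct, identity_pct, region_len, p_value):
    """Return 'high similarity', 'weak similarity', or 'no similarity'.

    region_pct   -- percent of the query length covered by the strongest region
    identity_pct -- percent sequence identity within that region
    region_len   -- length of the alignment region in amino acid residues
    p_value      -- probability that the alignment arose by chance
    """
    if p_value > 1e-2:
        return "no similarity"
    if region_pct >= 55 and identity_pct >= 75:
        return "high similarity"
    if region_len >= 15 and identity_pct >= 35:
        return "weak similarity"
    return "no similarity"
```

A stricter screen could substitute any of the more stringent values recited above, for example a p value cutoff of 10⁻⁵.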

Similarity Determined by Sequence Identity Alone. Sequence identity alone can be
used to determine similarity of a query sequence to an individual sequence and can indicate
the activity of the sequence. Such an alignment, preferably, permits gaps to align sequences.
Typically, the query sequence is related to the profile sequence if the sequence identity over
the entire query sequence is at least about 15%; more typically, at least about 20%; even
more typically, at least about 25%; even more typically, at least about 50%. Sequence
identity alone as a measure of similarity is most useful when the query sequence is usually
at least 80 residues in length; more usually, 90 residues; even more usually, at least 95 amino
acid residues in length. More typically, similarity can be concluded based on sequence
identity alone when the query sequence is preferably 100 residues in length; more preferably,
120 residues in length; even more preferably, 150 amino acid residues in length.

Determining Activity from Alignments with Profile and Multiple Aligned Sequences.
Translations of the provided polynucleotides can be aligned with amino acid profiles that
define either protein families or common motifs. Also, translations of the provided
polynucleotides can be aligned to multiple sequence alignments (MSA) comprising the
polypeptide sequences of members of protein families or motifs. Similarity or identity with
profile sequences or MSAs can be used to determine the activity of the gene products (e.g.,
polypeptides) encoded by the provided polynucleotides or corresponding cDNA or genes.
For example, sequences that show an identity or similarity with a chemokine profile or MSA
can exhibit chemokine activities.

Profiles can be designed manually by (1) creating an MSA, which is an alignment of the
amino acid sequence of members that belong to the family and (2) constructing a statistical
representation of the alignment. Such methods are described, for example, in Birney et al.,
Nucl. Acids Res. (1996) 24(14):2730-2739. MSAs of some protein families and motifs are
publicly available. For example, http://genome.wustl.edu/Pfam/ includes MSAs of 547
different families and motifs. These MSAs are described also in Sonnhammer et al.,
Proteins (1997) 28:405-420. Other sources over the world wide web include the site at
http://www.embl-heidelberg.de/argos/ali/ali.html; alternatively, a message can be sent to
ALI@EMBL-HEIDELBERG.DE for the information. A brief description of these MSAs is
reported in Pascarella et al., Prot. Eng. (1996) 9(3):249-251. Techniques for building
profiles from MSAs are described in Sonnhammer et al., supra; Birney et al., supra; and
"Computer Methods for Macromolecular Sequence Analysis," Methods in Enzymology
(1996) 266, Doolittle, Academic Press, Inc., a division of Harcourt Brace & Co., San Diego,
California, USA.

Similarity between a query sequence and a protein family or motif can be determined
by (a) comparing the query sequence against the profile and/or (b) aligning the query
sequence with the members of the family or motif. Typically, a program such as Searchwise
is used to compare the query sequence to the statistical representation of the multiple
alignment, also known as a profile. The program is described in Birney et al., supra. Other
techniques to compare the sequence and profile are described in Sonnhammer et al., supra
and Doolittle, supra.

Next, methods described by Feng et al., J. Mol. Evol. (1987) 25:351 and Higgins et
al., CABIOS (1989) 5:151 can be used to align the query sequence with the members of a
family or motif, also known as an MSA. Computer programs, such as PILEUP, can be used.
See Feng et al., infra. In general, the following factors are used to determine if a similarity
between a query sequence and a profile or MSA exists: (1) number of conserved residues
found in the query sequence, (2) percentage of conserved residues found in the query
sequence, (3) number of frameshifts, and (4) spacing between conserved residues.

Some alignment programs that both translate and align sequences can make any
number of frameshifts when translating the nucleotide sequence to produce the best
alignment. The fewer frameshifts needed to produce an alignment, the stronger the
similarity or identity between the query and profile or MSAs. For example, a weak
similarity resulting from no frameshifts can be a better indication of activity or structure of a
query sequence than a strong similarity resulting from two frameshifts. Preferably, three or
fewer frameshifts are found in an alignment; more preferably, two or fewer frameshifts; even
more preferably, one or fewer frameshifts; even more preferably, no frameshifts are found in
an alignment of query and profile or MSAs.

Conserved residues are those amino acids found at a particular position in all or some
of the family or motif members. For example, most chemokines contain four conserved
cysteines. Alternatively, a position is considered conserved if only a certain class of amino
acids is found in a particular position in all or some of the family members. For example,
the N-terminal position can contain a positively charged amino acid, such as lysine, arginine,
or histidine.

Typically, a residue of a polypeptide is conserved when a class of amino acids or a
single amino acid is found at a particular position in at least about 40% of all class members;
more typically, at least about 50%; even more typically, at least about 60% of the members.
Usually, a residue is conserved when a class or single amino acid is found in at least about
70% of the members of a family or motif; more usually, at least about 80%; even more
usually, at least about 90%; even more usually, at least about 95%.
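
The following sketch illustrates how conserved positions might be called from an MSA using such a frequency threshold. The residue classes shown (only positively and negatively charged residues), the treatment of "-" as a gap character, and the function name conserved_positions are assumptions made for illustration; a fuller implementation would define additional residue classes.

```python
# Illustrative sketch only: call conserved columns of an MSA by frequency.
from collections import Counter

# Hypothetical residue classes; the text names only the positively charged class.
RESIDUE_CLASSES = {
    "K": "positive", "R": "positive", "H": "positive",
    "D": "negative", "E": "negative",
}


def conserved_positions(msa, threshold=0.40):
    """Return {column index: conserved residue or class name}.

    msa is a list of equal-length aligned sequences. A column is conserved
    when a single residue, or failing that a single residue class, occurs in
    at least the given fraction of the sequences.
    """
    conserved = {}
    for col in range(len(msa[0])):
        column = [row[col] for row in msa]
        residue, count = Counter(column).most_common(1)[0]
        if residue != "-" and count / len(column) >= threshold:
            conserved[col] = residue
            continue
        classes = Counter(RESIDUE_CLASSES[r] for r in column if r in RESIDUE_CLASSES)
        if classes:
            class_name, count = classes.most_common(1)[0]
            if count / len(column) >= threshold:
                conserved[col] = class_name
    return conserved
```

Raising the threshold argument from 0.40 toward 0.95 corresponds to the progressively more stringent definitions of a conserved residue recited above.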

A residue is considered conserved when three unrelated amino acids are found at a
particular position in some or all of the members; more usually, two unrelated amino
acids. These residues are conserved when the unrelated amino acids are found at particular
positions in at least about 40% of all class members; more typically, at least about 50%; even
more typically, at least about 60% of the members. Usually, a residue is conserved when a
class or single amino acid is found in at least about 70% of the members of a family or motif;
more usually, at least about 80%; even more usually, at least about 90%; even more usually,
at least about 95%.

A query sequence has similarity to a profile or MSA when the query sequence
comprises at least about 25% of the conserved residues of the profile or MSA; more usually,
at least about 30%; even more usually, at least about 40%. Typically, the query sequence
has a stronger similarity to a profile sequence or MSA when the query sequence comprises at
least about 45% of the conserved residues of the profile or MSA; more typically, at least
about 50%; even more typically, at least about 55%.
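
By way of example, the fraction of conserved residues present in an aligned query sequence can be computed as sketched below. The representation of the conserved positions as a mapping from alignment column to the set of acceptable residues, and the use of 25% and 45% as hard cutoffs, are assumptions of this sketch; the function names are hypothetical.

```python
# Illustrative sketch only: score a query, already aligned to a profile or MSA,
# by the fraction of conserved positions it satisfies.
def conserved_fraction(query_aligned, conserved):
    """Return the fraction of conserved positions matched by the aligned query.

    conserved maps a 0-based alignment column to a string or set of the
    residues accepted at that column.
    """
    if not conserved:
        return 0.0
    hits = sum(
        1
        for position, allowed in conserved.items()
        if position < len(query_aligned) and query_aligned[position] in allowed
    )
    return hits / len(conserved)


def similarity_call(fraction):
    """Map a conserved-residue fraction onto the categories used in the text."""
    if fraction >= 0.45:
        return "stronger similarity"
    if fraction >= 0.25:
        return "similarity"
    return "no similarity"
```

In practice this count would be weighed together with the number of frameshifts and the spacing between conserved residues, which the sketch does not model.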

B. Screening Polynucleotide and Amino Acid Sequences Against Protein
Profiles

The identity and function of the gene that correlates to a polynucleotide described
herein can be determined by screening the polynucleotides or their corresponding amino acid
sequences against profiles of protein families. Such profiles focus on common structural
motifs among proteins of each family. Publicly available profiles are described above in
Section III.A. Additional or alternative profiles are described below.

In comparing a novel polynucleotide with known sequences, several alignment tools
are available. Examples include PileUp, which creates a multiple sequence alignment, and is
described in Feng et al., J. Mol. Evol. (1987) 25:351. Another method, GAP, uses the
alignment method of Needleman et al., J. Mol. Biol. (1970) 48:443. GAP is best suited for
global alignment of sequences. A third method, BestFit, functions by inserting gaps to
maximize the number of matches using the local homology algorithm of Smith et al., Adv.
Appl. Math. (1981) 2:482. Exemplary protein profiles are provided below and in the
examples.
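
For illustration, a global alignment score of the Needleman and Wunsch type can be computed by dynamic programming as sketched below. This is not the GAP program itself; the scoring values (match +1, mismatch -1, linear gap penalty -1) and the function name are assumptions made for this sketch.

```python
# Illustrative sketch only: Needleman-Wunsch global alignment score with a
# simple match/mismatch scheme and a linear gap penalty (assumed values).
def needleman_wunsch_score(a, b, match=1, mismatch=-1, gap=-1):
    """Return the optimal global alignment score between sequences a and b."""
    # Aligning a prefix of b against an empty prefix of a costs one gap per residue.
    previous = [j * gap for j in range(len(b) + 1)]
    for i in range(1, len(a) + 1):
        current = [i * gap] + [0] * len(b)
        for j in range(1, len(b) + 1):
            diagonal = previous[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            current[j] = max(diagonal, previous[j] + gap, current[j - 1] + gap)
        previous = current
    return previous[-1]
```

Unlike a local method, the global score charges for every unaligned residue at either end, which is why a global method is better suited to sequences expected to align over their full length.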

Chemokines. Chemokines are a family of proteins that have been implicated in
lymphocyte trafficking, inflammatory diseases, angiogenesis, hematopoiesis, and viral
infection. See, for example, Rollins, Blood (1997) 90(3):909-928, and Wells et al., J. Leuk.
Biol. (1997) 61:545-550. U.S. Patent No. 5,605,817 discloses DNA encoding a chemokine
expressed in fetal spleen. U.S. Patent No. 5,656,724 discloses chemokine-like proteins and
methods of use. U.S. Patent No. 5,602,008 discloses DNA encoding a chemokine expressed
by liver.

Chemokine mutants are polypeptides having an amino acid sequence that possesses
at least one amino acid substitution, addition, or deletion as compared to native chemokines.
Fragments possess the same amino acid sequence as the native chemokines; mutants can
lack the amino and/or carboxyl terminal sequences. Fusions are mutants, fragments, or
native chemokines that also include amino and/or carboxyl terminal amino acid extensions.

The number or type of the amino acid changes is not critical, nor is the length or
number of the amino acid deletions, or amino acid extensions that are incorporated in the
chemokines as compared to the native chemokine amino acid sequences. A polynucleotide
encoding one of these variant polypeptides will retain at least about 80% amino acid identity
with at least one known chemokine. Preferably, these polypeptides will retain at least about
85% amino acid sequence identity; more preferably, at least about 90%; even more
preferably, at least about 95%. In addition, the variants exhibit at least 80%; preferably
about 90%; more preferably about 95% of at least one activity exhibited by a native
chemokine, which includes immunological, biological, receptor binding, and signal
transduction functions.

Assays for chemotaxis relating to neutrophils are described in Walz et al., Biochem.
Biophys. Res. Commun. (1987) 149:155, Yoshimura et al., Proc. Natl. Acad. Sci. (USA)
(1987) 84:9233, and Schroder et al., J. Immunol. (1987) 139:3474; to lymphocytes, Larsen et
al., Science (1989) 245:1464, Carr et al., Proc. Natl. Acad. Sci. (USA) (1994) 91:3652; to
tumor-infiltrating lymphocytes, Liao et al., J. Exp. Med. (1995) 182:1301; to hematopoietic
progenitors, Aiuti et al., J. Exp. Med. (1997) 185:111; to monocytes, Valente et al., Biochem.
(1988) 27:4162; and to natural killer cells, Loetscher et al., J. Immunol. (1996) 156:322, and
Allavena et al., Eur. J. Immunol. (1994) 24:3233.

Assays for determining the biological activity of attracting eosinophils are described
in Dahinden et al., J. Exp. Med. (1994) 179:751, Weber et al., J. Immunol. (1995) 154:4166,
and Noso et al., Biochem. Biophys. Res. Commun. (1994) 200:1470; for attracting dendritic
cells, Sozzani et al., J. Immunol. (1995) 155:3292; for attracting basophils, in Dahinden et
al., J. Exp. Med. (1994) 179:751, Alam et al., J. Immunol. (1994) 152:1298, Alam et al., J.
Exp. Med. (1992) 775:781; and for activating neutrophils, Maghazaci et al., Eur. J. Immunol.
(1996) 26:315, and Taub et al., J. Immunol. (1995) 155:3877. Native chemokines can act as
mitogens for fibroblasts, assayed as described in Mullenbach et al., J. Biol. Chem. (1986)
261:719.

Native chemokines exhibit binding activity with a number of receptors. Such
receptors and assays to detect binding are described in, for example, Murphy et al.,
Science (1991) 253:1280; Combadiere et al., J. Biol. Chem. (1995) 270:29671; Daugherty et
al., J. Exp. Med. (1996) 183:2349; Samson et al., Biochem. (1996) 35:3362; Raport et al., J.
Biol. Chem. (1996) 271:17161; Combadiere et al., J. Leukoc. Biol. (1996) 60:147; Baba et
al., J. Biol. Chem. (1997) 272:14893; Yosida et al., J. Biol. Chem. (1997) 272:13803;
Arvannitakis et al., Nature (1997) 385:347, and other assays are known in the art.

Assays for kinase activation of chemokines are described by Yen et al., J. Leukoc.
Biol. (1997) 61:529; Dubois et al., J. Immunol. (1996) 156:1356; Turner et al., J. Immunol.
(1995) 155:2437. Assays for inhibition of angiogenesis or cell proliferation are described in
Maione et al., Science (1990) 247:11. Glycosaminoglycan production can be induced by
native chemokines, assayed as described in Castor et al., Proc. Natl. Acad. Sci. (USA) (1983)
80:765. Chemokine-mediated histamine release from basophils is assayed as described in
Dahinden et al., J. Exp. Med. (1989) 170:1787; and White et al., Immunol. Lett. (1989)
22:151. Heparin binding is described in Luster et al., J. Exp. Med. (1995) 182:219.

Chemokines can possess dimerization activity, which can be assayed according to
Burrows et al., Biochem. (1994) 33:12741; and Zhang et al., Mol. Cell. Biol. (1995) 15:4851.
Native chemokines can play a role in the inflammatory response of viruses. This activity
can be assayed as described in Bleul et al., Nature (1996) 382:829; and Oberlin et al., Nature
(1996) 382:833. Exocytosis of monocytes can be promoted by native chemokines. The
assay for such activity is described in Uguccioni et al., Eur. J. Immunol. (1995) 25:64.
Native chemokines also can inhibit hematopoietic stem cell proliferation. The method for
testing for such activity is reported in Graham et al., Nature (1990) 344:442.

Death Domain Proteins. Several protein families contain death domain motifs
(Feinstein and Kimchi, TIBS Letters (1995) 20:242). Some death domain containing
proteins are implicated in cytotoxic intracellular signaling (Cleveland et al., Cell (1995)
57:479; Pan et al., Science (1997) 276:111; Duan et al., Nature (1997) 555:86-89; and
Chinnaiyan et al., Science (1996) 274:990). U.S. Patent No. 5,563,039 describes a protein
homologous to TRADD (Tumor Necrosis Factor Receptor-1 Associated Death Domain
containing protein), and modifications of the active domain of TRADD that retain the
functional characteristics of the protein, as well as apoptosis assays for testing the function of
such death domain containing proteins. U.S. Patent No. 5,658,883 discloses biologically
active TGF-B1 peptides. U.S. Patent No. 5,674,734 discloses RIP, which contains a
C-terminal death domain and an N-terminal kinase domain.
Leukemia Inhibitory Factor (LIF). An LIF profile is constructed from sequences of
leukemia inhibitory factor, CT-1 (cardiotrophin-1), CNTF (ciliary neurotrophic factor), OSM
(oncostatin M), and IL-6 (interleukin-6). This profile encompasses a family of secreted
cytokines that have pleiotropic effects on many cell types including hepatocytes, osteoclasts,
neuronal cells and cardiac myocytes, and can be used to detect additional genes encoding
such proteins. These molecules are all structurally related and share a common co-receptor,
gp130, which mediates intracellular signal transduction by cytoplasmic tyrosine kinases such
as src.

Novel proteins related to this family are also likely to be secreted, to activate gp130
and to function in the development of a variety of cell types. Thus new members of this
family would be candidates to be developed as growth or survival factors for the cell types
that they stimulate. For more details on this family of cytokines, see Pennica et al., Cytokine
and Growth Factor Reviews (1996) 7:81-91. U.S. Patent No. 5,420,247 discloses LIF
receptor and fusion proteins. U.S. Patent No. 5,443,825 discloses human LIF.

Angiopoietin. Angiopoietin-1 is a secreted ligand of the TIE-2 tyrosine kinase; it
functions as an angiogenic factor critical for normal vascular development. Angiopoietin-2 is
a natural antagonist of angiopoietin-1 and thus functions as an anti-angiogenic factor. These
two proteins are structurally similar and activate the same receptor (Folkman et al., Cell
(1996) 57:1153, and Davis et al., Cell (1996) 57:1161). The angiopoietin molecules are
composed of two domains: a coiled-coil region and a region related to fibrinogen. The
fibrinogen domain is found in many molecules including ficolin and tenascin, and is well
defined structurally with many members.

Receptor Protein-Tyrosine Kinases. Receptor Protein-Tyrosine Kinases or RPTKs
are described in Lindberg, Annu. Rev. Cell Biol. (1994) 70:251-337.

Growth Factors: Epidermal Growth Factor (EGF) and Fibroblast Growth Factor
(FGF). For a discussion of growth factor superfamilies, see Growth Factors: A Practical
Approach, (Appendix A1) (1993) McKay and Leigh, Oxford University Press, NY, 237-243.
U.S. Patent No. 4,444,760 discloses acidic brain fibroblast growth factor, which is active in
the promotion of cell division and wound healing. U.S. Patent No. 5,439,818 discloses DNA
encoding human recombinant basic fibroblast growth factor, which is active in wound
healing. U.S. Patent No. 5,604,293 discloses recombinant human basic fibroblast growth
factor, which is useful for wound healing. U.S. Patent No. 5,410,832 discloses brain-derived
and recombinant acidic fibroblast growth factor, which act as mitogens for mesoderm and
neuroectoderm-derived cells in culture, and promote wound healing in soft tissue,
cartilaginous tissue and musculoskeletal tissue. U.S. Patent No. 5,387,673 discloses
biologically active fragments of FGF.

Proteins of the TNF Family. A profile derived from the TNF family is created by
aligning sequences of the following TNF family members: nerve growth factor (NGF),
lymphotoxin, Fas ligand, tumor necrosis factor (TNFα), CD40 ligand, TRAIL, ox40 ligand,
4-1BB ligand, CD27 ligand, and CD30 ligand. The profile is designed to identify sequences
of proteins that constitute new members or homologues of this family of proteins. U.S.
Patent No. 5,606,023 discloses mutant TNF proteins; U.S. Patent No. 5,597,899 and U.S.
Patent No. 5,486,463 disclose TNF muteins; and U.S. Patent No. 5,652,353 discloses DNA
encoding TNFα muteins.

Members of the TNF family of proteins have been shown in vitro to multimerize, as
described in Burrows et al., Biochem. (1994) 55:12741 and Zhang et al., Mol. Cell. Biol.
(1995) 75:4851, and to bind receptors as described in Browning et al., J. Immunol. (1994)
147:1230, Androlewicz et al., J. Biol. Chem. (1992) 267:2542, and Crowe et al., Science
(1994) 25*707.

In vivo, TNFs proteolytically cleave a target protein as described in Kriegler et al.,
Cell (1988) 55:45 and Mohler et al., Nature (1994) 570:218 and demonstrate cell
proliferation and differentiation activity. T-cell or thymocyte proliferation is assayed as
described in Armitage et al., Eur. J. Immunol. (1992) 22:447; Current Protocols in
Immunology, ed. J.E. Coligan et al., 3.1-3.19; Takai et al., J. Immunol. (1986) 757:3494-3500;
Bertagnoli et al., J. Immunol. (1990) 145:1706; Bertagnoli et al., J. Immunol. (1991)
755:327; Bertagnoli et al., J. Immunol. (1992) 149:311%; and Bowman et al., J. Immunol.
(1994) 752:1756. B cell proliferation and Ig secretion are assayed as described in
Maliszewski, J. Immunol. (1990) 744:3028; Assays for B Cell Function: In Vitro
Antibody Production, Mond and Brunswick, Current Protocols in Immunol., Coligan Ed., vol.
1, pp. 3.8.1-3.8.16, John Wiley and Sons, Toronto 1994; Kehrl et al., Science (1987) 255:1144;
and Boussiotis et al., PNAS USA (1994) 97:7007. Other in vivo activities include
upregulation of cell surface antigens, upregulation of costimulatory molecules, and cellular
aggregation/adhesion as described in Barrett et al., J. Immunol. (1991) 746:1722; Bjorck et
al., Eur. J. Immunol. (1993) 23:1771; Clark et al., Annu. Rev. Immunol. (1991) 9:97;
Ranheim et al., J. Exp. Med. (1994) 777:925; Yellin, J. Immunol. (1994) 755:666; and Gross
et al., Blood (1994) 5-/:2305.

Proliferation and differentiation of hematopoietic and lymphopoietic cells have also
been shown in vivo for TNFs, using assays for embryonic differentiation and hematopoiesis
as described in Johansson et al., Cellular Biology (1995) 75:141, Keller et al., Mol. Cell.
Biol. (1993) 75:473, and McClanahan et al., Blood (1993) 57:2903, and using assays to detect
stem cell survival and differentiation as described in Culture of Hematopoietic Cells,
Freshney et al. eds, pp 1-21, 23-29, 139-162, 163-179, and 265-268, Wiley-Liss, Inc., New
York, NY, 1994, and Hirajama et al., PNAS USA (1992) 59:5907.

In vivo activities of TNFs also include lymphocyte survival and apoptosis, assayed as
described in Darzynkewicz et al., Cytometry (1992) 75:795; Gorczyca et al., Leukemia (1993)
7:659; Itoh et al., Cell (1991) 66:233; Zacharduk, J. Immunol. (1990) 745:4037; Zamai et
al., Cytometry (1993) 14:m; and Gorczyca et al., Int. J. Oncol. (1992) 7:639. Some
members of the TNF family are cleaved from the cell surface; others remain membrane
bound. The three-dimensional structure of TNF is discussed in Sprang and Eck, Tumor
Necrosis Factors; supra.

TNF proteins include a transmembrane domain. The protein is cleaved into a shorter
soluble version, as described in Kriegler et al., Cell (1988) 55:45, Perez et al., Cell (1990)
65:251, and Shaw et al., Cell (1986) 46:659. The transmembrane domain is between amino
acids 46 and 77 and the cytoplasmic domain is between positions 1 and 45 on the human form
of TNFα. The 3-dimensional motifs of TNF include a sandwich of two pleated β sheets.
Each sheet is composed of anti-parallel β strands. β strands facing each other on opposite
sides of the sandwich are connected by short polypeptide loops, as described in Van Ostade et
al., Protein Engineering (1994) 7(1):5, and Sprang et al., Tumor Necrosis Factors; supra.
Residues of the TNF family proteins that are involved in the β sheet secondary structure
have been identified as described in Van Ostade et al., Protein Eng. (1994) 7(1):5, and
Sprang et al., supra.

TNF receptors are disclosed in U.S. Patent No. 5,395,760. A profile derived from the
TNF receptor family is created by aligning sequences of the TNF receptor family, including
Apo1/Fas, TNFR I and II, death receptor 3 (DR3), CD40, ox40, CD27, and CD30. Thus, the
profile is designed to identify from the polynucleotides of the invention sequences of
proteins that constitute new members or homologues of this family of proteins.

Tumor necrosis factor receptors exist in two forms in humans: p55 TNFR and p75
TNFR, both of which provide intracellular signals upon binding with a ligand. The
extracellular domains of these receptor proteins are cysteine rich. The receptors can remain
membrane bound, although some forms of the receptors are cleaved, forming soluble
receptors. The regulation of soluble TNF receptors, and their diagnostic, prognostic, and
therapeutic value, are discussed in Aderka, Cytokine and Growth Factor Reviews (1996)
7(3):231.

PDGF Family. U.S. Patent No. 5,326,695 discloses platelet derived growth factor
agonists; bioactive portions of PDGF-B are used as agonists. U.S. Patent No. 4,845,075
discloses biologically active B-chain homodimers, and also includes variants and derivatives
of the PDGF-B chain. U.S. Patent No. 5,128,321 discloses PDGF analogs and methods of
use. Proteins having the same bioactivity as PDGF are disclosed, including A and B chain
proteins.

Kinase (Including MKK) Family. U.S. Patent No. 5,650,501 discloses a
serine/threonine kinase associated with mitotic and meiotic cell division; the protein has a
kinase domain in its N-terminus and 3 PEST regions in the C-terminus. U.S. Patent No.
5,605,825 discloses human PAK65, a serine protein kinase.

The foregoing discussion provides a few examples of the protein profiles that can be
compared with the polynucleotides of the invention. One skilled in the art can use these and
other protein profiles to identify the genes that correlate with the provided polynucleotides.

C. Identification of Secreted & Membrane-Bound Polypeptides

Both secreted and membrane-bound polypeptides of the present invention are of
particular interest. For example, levels of secreted polypeptides can be assayed in body
fluids that are convenient, such as blood, urine, prostatic fluid and semen. Membrane-bound
polypeptides are useful for constructing vaccine antigens or inducing an immune response.
Such antigens would comprise all or part of the extracellular region of the membrane-bound
polypeptides. Because both secreted and membrane-bound polypeptides comprise a
fragment of contiguous hydrophobic amino acids, hydrophobicity predicting algorithms can
be used to identify such polypeptides.


WO 99/33982 PCT/US98/27610

A signal sequence is usually encoded by both secreted and membrane-bound
polypeptide genes to direct a polypeptide to the surface of the cell. The signal sequence
usually comprises a stretch of hydrophobic residues. Such signal sequences can fold into
helical structures. Membrane-bound polypeptides typically comprise at least one
transmembrane region that possesses a stretch of hydrophobic amino acids that can
traverse the membrane. Some transmembrane regions also exhibit a helical structure.

Hydrophobic fragments within a polypeptide can be identified by using computer
algorithms. Such algorithms include Hopp & Woods, Proc. Natl. Acad. Sci. USA (1981)
75:3824-3828; Kyte & Doolittle, J. Mol. Biol. (1982) 157:105-132; and the RAOAR algorithm,
Degli Esposti et al., Eur. J. Biochem. (1990) 790:207-219.
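As an illustration of how such a hydrophobicity algorithm can operate, the following is a minimal sketch of a sliding-window hydropathy profile using the published Kyte & Doolittle scale. The function names, the window of 19 residues and the threshold of 1.6 are assumptions chosen for illustration only; they are not parameters specified in this disclosure.

```python
# Kyte & Doolittle hydropathy index for the 20 standard amino acids.
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(protein, window=19):
    """Return the mean hydropathy of every full window along the protein.

    The window length is an illustrative assumption. Residues outside the
    20 standard amino acids raise KeyError.
    """
    protein = protein.upper()
    if len(protein) < window:
        return []
    scores = [KD[aa] for aa in protein]
    return [sum(scores[i:i + window]) / window
            for i in range(len(protein) - window + 1)]


def hydrophobic_windows(protein, window=19, threshold=1.6):
    """Return 0-based start positions of windows whose mean exceeds threshold.

    The threshold is an illustrative assumption, not a value from the text.
    """
    return [i for i, score in enumerate(hydropathy_profile(protein, window))
            if score > threshold]
```

A call such as `hydrophobic_windows(sequence)` would return the start positions of candidate hydrophobic fragments, which could then be examined as putative signal or transmembrane regions.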

Another method of identifying secreted and membrane-bound polypeptides is to
translate the polynucleotides of the invention in all six frames and determine if at least 8
contiguous hydrophobic amino acids are present. Those translated polypeptides with at least
8, more typically 10, and even more typically 12 contiguous hydrophobic amino acids are
considered to be putative secreted or membrane-bound polypeptides. Hydrophobic
amino acids include alanine, glycine, histidine, isoleucine, leucine, lysine, methionine,
phenylalanine, proline, threonine, tryptophan, tyrosine, and valine.
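The six-frame screen described above can be sketched as follows. This is a minimal illustration that assumes the standard genetic code and a DNA input containing only A, C, G and T; the function names are hypothetical, and the hydrophobic set is taken directly from the list of amino acids given in the preceding paragraph.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa
               for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Hydrophobic residues as listed in the text: Ala, Gly, His, Ile, Leu, Lys,
# Met, Phe, Pro, Thr, Trp, Tyr, Val.
HYDROPHOBIC = set("AGHILKMFPTWYV")
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    return dna.translate(COMPLEMENT)[::-1]


def translate(dna):
    """Translate complete codons; unknown codons become 'X', stops '*'."""
    return "".join(CODON_TABLE.get(dna[i:i + 3], "X")
                   for i in range(0, len(dna) - 2, 3))


def six_frames(dna):
    """Return translations of the three forward and three reverse frames."""
    dna = dna.upper()
    strands = (dna, reverse_complement(dna))
    return [translate(s[offset:]) for s in strands for offset in range(3)]


def longest_hydrophobic_run(protein):
    best = run = 0
    for aa in protein:
        run = run + 1 if aa in HYDROPHOBIC else 0
        best = max(best, run)
    return best


def is_putative_secreted_or_membrane(dna, min_run=8):
    """True if any frame has at least min_run contiguous hydrophobic residues."""
    return any(longest_hydrophobic_run(p) >= min_run for p in six_frames(dna))
```

Raising `min_run` to 10 or 12 applies the more stringent criteria mentioned in the text.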

IV. Identification of the Function of an Expression Product of a Full-Length Gene
Corresponding to a Polynucleotide

Ribozymes, antisense constructs, and dominant negative mutants can be used to
determine function of the expression product of a gene corresponding to a polynucleotide
provided herein. These methods and compositions are particularly useful where the provided
novel polynucleotide exhibits no significant or substantial homology to a sequence encoding
a gene of known function. Antisense molecules and ribozymes can be constructed from
synthetic polynucleotides. Typically, the phosphoramidite method of oligonucleotide
synthesis is used. See Beaucage et al., Tet. Lett. (1981) 22:1859 and U.S. Patent No.
4,668,777. Automated devices for synthesis are available to create oligonucleotides using
this chemistry. Examples of such devices include Biosearch 8600, Models 392 and 394 by
Applied Biosystems, a division of Perkin-Elmer Corp., Foster City, California, USA; and
Expedite by Perceptive Biosystems, Framingham, Massachusetts, USA. Synthetic RNA,
phosphate analog oligonucleotides, and chemically derivatized oligonucleotides can also be
produced, and can be covalently attached to other molecules. RNA oligonucleotides can be
synthesized, for example, using RNA phosphoramidites. This method can be performed on
an automated synthesizer, such as Applied Biosystems, Models 392 and 394, Foster City,
California, USA. See Applied Biosystems User Bulletin 53 and Ogilvie et al., Pure &
Applied Chem. (1987) 59:325.

Phosphorothioate oligonucleotides can also be synthesized for antisense construction.
A sulfurizing reagent, such as tetraethylthiuram disulfide (TETD) in acetonitrile, can be used
to convert the internucleotide cyanoethyl phosphite to the phosphorothioate triester within 15
minutes at room temperature. TETD replaces the iodine reagent, while all other reagents
used for standard phosphoramidite chemistry remain the same. Such a synthesis method can
be automated using Models 392 and 394 by Applied Biosystems, for example.

Oligonucleotides of up to 200 nucleotides can be synthesized; more typically up to 100
nucleotides, still more typically up to 50 nucleotides, and even more typically 30 to 40
nucleotides. These synthetic fragments can be annealed and ligated together to construct
larger fragments. See, for example, Sambrook et al., supra.

A. Ribozymes

Trans-cleaving catalytic RNAs (ribozymes) are RNA molecules possessing
endoribonuclease activity. Ribozymes are specifically designed for a particular target, and
the target message must contain a specific nucleotide sequence. They are engineered to
cleave any RNA species site-specifically in the background of cellular RNA. The cleavage
event renders the mRNA unstable and prevents protein expression. Importantly, ribozymes
can be used to inhibit expression of a gene of unknown function for the purpose of
determining its function in an in vitro or in vivo context, by detecting the phenotypic effect.

One commonly used ribozyme motif is the hammerhead, for which the substrate
sequence requirements are minimal. Design of the hammerhead ribozyme is disclosed in
Usman et al., Current Opin. Struct. Biol. (1996) 6:527. Usman also discusses the
therapeutic uses of ribozymes. Ribozymes can also be prepared and used as described in
Long et al., FASEB J. (1993) 7:25; Symons, Annu. Rev. Biochem. (1992) 67:641; Perrotta et
al., Biochem. (1992) 57:16; Ojwang et al., Proc. Natl. Acad. Sci. (USA) (1992) 59:10802;
and U.S. Patent No. 5,254,678. Ribozyme cleavage of HIV-1 RNA is described in U.S.
Patent No. 5,144,019; methods of cleaving RNA using ribozymes are described in U.S.
Patent No. 5,116,742; and methods for increasing the specificity of ribozymes are described
in U.S. Patent No. 5,225,337 and Koizumi et al., Nucleic Acids Res. (1989) 17:7059.
Preparation and use of ribozyme fragments in a hammerhead structure are also described by
Koizumi et al., Nucleic Acids Res. (1989) 17:7059. Preparation and use of ribozyme
fragments in a hairpin structure are described by Chowrira and Burke, Nucleic Acids Res.
(1992) 20:2835. Ribozymes can also be made by rolling transcription as described in
Daubendiek and Kool, Nat. Biotechnol. (1997) 15(3):273.

The hybridizing region of the ribozyme can be modified or can be prepared as a
branched structure as described in Horn and Urdea, Nucleic Acids Res. (1989) 17:6959. The
basic structure of the ribozymes can also be chemically altered in ways familiar to those
skilled in the art, and chemically synthesized ribozymes can be administered as synthetic
oligonucleotide derivatives modified by monomeric units. In a therapeutic context, liposome
mediated delivery of ribozymes improves cellular uptake, as described in Birikh et al., Eur.
J. Biochem. (1997) 245:1.

Using the polynucleotide sequences of the invention and methods known in the art,
ribozymes are designed to specifically bind and cut the corresponding mRNA species.
Ribozymes thus provide a means to inhibit the expression of any of the proteins encoded by
the disclosed polynucleotides or their full-length genes. The full-length gene need not be
known in order to design and use specific inhibitory ribozymes. In the case of a
polynucleotide or full-length cDNA of unknown function, ribozymes corresponding to that
nucleotide sequence can be tested in vitro for efficacy in cleaving the target transcript.
Those ribozymes that effect cleavage in vitro are further tested in vivo. The ribozyme can
also be used to generate an animal model for a disease, as described in Birikh et al., supra.
An effective ribozyme is used to determine the function of the gene of interest by blocking
its transcription and detecting a change in the cell. Where the gene is found to be a mediator
in a disease, an effective ribozyme is designed and delivered in a gene therapy for blocking
transcription and expression of the gene.

Therapeutic and functional genomic applications of ribozymes begin
with knowledge of a portion of the coding sequence of the gene to be inhibited. Thus, for
many genes, a partial polynucleotide sequence provides adequate sequence for constructing
an effective ribozyme. A target cleavage site is selected in the target sequence, and a
ribozyme is constructed based on the 5' and 3' nucleotide sequences that flank the cleavage
site. Retroviral vectors are engineered to express monomeric and multimeric hammerhead
ribozymes targeting the mRNA of the target coding sequence. These monomeric and
multimeric ribozymes are tested in vitro for an ability to cleave the target mRNA. A cell line
is stably transduced with the retroviral vectors expressing the ribozymes, and the
transduction is confirmed by Northern blot analysis and reverse-transcription polymerase
chain reaction (RT-PCR). The cells are screened for inactivation of the target mRNA by
such indicators as reduction of expression of disease markers or reduction of the gene
product of the target mRNA.
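The site-selection step described above, choosing a cleavage site and taking the flanking 5' and 3' sequences, can be sketched in a few lines. The sketch below makes several assumptions that are not stated in this disclosure: it searches for a GUC triplet (a triplet commonly reported as cleavable by hammerhead ribozymes), treats cleavage as occurring immediately 3' of that triplet, and uses a hypothetical arm length of 8 nucleotides. It reports only the flanks and their complements; it does not build the catalytic core, and all names are illustrative.

```python
RNA_COMPLEMENT = str.maketrans("ACGU", "UGCA")


def reverse_complement_rna(rna):
    return rna.translate(RNA_COMPLEMENT)[::-1]


def candidate_sites(mrna, triplet="GUC", arm=8):
    """List candidate cleavage sites with full-length flanks on both sides.

    triplet and arm are illustrative assumptions. DNA input is accepted and
    converted to RNA by replacing T with U.
    """
    mrna = mrna.upper().replace("T", "U")
    sites = []
    start = mrna.find(triplet)
    while start != -1:
        cut = start + len(triplet)          # position just 3' of the triplet
        upstream = mrna[start - arm:start] if start >= arm else ""
        downstream = mrna[cut:cut + arm]
        if len(upstream) == arm and len(downstream) == arm:
            sites.append({
                "cut_position": cut,
                "upstream_flank": upstream,
                "downstream_flank": downstream,
                # Sequences complementary to each flank (5'->3'), which a
                # ribozyme binding arm would need to contain.
                "arm_for_upstream": reverse_complement_rna(upstream),
                "arm_for_downstream": reverse_complement_rna(downstream),
            })
        start = mrna.find(triplet, start + 1)
    return sites
```

Candidate sites found this way would still need to be tested in vitro for cleavage of the target mRNA, as the text describes.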

B. Antisense 

Antisense nucleic acids are designed to specifically bind to RNA, resulting in the
formation of RNA-DNA or RNA-RNA hybrids, with an arrest of DNA replication, reverse
transcription or messenger RNA translation. Antisense polynucleotides based on a selected
polynucleotide sequence can interfere with expression of the corresponding gene. Antisense
polynucleotides are typically generated within the cell by expression from antisense
constructs that contain the antisense strand as the transcribed strand. Antisense
polynucleotides based on the disclosed polynucleotides will bind and/or interfere with the
translation of mRNA comprising a sequence complementary to the antisense polynucleotide.
The expression products of control cells and cells treated with the antisense construct are
compared to detect the protein product of the gene corresponding to the polynucleotide upon
which the antisense construct is based. The protein is isolated and identified using routine
biochemical methods.
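Because an antisense polynucleotide is complementary to its target, a candidate antisense sequence can be derived from a sense-strand sequence by taking the reverse complement. The following minimal sketch shows this, together with a hypothetical helper that tiles short antisense oligonucleotides across a target. The oligonucleotide length and step size are illustrative assumptions, not values from this disclosure.

```python
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def antisense(sense_dna):
    """Return the reverse complement (5'->3') of a sense-strand DNA sequence."""
    return sense_dna.translate(COMPLEMENT)[::-1]


def tile_antisense_oligos(sense_dna, length=20, step=5):
    """Return (start, oligo) pairs of antisense oligos tiled along the target.

    length and step are illustrative assumptions; start is the 0-based
    position of the targeted window on the sense strand.
    """
    return [(i, antisense(sense_dna[i:i + length]))
            for i in range(0, len(sense_dna) - length + 1, step)]
```

Each oligonucleotide returned is complementary to the window of the sense strand beginning at the reported start position.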

One rationale for using antisense methods to determine the function of the gene
corresponding to a disclosed polynucleotide is the biological activity of antisense
therapeutics. Antisense therapy for a variety of cancers is in clinical phase and has been
discussed extensively in the literature. Reed reviewed antisense therapy directed at the Bcl-2
gene in tumors; gene transfer-mediated overexpression of Bcl-2 in tumor cell lines conferred
resistance to many types of cancer drugs (Reed, J.C., N.C.I. (1997) 59:988). The potential
for clinical development of antisense inhibitors of ras is discussed by Cowsert, L.M.,
Anti-Cancer Drug Design (1997) 72:359. Additional important antisense targets include
leukemia (Geurtz, A.M., Anti-Cancer Drug Design (1997) 72:341); human C-raf kinase
(Monia, B.P., Anti-Cancer Drug Design (1997) 72:327); and protein kinase C (McGraw et
al., Anti-Cancer Drug Design (1997) 72:315).

Given the extensive background literature and clinical experience in antisense
therapy, one skilled in the art can use selected polynucleotides of the invention as additional
potential therapeutics. The choice of polynucleotide can be narrowed by first testing
candidates for binding to "hot spot" regions of the genome of cancerous cells. If a
polynucleotide is identified as binding to a "hot spot", testing the polynucleotide as an
antisense compound in the corresponding cancer cells is warranted.

Ogunbiyi et al., Gastroenterology (1997) 113(3)\16\ describe prognostic use of
allelic loss in colon cancer; Barks et al., Genes, Chromosomes, and Cancer (1997) 19(4):27$
describe increased chromosome copy number detected by FISH in malignant melanoma;
Nishizake et al., Genes, Chromosomes, and Cancer (1997) 19(4):261 describe genetic
alterations in primary breast cancer and their metastases and direct comparison using
modified comparative genome hybridization; and Elo et al., Cancer Research (1997)
57(7(^:3356 disclose that loss of heterozygosity at 16q24.1-q24.2 is significantly associated
with metastatic and aggressive behavior of prostate cancer.
C. Dominant Negative Mutations

As an alternative method for identifying function of the gene corresponding to a
polynucleotide disclosed herein, dominant negative mutations are readily generated for
corresponding proteins that are active as homomultimers. A mutant polypeptide will interact
with wild-type polypeptides (made from the other allele) and form a non-functional
multimer. Thus, a mutation is in a substrate-binding domain, a catalytic domain, or a
cellular localization domain. Preferably, the mutant polypeptide will be overproduced.
Point mutations are made that have such an effect. In addition, fusion of different
polypeptides of various lengths to the terminus of a protein can yield dominant negative
mutants. General strategies are available for making dominant negative mutants (see, e.g.,
Herskowitz, Nature (1987) 329:219). Such techniques can be used to create loss of function
mutations, which are useful for determining protein function.

V. Construction of Polypeptides of the Invention and Variants Thereof

The polypeptides of the invention include those encoded by the disclosed
polynucleotides. These polypeptides can also be encoded by nucleic acids that, by virtue of
the degeneracy of the genetic code, are not identical in sequence to the disclosed
polynucleotides. Thus, the invention includes within its scope a polypeptide encoded by a
polynucleotide having the sequence of any one of SEQ ID NOS: 1-844 or a variant thereof.

In general, the term "polypeptide" as used herein refers to the full-length
polypeptide encoded by the recited polynucleotide, the polypeptide encoded by the gene
represented by the recited polynucleotide, and portions or fragments thereof.
"Polypeptides" also includes variants of the naturally occurring proteins, where such
variants are homologous or substantially similar to the naturally occurring protein, and can
be of an origin of the same or different species as the naturally occurring protein (e.g.,
human, murine, or some other species that naturally expresses the recited polypeptide,
usually a mammalian species). In general, variant polypeptides have a sequence that has at
least about 80%, usually at least about 90%, and more usually at least about 98% sequence
identity with a differentially expressed polypeptide of the invention, as measured by BLAST
using the parameters described above. The variant polypeptides can be naturally or
non-naturally glycosylated, i.e., the polypeptide has a glycosylation pattern that differs from the
glycosylation pattern found in the corresponding naturally occurring protein.

The invention also encompasses homologs of the disclosed polypeptides (or
fragments thereof) where the homologs are isolated from other species, i.e., other animal or
plant species. Such species are usually mammalian, e.g., rodents, such as mice and
rats; domestic animals, e.g., horse, cow, dog, cat; and humans. By homolog is meant a
polypeptide having at least about 35%, usually at least about 40% and more usually at least
about 60% amino acid sequence identity to a particular differentially expressed protein as
identified above, where sequence identity is determined using the BLAST algorithm, with
the parameters described supra.

In general, the polypeptides of the subject invention are provided in a non-naturally
occurring environment, e.g., are separated from their naturally occurring environment. In
certain embodiments, the subject protein is present in a composition that is enriched for the
protein as compared to a control. As such, purified polypeptide is provided, where by
purified is meant that the protein is present in a composition that is substantially free of
non-differentially expressed polypeptides, where by substantially free is meant that less than
90%, usually less than 60% and more usually less than 50% of the composition is made up
of non-differentially expressed polypeptides.

Also within the scope of the invention are variants; variants of polypeptides include
mutants, fragments, and fusions. Mutants can include amino acid substitutions, additions or
deletions. The amino acid substitutions can be conservative amino acid substitutions or
substitutions to eliminate non-essential amino acids, such as to alter a glycosylation site, a
phosphorylation site or an acetylation site, or to minimize misfolding by substitution or
deletion of one or more cysteine residues that are not necessary for function. Conservative
amino acid substitutions are those that preserve the general charge,
hydrophobicity/hydrophilicity, and/or steric bulk of the amino acid substituted. For
example, substitutions between the following groups are conservative: Gly/Ala, Val/Ile/Leu,
Asp/Glu, Lys/Arg, Asn/Gln, Ser/Cys, Thr, and Phe/Trp/Tyr.
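The grouping above lends itself to a simple lookup. The sketch below encodes the groups exactly as listed (Thr is kept as its own group, following a literal reading of the list) and reports, for two aligned sequences of equal length, which substitutions fall within a group. The function names are hypothetical and the approach is only one illustrative way to apply the groups.

```python
# Conservative groups as listed in the text, in one-letter code.
# Thr is kept separate, following the list as written.
CONSERVATIVE_GROUPS = ["GA", "VIL", "DE", "KR", "NQ", "SC", "T", "FWY"]
GROUP_OF = {aa: index
            for index, group in enumerate(CONSERVATIVE_GROUPS)
            for aa in group}


def is_conservative(original, replacement):
    """True if both residues belong to the same listed group."""
    a, b = original.upper(), replacement.upper()
    return a in GROUP_OF and b in GROUP_OF and GROUP_OF[a] == GROUP_OF[b]


def classify_substitutions(wild_type, variant):
    """Return (position, original, replacement, conservative?) per difference.

    Positions are 1-based. Sequences must already be aligned and of equal
    length; insertions and deletions are outside the scope of this sketch.
    """
    if len(wild_type) != len(variant):
        raise ValueError("sequences must have equal length")
    return [(i + 1, w, v, is_conservative(w, v))
            for i, (w, v) in enumerate(zip(wild_type.upper(), variant.upper()))
            if w != v]
```

Residues that do not appear in any listed group, such as Met, His or Pro, are never reported as conservative by this sketch.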

Variants can be designed so as to retain biological activity of a particular region of
the protein (e.g., a functional domain and/or, where the polypeptide is a member of a protein
family, a region associated with a consensus sequence). In a non-limiting example, Osawa et
al., Biochem. Mol. Int. (1994) 54:1003, discusses the actin binding region of a protein from
several different species. The actin binding regions of these species are considered
homologous based on the fact that they have amino acids that fall within "homologous
residue groups." Homologous residues are judged according to the following groups (using
single letter amino acid designations): STAG; ILVMF; HRK; DEQN; and FYW. For
example, an S, a T, an A or a G can be in a position and the function (in this case actin
binding) is retained.

Additional guidance on amino acid substitution is available from studies of protein
evolution. Go et al., Int. J. Peptide Protein Res. (1980) 75:211, classified amino acid residue
sites as interior or exterior depending on their accessibility. More frequent substitution on
exterior sites was confirmed to be general in eight sets of homologous protein families
regardless of their biological functions and the presence or absence of a prosthetic group.
Virtually all types of amino acid residues had higher mutabilities on the exterior than in the
interior. No correlation between mutability and polarity of amino acid residues was
observed in either the interior or the exterior. Amino acid residues were classified into
one of three groups depending on their polarity: polar (Arg, Lys, His, Gln, Asn, Asp, and
Glu); weak polar (Ala, Pro, Gly, Thr, and Ser), and nonpolar (Cys, Val, Met, Ile, Leu, Phe,
Tyr, and Trp). Amino acid replacements during protein evolution were very conservative:
88% and 76% of them in the interior or exterior, respectively, were within the same group of
the three. Inter-group replacements are such that weak polar residues are replaced more
often by nonpolar residues in the interior and more often by polar residues on the exterior.
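The three polarity groups above can be used to score a set of observed replacements. The following minimal sketch, with hypothetical function names, computes the fraction of replacements that stay within one group, which is the kind of statistic the 88% and 76% figures describe. The input pairs in any real use would come from an alignment, which this sketch does not perform.

```python
# Polarity groups as given in the text, in one-letter code.
POLARITY = {}
for label, residues in (("polar", "RKHQNDE"),
                        ("weak polar", "APGTS"),
                        ("nonpolar", "CVMILFYW")):
    for aa in residues:
        POLARITY[aa] = label


def same_group(original, replacement):
    """True if both residues fall in the same polarity group."""
    return POLARITY[original.upper()] == POLARITY[replacement.upper()]


def same_group_fraction(replacements):
    """Fraction of (original, replacement) pairs that stay within one group."""
    pairs = list(replacements)
    if not pairs:
        raise ValueError("no replacements given")
    return sum(same_group(a, b) for a, b in pairs) / len(pairs)
```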

Additional guidance for production of polypeptide variants is provided in Querol et
al., Prot. Eng. (1996) 9:265, which provides general rules for amino acid substitutions to
enhance protein thermostability. New glycosylation sites can be introduced as discussed in
Olsen and Thomsen, J. Gen. Microbiol. (1991) 137:579. An additional disulfide bridge can
be introduced, as discussed by Perry and Wetzel, Science (1984) 225:555; Pantoliano et al.,
Biochemistry (1987) 25:2077; Matsumura et al., Nature (1989) 342:291; Nishikawa et al.,
Protein Eng. (1990) 5:443; Takagi et al., J. Biol. Chem. (1990) 255:6874; Clarke et al.,
Biochemistry (1993) 32:4322; and Wakarchuk et al., Protein Eng. (1994) 7:1379. Metal
binding sites can be introduced, according to Toma et al., Biochemistry (1991) 30:97, and
Haezerbrouck et al., Protein Eng. (1993) 5:643. Substitutions with prolines in loops can be
made according to Masul et al., Appl. Env. Microbiol. (1994) 50:3579; and Hardy et al.,
FEBS Lett. 377:89.

Cysteine-depleted muteins are considered variants within the scope of the invention.
These variants can be constructed according to methods disclosed in U.S. Patent No.
4,959,314, which discloses substitution of cysteines with other amino acids, and methods
for assaying biological activity and effect of the substitution. Such methods are suitable for
proteins according to this invention that have cysteine residues suitable for such
substitutions, for example to eliminate disulfide bond formation.

Variants also include fragments of the polypeptides disclosed herein, particularly
biologically active fragments and/or fragments corresponding to functional domains.
Fragments of interest will typically be at least about 10 aa to at least about 15 aa in length,
usually at least about 50 aa in length, and can be as long as 300 aa in length or longer, but
will usually not exceed about 1000 aa in length, where the fragment will have a stretch of
amino acids that is identical to a polypeptide encoded by a polynucleotide having a
sequence of any of SEQ ID NOS: 1-844, or a homolog thereof.

The protein variants described herein are encoded by polynucleotides that are within
the scope of the invention. The genetic code can be used to select the appropriate codons to
construct the corresponding variants.
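
As a non-limiting illustration of using the genetic code to select codons, the sketch below back-translates a polypeptide (one-letter codes) into one possible coding sequence. The choice of a single codon per amino acid is an arbitrary assumption made for brevity; any synonymous codon of the standard genetic code would serve, and the function name is hypothetical.

```python
# One codon per amino acid from the standard genetic code.
# The particular synonymous codon chosen for each residue is an
# arbitrary assumption of this sketch.
CODON_FOR = {
    "A": "GCC", "R": "CGC", "N": "AAC", "D": "GAC", "C": "TGC",
    "Q": "CAG", "E": "GAG", "G": "GGC", "H": "CAC", "I": "ATC",
    "L": "CTG", "K": "AAG", "M": "ATG", "F": "TTC", "P": "CCC",
    "S": "AGC", "T": "ACC", "W": "TGG", "Y": "TAC", "V": "GTG",
}


def back_translate(protein):
    """Return one DNA sequence that encodes the given polypeptide."""
    return "".join(CODON_FOR[residue] for residue in protein)


# Hypothetical cysteine-depleted variant: Cys replaced by Ser.
parent = "MCS"
variant = parent.replace("C", "S")
variant_dna = back_translate(variant)
```

Here the hypothetical parent `MCS` and its variant `MSS` differ only at the codon for the substituted residue.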

VI. Computer-Related Embodiments 

In general, a library of polynucleotides is a collection of sequence information, which
information is provided in either biochemical form (e.g., as a collection of polynucleotide
molecules), or in electronic form (e.g., as a collection of polynucleotide sequences stored in a
computer-readable form, as in a computer system and/or as part of a computer program).
The sequence information of the polynucleotides can be used in a variety of ways, e.g., as a
resource for gene discovery, as a representation of sequences expressed in a selected cell
type (e.g., cell type markers), and/or as markers of a given disease or disease state. In
general, a disease marker is a representation of a gene product that is present in all cells
affected by disease either at an increased or decreased level relative to a normal cell (e.g., a
cell of the same or similar type that is not substantially affected by disease). For example, a
polynucleotide sequence in a library can be a polynucleotide that represents an mRNA,
polypeptide, or other gene product encoded by the polynucleotide, that is either
overexpressed or underexpressed in a breast ductal cell affected by cancer relative to a
normal (i.e., substantially disease-free) breast cell.

The nucleotide sequence information of the library can be embodied in any suitable
form, e.g., electronic or biochemical forms. For example, a library of sequence information
embodied in electronic form includes an accessible computer data file (or, in biochemical
form, a collection of nucleic acid molecules) that contains the representative nucleotide
sequences of genes that are differentially expressed (e.g., overexpressed or underexpressed)
as between, for example, i) a cancerous cell and a normal cell; ii) a cancerous cell and a
dysplastic cell; iii) a cancerous cell and a cell affected by a disease or condition other than
cancer; iv) a metastatic cancerous cell and a normal cell and/or non-metastatic cancerous
cell; v) a malignant cancerous cell and a non-malignant cancerous cell (or a normal cell);
and/or vi) a dysplastic cell relative to a normal cell. Other combinations and comparisons of
cells affected by various diseases or stages of disease will be readily apparent to the
ordinarily skilled artisan. Biochemical embodiments of the library include a collection of
nucleic acids that have the sequences of the genes in the library, where the nucleic acids can
correspond to the entire gene in the library or to a fragment thereof, as described in greater
detail below.

The polynucleotide libraries of the subject invention include sequence information of 
a plurality of polynucleotide sequences, where at least one of the polynucleotides has a 
sequence of any of SEQ ID NOS: 1-844. By plurality is meant at least 2, usually at least 3 
and can include up to all of SEQ ID NOS: 1-844. The length and number of polynucleotides
in the library will vary with the nature of the library, e.g., if the library is an oligonucleotide
array, a cDNA array, a computer database of the sequence information, etc. 

Where the library is an electronic library, the nucleic acid sequence information can
be present in a variety of media. "Media" refers to a manufacture, other than an isolated
nucleic acid molecule, that contains the sequence information of the present invention. Such
a manufacture provides the genome sequence or a subset thereof in a form that can be
examined by means not directly applicable to the sequence as it exists in a nucleic acid. For
example, the nucleotide sequence of the present invention, e.g., the nucleic acid sequences of
any of the polynucleotides of SEQ ID NOS: 1-844, can be recorded on computer readable
media, e.g., any medium that can be read and accessed directly by a computer. Such media
include, but are not limited to: magnetic storage media, such as a floppy disc, a hard disc
storage medium, and a magnetic tape; optical storage media such as CD-ROM; electrical
storage media such as RAM and ROM; and hybrids of these categories such as
magnetic/optical storage media. One of skill in the art can readily appreciate how any of the
presently known computer readable media can be used to create a manufacture
comprising a recording of the present sequence information. "Recorded" refers to a process
for storing information on a computer readable medium, using any such methods as known in
the art. Any convenient data storage structure can be chosen, based on the means used to
access the stored information. A variety of data processor programs and formats can be used
for storage, e.g., word processing text file, database format, etc. In addition to the sequence
information, electronic versions of the libraries of the invention can be provided in
conjunction or connection with other computer-readable information and/or other types of
computer-readable files (e.g., searchable files, executable files, etc., including, but not limited
to, for example, search program software, etc.).
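
One convenient, though by no means required, data storage structure for an electronic library is a plain text file in which each sequence is preceded by a header line. The sketch below records and reads back such a file in the widely used FASTA layout; the record names and sequences are hypothetical placeholders and are not sequences of the Sequence Listing.

```python
import io


def write_fasta(records, handle, width=60):
    """Record each named sequence as a '>' header line plus wrapped lines."""
    for name, sequence in records.items():
        handle.write(f">{name}\n")
        for i in range(0, len(sequence), width):
            handle.write(sequence[i:i + width] + "\n")


def read_fasta(handle):
    """Read FASTA-formatted text back into a name -> sequence mapping."""
    records = {}
    name = None
    for line in handle:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            name = line[1:]
            records[name] = ""
        elif name is not None:
            records[name] += line
    return records


# Hypothetical two-member library held in an in-memory text buffer.
library = {"hypothetical_seq_1": "ACGT" * 20, "hypothetical_seq_2": "GGCC"}
buffer = io.StringIO()
write_fasta(library, buffer)
buffer.seek(0)
restored = read_fasta(buffer)
```

The same functions accept any text handle, so an ordinary file opened on a magnetic or optical medium can be substituted for the in-memory buffer.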

By providing the nucleotide sequence in computer readable form, the information can
be accessed for a variety of purposes. Computer software to access sequence information is
publicly available. For example, the BLAST (Altschul et al., supra) and BLAZE (Brutlag
et al., Comp. Chem. (1993) 17:203) search algorithms on a Sybase system can be used to
identify open reading frames (ORFs) within the genome that contain homology to ORFs
from other organisms.

As used herein, "a computer-based system" refers to the hardware means, software
means, and data storage means used to analyze the nucleotide sequence information of the
present invention. The minimum hardware of the computer-based systems of the present
invention comprises a central processing unit (CPU), input means, output means, and data
storage means. A skilled artisan can readily appreciate that any one of the currently
available computer-based systems is suitable for use in the present invention. The data
storage means can comprise any manufacture comprising a recording of the present sequence
information as described above, or a memory access means that can access such a
manufacture.

"Search means" refers to one or more programs implemented on the computer-based
system, to compare a target sequence or target structural motif with the stored sequence
information. Search means are used to identify fragments or regions of the genome that
match a particular target sequence or target motif. A variety of algorithms are
publicly known and commercially available, e.g., MacPattern (EMBL), BLASTN and
BLASTX (NCBI). A "target sequence" can be any DNA or amino acid sequence of six or
more nucleotides or two or more amino acids, preferably from about 10 to 100 amino acids
or from about 30 to 300 nucleotide residues.
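
For illustration, a search means in its simplest form can be an exact-match scan of the stored sequences for a target sequence. The sketch below is not BLASTN or any other named program; it is a deliberately naive stand-in with hypothetical names, and it enforces the six-nucleotide minimum stated above for a DNA target.

```python
def find_target(library, target, min_length=6):
    """Return (name, position) for every exact occurrence of the target.

    `library` maps a sequence name to its nucleotide sequence. Positions
    are zero-based, and overlapping occurrences are all reported.
    """
    if len(target) < min_length:
        raise ValueError(f"target must be at least {min_length} nucleotides")
    hits = []
    for name, sequence in library.items():
        position = sequence.find(target)
        while position != -1:
            hits.append((name, position))
            position = sequence.find(target, position + 1)
    return hits
```

A production search means would normally tolerate mismatches and gaps; exact matching is used here only to keep the example short.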

A "target structural motif," or "target motif," refers to any rationally selected
sequence or combination of sequences in which the sequence(s) are chosen based on a
three-dimensional configuration that is formed upon the folding of the target motif, or on
consensus sequences of regulatory or active sites. There are a variety of target motifs known
in the art. Protein target motifs include, but are not limited to, enzyme active sites and signal
sequences. Nucleic acid target motifs include, but are not limited to, hairpin structures,
promoter sequences and other expression elements such as binding sites for transcription
factors.
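
A consensus-sequence motif of the kind mentioned above can be searched for by expanding degenerate positions into a regular expression. The following sketch assumes IUPAC nucleotide ambiguity codes (a subset is shown); the function name and the example consensus are illustrative assumptions only.

```python
import re

# Subset of the IUPAC nucleotide ambiguity codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "W": "[AT]", "S": "[CG]",
    "K": "[GT]", "M": "[AC]", "N": "[ACGT]",
}


def find_motif(sequence, consensus):
    """Return zero-based start positions where the consensus matches.

    A lookahead is used so that overlapping matches are also reported.
    """
    pattern = "".join(IUPAC[symbol] for symbol in consensus)
    return [m.start() for m in re.finditer(f"(?={pattern})", sequence)]
```

For instance, `find_motif(sequence, "TATAWAW")` scans for a TATA-like consensus in which W stands for A or T.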

A variety of structural formats for the input and output means can be used to input 
and output the information in the computer-based systems of the present invention. One 
format for an output means ranks fragments of the genome possessing varying degrees of
homology to a target sequence or target motif. Such presentation provides a skilled artisan 
with a ranking of sequences and identifies the degree of sequence similarity contained in the 
identified fragment. 
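
The ranked output format described above can be sketched as follows. Percent identity over the best ungapped window is used here as a simple stand-in for "degree of homology"; that scoring choice, and the function names, are assumptions of this example rather than a method prescribed by the text.

```python
def best_identity(sequence, target):
    """Highest fraction of matching positions over all ungapped windows."""
    n = len(target)
    best = 0.0
    for i in range(len(sequence) - n + 1):
        window = sequence[i:i + n]
        matches = sum(a == b for a, b in zip(window, target))
        best = max(best, matches / n)
    return best


def rank_by_identity(library, target):
    """Return (name, identity) pairs sorted from most to least similar."""
    scored = [(name, best_identity(seq, target)) for name, seq in library.items()]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)
```

Each entry of the returned list pairs a fragment name with its score, so the artisan sees both the ranking and the degree of similarity.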

A variety of comparing means can be used to compare a target sequence or target 
motif with the data storage means to identify sequence fragments of the genome. A skilled
artisan can readily recognize that any one of the publicly available homology search 
programs can be used as the search means for the computer based systems of the present 
invention. 

As discussed above, the "library" of the invention also encompasses biochemical
libraries of the polynucleotides of SEQ ID NOS: 1-844, e.g., collections of nucleic acids
representing the provided polynucleotides. The biochemical libraries can take a variety of
forms, e.g., a solution of cDNAs, a pattern of probe nucleic acids stably associated with a
surface of a solid support (i.e., an array) and the like. Of particular interest are nucleic acid
arrays in which one or more of SEQ ID NOS: 1-844 is represented on the array. By array is
meant an article of manufacture that has at least a substrate with at least two distinct
nucleic acid targets on one of its surfaces, where the number of distinct nucleic acids can be
considerably higher, typically being at least 10 nt, usually at least 20 nt and often at least 25
nt. A variety of different array formats have been developed and are known to those of skill
in the art, including those described in U.S. Patent Nos. 5,242,974; 5,384,261; 5,405,783;
5,412,087; 5,424,186; 5,429,807; 5,436,327; 5,445,934; 5,472,672; 5,527,681; 5,529,756;
5,545,531; 5,554,501; 5,556,752; 5,561,071; 5,599,895; 5,624,711; 5,639,603; 5,658,734; WO
93/17126; WO 95/11995; WO 95/35505; EP 742287; and EP 799897. The arrays of the
subject invention find use in a variety of applications, including gene expression analysis,
drug screening, mutation analysis and the like, as disclosed in the above-listed exemplary
patent documents.

In addition to the above nucleic acid libraries, analogous libraries of polypeptides are
also provided, where the polypeptides of the library will represent at least a
portion of the polypeptides encoded by SEQ ID NOS: 1-844.

VII. Utilities

A. Use of Polynucleotide Probes in Mapping, and in Tissue Profiling 
Polynucleotide probes, generally comprising at least 12 contiguous nucleotides of a
polynucleotide as shown in the Sequence Listing, are used for a variety of purposes, such as
chromosome mapping of the polynucleotide and detection of transcription levels. Additional
disclosure about preferred regions of the disclosed polynucleotide sequences is found in the
Examples. A probe that hybridizes specifically to a polynucleotide disclosed herein should
provide a detection signal at least 5-, 10-, or 20-fold higher than the background
hybridization provided with other unrelated sequences.
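
The two numerical criteria above, a minimum probe length of 12 contiguous nucleotides and a signal at least 5-, 10-, or 20-fold over background, lend themselves to a short sketch. The function names are hypothetical, and signal and background are assumed to be measured in the same arbitrary units.

```python
def candidate_probes(sequence, length=12):
    """Enumerate every contiguous window of the given length."""
    return [sequence[i:i + length] for i in range(len(sequence) - length + 1)]


def fold_over_background(signal, background):
    """Ratio of probe signal to background hybridization signal."""
    if background <= 0:
        raise ValueError("background must be positive")
    return signal / background


def is_specific(signal, background, min_fold=5.0):
    """True when the signal meets the chosen fold-over-background threshold."""
    return fold_over_background(signal, background) >= min_fold
```

Setting `min_fold` to 10.0 or 20.0 applies the more stringent thresholds mentioned in the text.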

Probes in Detection of Expression Levels. Nucleotide probes are used to detect
expression of a gene corresponding to the provided polynucleotide. The references describe
an example of a sandwich nucleotide hybridization assay. For example, in Northern blots,
mRNA is separated electrophoretically and contacted with a probe. A probe is detected as
hybridizing to an mRNA species of a particular size. The amount of hybridization is
quantitated to determine relative amounts of expression, for example under a particular
condition. Probes are also used to detect products of amplification by polymerase chain
reaction. The products of the reaction are hybridized to the probe and hybrids are detected.
Probes are used for in situ hybridization to cells to detect expression. Probes can also be
used in vivo for diagnostic detection of hybridizing sequences. Probes are typically labeled
with a radioactive isotope. Other types of detectable labels can be used such as
chromophores, fluors, and enzymes. Other examples of nucleotide hybridization assays are
described in WO92/02526 and U.S. Patent No. 5,124,246.

Alternatively, the polymerase chain reaction (PCR) is another means for detecting
small amounts of target nucleic acids (see, e.g., Mullis et al., Meth. Enzymol. (1987)
155:335; U.S. Patent No. 4,683,195; and U.S. Patent No. 4,683,202). Two primer
polynucleotides hybridize with the target nucleic acids and are used to prime the
reaction. The primers can be composed of sequence within or 3' and 5' to the
polynucleotides of the Sequence Listing. Alternatively, if the primers are 3' and 5' to these
polynucleotides, they need not hybridize to them or the complements. A thermostable
polymerase creates copies of target nucleic acids from the primers using the original target
nucleic acids as a template. After a large amount of target nucleic acids is generated by the
polymerase, it is detected by methods such as Southern blots. When using the Southern blot
method, the labeled probe will hybridize to a polynucleotide of the Sequence Listing or
complement.
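
The relationship between the two primers and the amplified product can be illustrated in silico. The sketch below assumes exact primer matches, a single-stranded template written 5' to 3', and a reverse primer that is likewise written 5' to 3' and therefore anneals to the complementary strand; all names are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(sequence):
    """Return the reverse complement of a DNA sequence."""
    return sequence.translate(COMPLEMENT)[::-1]


def amplicon(template, forward, reverse):
    """Return the region spanned by the primer pair, or None if absent.

    The forward primer must occur in the template, and the reverse
    complement of the reverse primer must occur downstream of it.
    """
    start = template.find(forward)
    if start == -1:
        return None
    downstream_site = reverse_complement(reverse)
    end = template.find(downstream_site, start + len(forward))
    if end == -1:
        return None
    return template[start:end + len(downstream_site)]
```

The returned product includes both primer sites, which mirrors the fact that the primers are incorporated into the amplified copies.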

Furthermore, mRNA or cDNA can be detected by traditional blotting techniques
described in Sambrook et al., "Molecular Cloning: A Laboratory Manual" (New York, Cold
Spring Harbor Laboratory, 1989). mRNA or cDNA generated from mRNA using a
polymerase enzyme can be purified and separated using gel electrophoresis. The nucleic
acids on the gel are then blotted onto a solid support, such as nitrocellulose. The solid
support is exposed to a labeled probe and then washed to remove any unhybridized probe.
Next, the duplexes containing the labeled probe are detected. Typically, the probe is labeled
with radioactivity.

Mapping. Polynucleotides of the present invention are used to identify a
chromosome on which the corresponding gene resides. Such mapping can be useful in
identifying the function of the polynucleotide-related gene by its proximity to other genes
with known function. Function can also be assigned to the polynucleotide-related gene when
particular syndromes or diseases map to the same chromosome. For example, use of
polynucleotide probes in identification and quantification of nucleic acid sequence
aberrations is described in U.S. Patent No. 5,783,387.

For example, fluorescence in situ hybridization (FISH) on normal metaphase spreads
facilitates comparative genomic hybridization to allow total genome assessment of changes
in relative copy number of DNA sequences. See Schwartz and Samad, Curr. Opin.
Biotechnol. (1994) 5:70; Kallioniemi et al., Sem. Cancer Biol. (1993) 4:41; Valdes et al.,
Methods in Molecular Biology (1997) 68:1, Boultwood, ed., Humana Press, Totowa, NJ.
Preparations of human metaphase chromosomes are prepared using standard cytogenetic
techniques from human primary tissues or cell lines. Nucleotide probes comprising at least
12 contiguous nucleotides selected from the nucleotide sequence shown in the Sequence
Listing are used to identify the corresponding chromosome. The nucleotide probes are
labeled, for example, with a radioactive, fluorescent, biotinylated, or chemiluminescent label,
and detected by well known methods appropriate for the particular label selected. Protocols
for hybridizing nucleotide probes to preparations of metaphase chromosomes are also well
known in the art. A nucleotide probe will hybridize specifically to nucleotide sequences in
the chromosome preparations that are complementary to the nucleotide sequence of the
probe.

Polynucleotides are mapped to particular chromosomes using, for example, radiation
hybrids or chromosome-specific hybrid panels. See Leach et al., Advances in Genetics,
(1995) 35:63-99; Walter et al., Nature Genetics (1994) 7:22; Walter and Goodfellow, Trends
in Genetics (1992) 9:352. Panels for radiation hybrid mapping are available from Research
Genetics, Inc., Huntsville, Alabama, USA. Databases for markers using various panels are
available via the world wide web at http://shgc-www.stanford.edu; and
http://www-genome.wi.mit.edu/cgi-bin/contig/rhmapper.pl. The statistical program RHMAP
can be used to construct a map based on the data from radiation hybridization with a measure
of the relative likelihood of one order versus another. RHMAP is available via the world wide
web at http://www.sph.umich.edu/group/statgen/software.

In addition, commercial programs are available for identifying regions of 
chromosomes commonly associated with disease, such as cancer. Polynucleotides based on 
the polynucleotides of the invention can be used to probe these regions. For example, if 
through profile searching a provided polynucleotide is identified as corresponding to a gene
encoding a kinase, its ability to bind to a cancer-related chromosomal region will suggest its
role as a kinase in one or more stages of tumor cell development/growth. Although some 
experimentation would be required to elucidate the role, the polynucleotide constitutes a new 
material for isolating a specific protein that has potential for developing a cancer diagnostic 
or therapeutic. 

Tissue Typing or Profiling. Expression of specific mRNA corresponding to the
provided polynucleotides can vary in different cell types and can be tissue-specific. This
variation of mRNA levels in different cell types can be exploited with nucleic acid probe
assays to determine tissue types. For example, PCR, branched DNA probe assays, or
blotting techniques utilizing nucleic acid probes substantially identical or complementary to
polynucleotides listed in the Sequence Listing can determine the presence or absence of the
corresponding cDNA or mRNA.

For example, a metastatic lesion is identified by its developmental organ or tissue
source by identifying the expression of a particular marker of that organ or tissue. If a
polynucleotide is expressed only in a specific tissue type, and a metastatic lesion is found to
express that polynucleotide, then the developmental source of the lesion has been identified.
Expression of a particular polynucleotide is assayed by detection of either the corresponding
mRNA or the protein product. Immunological methods, such as antibody staining, are used
to detect a particular protein product. Hybridization methods can be used to detect particular
mRNA species, including but not limited to in situ hybridization and Northern blotting.

Use of Polymorphisms. A polynucleotide of the invention will be useful in forensics,
genetic analysis, mapping, and diagnostic applications if the corresponding region of a gene
is polymorphic in the human population. Particular polymorphic forms of the provided
polynucleotides can be used to either identify a sample as deriving from a suspect or rule out
the possibility that the sample derives from the suspect. Any means for detecting a
polymorphism in a gene are used, including but not limited to electrophoresis of protein
polymorphic variants, differential sensitivity to restriction enzyme cleavage, and
hybridization to allele-specific probes.
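
Differential sensitivity to restriction enzyme cleavage can be illustrated with a simulated digest. The sketch below uses the EcoRI recognition site GAATTC, which is cut after the first G; the two allele sequences are hypothetical and differ by a single base within the site.

```python
def digest(sequence, site="GAATTC", cut_offset=1):
    """Return fragment lengths after cutting at every occurrence of the site.

    `cut_offset` is the number of bases from the start of the site to the
    cut position (1 for EcoRI, which cuts between G and AATTC).
    """
    lengths = []
    start = 0
    position = sequence.find(site)
    while position != -1:
        cut = position + cut_offset
        lengths.append(cut - start)
        start = cut
        position = sequence.find(site, position + 1)
    lengths.append(len(sequence) - start)
    return lengths


# Hypothetical alleles: allele_b carries a single-base change in the site.
allele_a = "AAAA" + "GAATTC" + "CCCC"
allele_b = "AAAA" + "GAGTTC" + "CCCC"
```

The allele that retains the site yields two fragments, whereas the altered allele remains uncut, so the two forms are distinguishable by fragment size.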
B. Antibody Production 

Expression products of a polynucleotide of the invention, the corresponding mRNA
or cDNA, or the corresponding complete gene are prepared and used for raising antibodies
for experimental, diagnostic, and therapeutic purposes. For polynucleotides to which a
corresponding gene has not been assigned, this provides an additional method of identifying
the corresponding gene. The polynucleotide or related cDNA is expressed as described
above, and antibodies are prepared. These antibodies are specific to an epitope on the
polypeptide encoded by the polynucleotide, and can precipitate or bind to the corresponding
native protein in a cell or tissue preparation or in a cell-free extract of an in vitro expression
system.

Immunogens for raising antibodies are prepared by mixing the polypeptides encoded
by the polynucleotides of the present invention with adjuvants. Alternatively, polypeptides
are made as fusion proteins to larger immunogenic proteins. Polypeptides are also
covalently linked to other larger immunogenic proteins, such as keyhole limpet hemocyanin.
Immunogens are typically administered intradermally, subcutaneously, or intramuscularly.
Immunogens are administered to experimental animals such as rabbits, sheep, and mice, to
generate antibodies. Optionally, the animal spleen cells are isolated and fused with myeloma
cells to form hybridomas which secrete monoclonal antibodies. Such methods are well
known in the art. According to another method known in the art, the selected polynucleotide
is administered directly, such as by intramuscular injection, and expressed in vivo. The
expressed protein generates a variety of protein-specific immune responses, including
production of antibodies, comparable to administration of the protein.

Preparations of polyclonal and monoclonal antibodies specific for polypeptides
encoded by a selected polynucleotide are made using standard methods known in the art.
The antibodies specifically bind to epitopes present in the polypeptides encoded by
polynucleotides disclosed in the Sequence Listing. Typically, at least 6, 8, 10, or 12
contiguous amino acids are required to form an epitope. However, epitopes which involve
non-contiguous amino acids may require more, for example at least 15, 25, or 50 amino
acids. A short sequence of a polynucleotide may then be unsuitable for use as an epitope to
raise antibodies for identifying the corresponding novel protein, because of the potential for
cross-reactivity with a known protein. However, the antibodies can be useful for other
purposes, particularly if they identify common structural features of a known protein and a
novel polypeptide encoded by a polynucleotide of the invention.

Antibodies that specifically bind to human polypeptides encoded by the provided
polynucleotides should provide a detection signal at least 5-, 10-, or 20-fold higher than a
detection signal provided with other proteins when used in Western blots or other
immunochemical assays. Preferably, antibodies that specifically bind polypeptides of the
invention do not bind to other proteins in immunochemical assays at detectable levels and
can immunoprecipitate the specific polypeptide from solution.

To test for the presence of serum antibodies to the polypeptide of the invention in a
human population, human antibodies are purified by methods well known in the art.
Preferably, the antibodies are affinity purified by passing antiserum over a column to which
the corresponding selected polypeptide or fusion protein is bound. The bound antibodies can
then be eluted from the column, for example using a buffer with a high salt concentration.

In addition to the antibodies discussed above, genetically engineered antibody
derivatives are made, such as single chain antibodies, according to methods well known in
the art.

C. Use of Polynucleotides to Construct Arrays for Diagnostics 

Polynucleotide arrays provide a high throughput technique that can assay a large
number of polynucleotide sequences in a sample. This technology can be used as a
diagnostic and as a tool to test for differential expression to determine function of an
encoded protein. Arrays can be created by spotting polynucleotide probes onto a substrate
(e.g., glass, nitrocellulose, etc.) in a two-dimensional matrix or array having bound probes.
The probes can be bound to the substrate by either covalent bonds or by non-specific
interactions, such as hydrophobic interactions. Samples of polynucleotides can be detectably
labeled (e.g., using radioactive or fluorescent labels) and then hybridized to the probes.
Double stranded polynucleotides, comprising the labeled sample polynucleotides bound to
probe polynucleotides, can be detected once the unbound portion of the sample is washed
away. Techniques for constructing arrays and methods of using these arrays are described in
EP No. 0 799 897; PCT No. WO 97/29212; PCT No. WO 97/27317; EP No. 0 785 280; PCT
No. WO 97/02357; U.S. Pat. No. 5,593,839; U.S. Pat. No. 5,578,832; EP No. 0 728 520;
U.S. Pat. No. 5,599,695; EP No. 0 721 016; U.S. Pat. No. 5,556,752; PCT No. WO
95/22058; and U.S. Pat. No. 5,631,734.

As discussed in some detail above, arrays can be used to examine differential
expression of genes and can be used to determine gene function. For example, arrays of the
instant polynucleotide sequences can be used to determine if any of the provided
polynucleotides are differentially expressed between a test cell and control cell (e.g., cancer
cells and normal cells). For example, high expression of a particular message in a cancer
cell, which is not observed in a corresponding normal cell, can indicate a cancer specific
protein. Exemplary uses of arrays are further described in, for example, Pappalarado et al.,
Sem. Radiation Oncol. (1998) 5:217; and Ramsay, Nature Biotechnol. (1998) 16:40.
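
A comparison of array signals between a test cell and a control cell is often summarized as a log2 fold change per probe. The sketch below is one simple way to do so; the pseudocount, the threshold, and the function names are assumptions of this example, and the input values are hypothetical intensities rather than measured data.

```python
import math


def log2_fold_changes(test, control, pseudocount=1.0):
    """Return log2(test / control) for each probe present in both samples.

    A pseudocount is added to both signals to avoid division by zero.
    """
    return {
        probe: math.log2((test[probe] + pseudocount) / (control[probe] + pseudocount))
        for probe in test
        if probe in control
    }


def differentially_expressed(test, control, threshold=1.0):
    """Keep probes whose absolute log2 fold change meets the threshold."""
    changes = log2_fold_changes(test, control)
    return {probe: lfc for probe, lfc in changes.items() if abs(lfc) >= threshold}
```

A threshold of 1.0 corresponds to a two-fold increase or decrease; positive values indicate higher signal in the test cell.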

D. Differential Expression 

The polynucleotides of the invention can also be used to detect differences in
expression levels between two cells, e.g., as a method to identify abnormal or diseased tissue
in a human. For polynucleotides corresponding to profiles of protein families as described
above, the choice of tissue can be selected according to the putative biological function. In
general, the expression of a gene corresponding to a specific polynucleotide is compared
between a first tissue that is suspected of being diseased and a second, normal tissue of the
human. The tissue suspected of being abnormal or diseased can be derived from a different
tissue type of the human, but preferably it is derived from the same tissue type; for example
an intestinal polyp or other abnormal growth should be compared with normal intestinal
tissue. The normal tissue can be the same tissue as that of the test sample, or any normal
tissue of the patient, especially those that express the polynucleotide-related gene of interest
(e.g., brain, thymus, testis, heart, prostate, placenta, spleen, small intestine, skeletal muscle,
pancreas, and the mucosal lining of the colon). A difference between the
polynucleotide-related gene, mRNA, or protein in the two tissues which are compared, for
example in molecular weight, amino acid or nucleotide sequence, or relative abundance,
indicates a change in the gene, or a gene which regulates it, in the tissue of the human that was
suspected of being diseased. Examples of detection of differential expression and its use in
diagnosis of cancer are described in U.S. Patent Nos. 5,688,641 and 5,677,125.

The polynucleotide-related genes in the two tissues are compared by any means
known in the art. For example, the two genes can be sequenced, and the sequence of the
gene in the tissue suspected of being diseased compared with the gene sequence in the
normal tissue. The genes corresponding to a provided polynucleotide, or portions thereof, in
the two tissues are amplified, for example using nucleotide primers based on the nucleotide
sequence shown in the Sequence Listing, using the polymerase chain reaction. The
amplified genes or portions of genes are hybridized to detectably labeled nucleotide probes
selected from a nucleotide sequence shown in the Sequence Listing. A difference in the
nucleotide sequence of the isolated gene in the tissue suspected of being diseased compared
with the normal nucleotide sequence suggests a role of the gene product encoded by the
subject polynucleotide in the disease, and provides guidance for preparing a therapeutic
agent.

Alternatively, mRNA corresponding to a provided polynucleotide in the two tissues
is compared. PolyA+ RNA is isolated from the two tissues as is known in the art. For
example, one of skill in the art can readily determine differences in the size or amount of
mRNA transcripts between the two tissues using Northern blots and detectably labeled
nucleotide probes selected from the nucleotide sequence shown in the Sequence Listing.
Increased or decreased expression of a given mRNA in a tissue sample suspected of being
diseased, compared with the expression of the same mRNA in a normal tissue, suggests that
the expressed protein has a role in the disease, and also provides a lead for preparing a
therapeutic agent.

The comparison can also be accomplished by analyzing polypeptides between the
matched samples. The sizes of the proteins in the two tissues are compared, for example,
using antibodies of the present invention to detect polypeptides in Western blots of protein
extracts from the two tissues. Other changes, such as expression levels and subcellular
localization, can also be detected immunologically, using antibodies to the corresponding
protein. A higher or lower level of expression of a given polypeptide in a tissue suspected of
being diseased, compared with the same protein expression level in a normal tissue, is
indicative that the expressed protein has a role in the disease, and provides guidance for
preparing a therapeutic agent.

15 Similarly, comparison of polynucleotide sequences or of gene expression products, 

e.g., mRNA and protein, between a human tissue that is suspected of being diseased and a 
normal tissue of a human, are used to follow disease progression or remission in the human. 
Such comparisons are made as described above. For example, increased or decreased 
expression of a gene corresponding to an inventive polynucleotide in the tissue suspected of 

20 being neoplastic can indicate the presence of neoplastic cells in the tissue. The degree of 
increased expression of a given gene in the neoplastic tissue relative to expression of the 
same gene in normal tissue, or differences in the amount of increased expression of a given 
gene in the neoplastic tissue over time, is used to assess the progression of the neoplasia in 
that tissue or to monitor the response of the neoplastic tissue to a therapeutic protocol over 

time.

The expression pattern of any two cell types can be compared, such as low and high 
metastatic tumor cell lines, malignant or non-malignant cells, or cells from tissue which have 
and have not been exposed to a therapeutic agent. A genetic predisposition to disease in a
human is detected by comparing expression levels of an mRNA or protein corresponding to
a polynucleotide of the invention in a fetal tissue with levels associated with normal fetal
tissue. Fetal tissues that are used for this purpose include, but are not limited to, amniotic
fluid, chorionic villi, blood, and the blastomere of an in vitro-fertilized embryo. The

comparable normal polynucleotide-related gene is obtained from any tissue. The mRNA or 

protein is obtained from a normal tissue of a human in which the polynucleotide-related gene 

is expressed. Differences such as alterations in the nucleotide sequence or size of the same 

product of the fetal polynucleotide-related gene or mRNA, or alterations in the molecular

weight, amino acid sequence, or relative abundance of fetal protein, can indicate a germline 

mutation in the polynucleotide-related gene of the fetus, which indicates a genetic 

predisposition to disease. Particular diagnostic and prognostic uses of the disclosed 

polynucleotides are described in more detail below. 

E. Diagnostic, Prognostic, and Other Uses Based On Differential Expression

In general, diagnostic methods of the invention involve detection of a level or
amount of a gene product, particularly a differentially expressed gene product, in a test
sample obtained from a patient suspected of having or being susceptible to a disease (e.g.,
breast cancer, lung cancer, colon cancer and/or metastatic forms thereof), and comparing the
detected levels to those levels found in normal cells (e.g., cells substantially unaffected by
cancer) and/or other control cells (e.g., to differentiate a cancerous cell from a cell affected
by dysplasia). Furthermore, the severity of the disease can be assessed by comparing the
detected levels of a differentially expressed gene product with those levels detected in
samples representing the levels of differentially expressed gene product associated with varying
degrees of severity of disease.

The term "differentially expressed gene" is intended to encompass a polynucleotide 
that can, for example, include an open reading frame encoding a gene product (e.g., a 
polypeptide), and/or introns of such genes and adjacent 5' and 3' non-coding nucleotide 
sequences involved in the regulation of expression, up to about 20 kb beyond the coding
region, but possibly farther in either direction. The gene can be introduced into an

appropriate vector for extrachromosomal maintenance or for integration into a host genome. 
In general, a difference in expression level associated with a decrease in expression level of 
at least about 25%, usually at least about 50% to 75%, more usually at least about 90% or 
more is indicative of a differentially expressed gene of interest, i.e., a gene that is
underexpressed or down-regulated in the test sample relative to a control sample.
Furthermore, a difference in expression level associated with an increase in expression of at
least about 25%, usually at least about 50% to 75%, more usually at least about 90% and can
be at least about 1½-fold, usually at least about 2-fold to about 10-fold, and can be about
100-fold to about 1,000-fold increase relative to a control sample is indicative of a
differentially expressed gene of interest, i.e., an overexpressed or up-regulated gene.
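
By way of non-limiting illustration only, the percentage thresholds described above can be sketched in Python as follows. The function name `classify_expression` and the default cutoff of 25% are assumptions chosen for this example; any of the other thresholds recited above (e.g., 50%, 90%, or a fold change) could be substituted.

```python
def classify_expression(test_level, control_level, min_change=0.25):
    """Label a gene product as over-, under-, or not differentially expressed.

    min_change is the fractional change relative to the control
    (0.25 corresponds to the "at least about 25%" threshold).
    This helper is hypothetical and for illustration only.
    """
    if control_level <= 0:
        raise ValueError("control_level must be positive")
    # Fractional change of the test sample relative to the control sample.
    change = (test_level - control_level) / control_level
    if change >= min_change:
        return "overexpressed"
    if change <= -min_change:
        return "underexpressed"
    return "not differentially expressed"
```

For example, a test level of 200 against a control level of 100 is a 100% increase and would be labeled overexpressed under the assumed 25% cutoff.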

"Differentially expressed polynucleotide" as used herein means a nucleic acid

molecule (RNA or DNA) having a sequence that represents a differentially expressed gene, 

e.g., the differentially expressed polynucleotide comprises a sequence (e.g., an open reading 

frame encoding a gene product) that uniquely identifies a differentially expressed gene so 

that detection of the differentially expressed polynucleotide in a sample is correlated with the 

presence of a differentially expressed gene in a sample. "Differentially expressed

polynucleotides" is also meant to encompass fragments of the disclosed polynucleotides, 
e.g., fragments retaining biological activity, as well as nucleic acids homologous, 
substantially similar, or substantially identical (e.g., having about 90% sequence identity) to 
the disclosed polynucleotides. 

Methods of the subject invention useful in diagnosis or prognosis typically involve

comparison of the abundance of a selected differentially expressed gene product in a sample 
of interest with that of a control to determine any relative differences in the expression of the 
gene product, where the difference can be measured qualitatively and/or quantitatively. 
Quantitation can be accomplished, for example, by comparing the level of expression 

product detected in the sample with the amounts of product present in a standard curve. A
comparison can be made visually; by using a technique such as densitometry, with or
without computerized assistance; by preparing a representative library of cDNA clones of
mRNA isolated from a test sample, sequencing the clones in the library to determine the
number of cDNA clones corresponding to the same gene product, and analyzing the number
of clones corresponding to that same gene product relative to the number of clones of the
same gene product in a control sample; or by using an array to detect relative levels of 
hybridization to a selected sequence or set of sequences, and comparing the hybridization 
pattern to that of a control. The differences in expression are then correlated with the 
presence or absence of an abnormal expression pattern. A variety of different methods for 

determining the nucleic acid abundance in a sample are known to those of skill in the art,
where particular methods of interest include those described in: Pietu et al., Genome Res.
(1996) 6:492; Zhao et al., Gene (1995) 156:207; Soares, Curr. Opin. Biotechnol. (1997) 8:542;
Raval, J. Pharmacol. Toxicol. Methods (1994) 32:125; Chalifour et al., Anal. Biochem.
(1994) 216:299; Stolz et al., Mol. Biotechnol. (1996) 6:225; Hong et al., Biosci. Reports
(1982) 2:907; and McGraw, Anal. Biochem. (1984) 143:298. Also of interest are the
methods disclosed in WO 97/27317, the disclosure of which is herein incorporated by
reference.
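
As a non-limiting sketch of quantitation against a standard curve, the following Python function estimates an amount of product from a measured signal by linear interpolation between standards. The function name, the use of piecewise linear interpolation, and the example values are assumptions made for illustration; other curve-fitting approaches could equally be used.

```python
def quantify_from_standard_curve(signal, standards):
    """Estimate the amount of product from a measured signal.

    standards: list of (known_amount, measured_signal) pairs making up
    the standard curve. Linear interpolation between neighboring
    standards is an assumption of this sketch.
    """
    # Order the standards by signal so neighbors bracket the measurement.
    points = sorted(standards, key=lambda p: p[1])
    for (a0, s0), (a1, s1) in zip(points, points[1:]):
        if s0 <= signal <= s1:
            if s1 == s0:
                return a0
            return a0 + (a1 - a0) * (signal - s0) / (s1 - s0)
    raise ValueError("signal lies outside the range of the standard curve")
```

With hypothetical standards of 0, 10, and 20 units giving signals of 0, 100, and 200, a sample signal of 150 interpolates to 15 units.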

In general, diagnostic assays of the invention involve detection of a gene product of
the polynucleotide sequence (e.g., mRNA or polypeptide) that corresponds to a sequence of
SEQ ID NOS: 1-844. The patient from whom the sample is obtained can be apparently
healthy, susceptible to disease (e.g., as determined by family history or exposure to certain
environmental factors), or can already be identified as having a condition in which altered 
expression of a gene product of the invention is implicated. 

In the assays of the invention, the diagnosis can be determined based on detected 
gene product expression levels of a gene product encoded by at least one, preferably at least 

two or more, at least 3 or more, or at least 4 or more of the polynucleotides having a

sequence set forth in SEQ ID NOS: 1-844, and can involve detection of expression of genes 
corresponding to all of SEQ ID NOS: 1-844 and/or additional sequences that can serve as 
additional diagnostic markers and/or reference sequences. Where the diagnostic method is 
designed to detect the presence or susceptibility of a patient to cancer, the assay preferably 

involves detection of a gene product encoded by a gene corresponding to a polynucleotide
that is differentially expressed in cancer. For example, a higher level of expression of a 
polynucleotide corresponding to SEQ ID NO:52 relative to a level associated with a normal 
sample can indicate the presence of cancer in the patient from whom the sample is derived. 
In another example, detection of a lower level of a polynucleotide corresponding to SEQ ID 

NO:39 relative to a normal level is indicative of the presence of cancer in the patient.
Further examples of such differentially expressed polynucleotides are described in the 
Examples below. Given the provided polynucleotides and information regarding their 
relative expression levels provided herein, assays using such polynucleotides and detection 
of their expression levels in diagnosis and prognosis will be readily apparent to the ordinarily 

skilled artisan.

Any of a variety of detectable labels can be used in connection with the various
embodiments of the diagnostic methods of the invention. Suitable detectable labels include
fluorochromes (e.g., fluorescein isothiocyanate (FITC), rhodamine, Texas Red,
phycoerythrin, allophycocyanin, 6-carboxyfluorescein (6-FAM),
2',7'-dimethoxy-4',5'-dichloro-6-carboxyfluorescein (JOE), 6-carboxy-X-rhodamine (ROX),
6-carboxy-2',4',7',4,7-hexachlorofluorescein (HEX), 5-carboxyfluorescein (5-FAM), or
N,N,N',N'-tetramethyl-6-carboxyrhodamine (TAMRA)), radioactive labels (e.g., ³²P, ³⁵S, ³H, etc.), and
the like. The detectable label can involve a two stage system (e.g., biotin-avidin,
hapten-anti-hapten antibody, etc.).

Reagents specific for the polynucleotides and polypeptides of the invention, such as

antibodies and nucleotide probes, can be supplied in a kit for detecting the presence of an 
expression product in a biological sample. The kit can also contain buffers or labeling 
components, as well as instructions for using the reagents to detect and quantify expression 
products in the biological sample. Exemplary embodiments of the diagnostic methods of the 

invention are described below in more detail.

Polypeptide detection in diagnosis. In one embodiment, the test sample is assayed 
for the level of a differentially expressed polypeptide. Diagnosis can be accomplished using 
any of a number of methods to determine the absence or presence or altered amounts of the 
differentially expressed polypeptide in the test sample. For example, detection can utilize 

staining of cells or histological sections with labeled antibodies, performed in accordance
with conventional methods. Cells can be permeabilized to stain cytoplasmic molecules. In 
general, antibodies that specifically bind a differentially expressed polypeptide of the 
invention are added to a sample, and incubated for a period of time sufficient to allow 
binding to the epitope, usually at least about 10 minutes. The antibody can be detectably 

labeled for direct detection (e.g., using radioisotopes, enzymes, fluorescers,

chemiluminescers, and the like), or can be used in conjunction with a second stage antibody 
or reagent to detect binding (e.g., biotin with horseradish peroxidase-conjugated avidin, a 
secondary antibody conjugated to a fluorescent compound, e.g. fluorescein, rhodamine, 
Texas red, etc.). The absence or presence of antibody binding can be determined by various 

methods, including flow cytometry of dissociated cells, microscopy, radiography,
scintillation counting, etc. Any suitable alternative methods of qualitative or quantitative
detection of levels or amounts of differentially expressed polypeptide can be used, for
example ELISA, western blot, immunoprecipitation, radioimmunoassay, etc.

In general, the detected level of differentially expressed polypeptide in the test 

sample is compared to a level of the differentially expressed gene product in a reference or 

control sample, e.g., in a normal cell (negative control) or in a cell having a known disease

state (positive control). For example, a higher level of expression of a polypeptide encoded 

by SEQ ID NO:52 relative to a level associated with a normal sample can indicate the 

presence of cancer in the patient from whom the sample is derived. In another example, 

detection of a lower level of the polypeptide encoded by SEQ ID NO:39 relative to a normal 

level is indicative of the presence of cancer in the patient.

mRNA detection. The diagnostic methods of the invention can also or alternatively
involve detection of mRNA encoded by a gene corresponding to a differentially expressed
polynucleotide of the invention. Any suitable qualitative or quantitative methods known in
the art for detecting specific mRNAs can be used. mRNA can be detected by, for example, 

in situ hybridization in tissue sections, by reverse transcriptase-PCR, or in Northern blots
containing poly A+ mRNA. One of skill in the art can readily use these methods to 
determine differences in the size or amount of mRNA transcripts between two samples. For 
example, the level of mRNA of the invention in a tissue sample suspected of being 
cancerous or dysplastic is compared with the expression of the mRNA in a reference sample, 

e.g., a positive or negative control sample (e.g., normal tissue, cancerous tissue, etc.). In a
specific non-limiting example, a higher level of mRNA corresponding to SEQ ID NO:52 
relative to a level associated with a normal sample can indicate the presence of cancer in the 
patient from whom the sample is derived. In another example, detection of a lower level of 
mRNA corresponding to SEQ ID NO:39 relative to a normal level is indicative of the 

presence of cancer in the patient.

Any suitable method for detecting and comparing mRNA expression levels in a 
sample can be used in connection with the diagnostic methods of the invention (see, e.g., 
U.S. 5,804,382). For example, mRNA expression levels in a sample can be determined by 
generation of a library of expressed sequence tags (ESTs) from the sample, where the EST 

library is representative of sequences present in the sample (Adams, et al., (1991) Science
252:1651). Enumeration of the relative representation of ESTs within the library can be used
to approximate the relative representation of the gene transcript within the starting sample.

The results of EST analysis of a test sample can then be compared to EST analysis of a 

reference sample to determine the relative expression levels of a selected polynucleotide, 

particularly a polynucleotide corresponding to one or more of the differentially expressed 

genes described herein.
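
The EST enumeration described above can be sketched, purely by way of illustration, as follows. The function names and the representation of an EST library as a list of gene identifiers (one entry per sequenced EST) are assumptions of this example and are not part of the disclosed methods.

```python
from collections import Counter


def relative_representation(est_gene_ids):
    """Return the fraction of ESTs in a library assigned to each gene.

    est_gene_ids: one gene identifier per sequenced EST (hypothetical input).
    """
    counts = Counter(est_gene_ids)
    total = sum(counts.values())
    return {gene: n / total for gene, n in counts.items()}


def expression_ratio(test_ests, reference_ests, gene):
    """Ratio of a gene's relative representation in test vs. reference library.

    Returns None when the gene is absent from the reference library,
    since the ratio is undefined in that case.
    """
    test = relative_representation(test_ests).get(gene, 0.0)
    ref = relative_representation(reference_ests).get(gene, 0.0)
    if ref == 0.0:
        return None
    return test / ref
```

A ratio above 1 suggests greater relative representation of the transcript in the test sample, subject to the sampling depth of each library.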

Alternatively, gene expression in a test sample can be analyzed using serial analysis
of gene expression (SAGE) methodology (Velculescu et al., Science (1995) 270:484). In
short, SAGE involves the isolation of short unique sequence tags from a specific location
within each transcript (e.g., a sequence of any one of SEQ ID NOS:1-6). The sequence tags
are concatenated, cloned, and sequenced. The frequency of particular transcripts within the
starting sample is reflected by the number of times the associated sequence tag is
encountered within the sequence population.
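
A simplified, non-limiting sketch of tag counting from a sequenced concatemer is given below. It assumes, for illustration only, that each tag is the 10 bases immediately following an anchoring site `CATG`; it ignores the ditag structure and tag orientation used in practice, and the function name is hypothetical.

```python
from collections import Counter


def count_sage_tags(concatemer, anchor="CATG", tag_length=10):
    """Count fixed-length tags that follow each anchoring site.

    Simplifying assumptions: tags are read in one orientation only and
    each tag directly follows an occurrence of the anchor sequence.
    """
    tags = Counter()
    start = concatemer.find(anchor)
    while start != -1:
        tag_start = start + len(anchor)
        tag = concatemer[tag_start:tag_start + tag_length]
        # Skip truncated tags at the end of the read.
        if len(tag) == tag_length:
            tags[tag] += 1
        start = concatemer.find(anchor, start + 1)
    return tags
```

The resulting counts can then be compared between a test sample and a reference sample in the same manner as EST counts.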

Gene expression in a test sample can also be analyzed using differential display (DD) 
methodology. In DD, fragments defined by specific sequence delimiters (e.g., restriction 

enzyme sites) are used as unique identifiers of genes, coupled with information about
fragment length or fragment location within the expressed gene. The relative representation
of an expressed gene within a sample can then be estimated based on the relative
representation of the fragment associated with that gene within the pool of all possible 
fragments. Methods and compositions for carrying out DD are well known in the art, see, 

e.g., U.S. 5,776,683; and U.S. 5,807,680.

Alternatively, gene expression in a sample can be analyzed using hybridization analysis, which is
based on the specificity of nucleotide interactions. Oligonucleotides or cDNA can be used to 
selectively identify or capture DNA or RNA of specific sequence composition, and the 
amount of RNA or cDNA hybridized to a known capture sequence determined qualitatively 

or quantitatively, to provide information about the relative representation of a particular
message within the pool of cellular messages in a sample. Hybridization analysis can be 
designed to allow for concurrent screening of the relative expression of hundreds to 
thousands of genes by using, for example, array-based technologies having high density 
formats, including filters, microscope slides, or microchips, or solution-based technologies 

that use spectroscopic analysis (e.g., mass spectrometry). One exemplary use of arrays in the
diagnostic methods of the invention is described below in more detail.

Use of a single gene in diagnostic applications. The diagnostic methods of the
invention can focus on the expression of a single differentially expressed gene. For example,
the diagnostic method can involve detecting a differentially expressed gene, or a
polymorphism of such a gene (e.g., a polymorphism in a coding region or control region),
that is associated with disease. Disease-associated polymorphisms can include deletion or

truncation of the gene, mutations that alter expression level and/or affect activity of the 

encoded protein, etc. 

Changes in the promoter or enhancer sequence that affect expression levels of a
differentially expressed gene can be compared to expression levels of the normal allele by various
methods known in the art. Methods for determining promoter or enhancer strength include
quantitation of the expressed natural protein; insertion of the variant control element into a
vector with a reporter gene such as β-galactosidase, luciferase, chloramphenicol
acetyltransferase, etc. that provides for convenient quantitation; and the like. 

A number of methods are available for analyzing nucleic acids for the presence of a 

specific sequence, e.g., a disease-associated polymorphism. Where large amounts of DNA
are available, genomic DNA is used directly. Alternatively, the region of interest is cloned 
into a suitable vector and grown in sufficient quantity for analysis. Cells that express a 
differentially expressed gene can be used as a source of mRNA, which can be assayed 
directly or reverse transcribed into cDNA for analysis. The nucleic acid can be amplified by 

conventional techniques, such as the polymerase chain reaction (PCR), to provide sufficient
amounts for analysis, and a detectable label can be included in the amplification reaction
(e.g., using a detectably labeled primer or detectably labeled oligonucleotides) to facilitate
detection. The use of the polymerase chain reaction is described in Saiki, et al., Science
(1985) 239:487, and a review of techniques can be found in Sambrook, et al., Molecular
Cloning: A Laboratory Manual, (1989) pp. 14.2. Alternatively, various methods are known
in the art that utilize oligonucleotide ligation as a means of detecting polymorphisms, for
examples see Riley et al., Nucl. Acids Res. (1990) 18:2887; and Delahunty et al., Am. J.
Hum. Genet. (1996) 58:1239.

The sample nucleic acid, e.g. amplified or cloned fragment, is analyzed by one of a 

number of methods known in the art. The nucleic acid can be sequenced by dideoxy or other
methods, and the sequence of bases compared to a selected sequence, e.g., to a wild-type
sequence. Hybridization with the polymorphic or variant sequence can also be used to

determine its presence in a sample (e.g., by Southern blot, dot blot, etc.). The hybridization 

pattern of a polymorphic or variant sequence and a control sequence to an array of 

oligonucleotide probes immobilized on a solid support, as described in US 5,445,934, or in 

WO 95/35505, can also be used as a means of identifying polymorphic or variant sequences

associated with disease. Single strand conformational polymorphism (SSCP) analysis, 

denaturing gradient gel electrophoresis (DGGE), and heteroduplex analysis in gel matrices 

are used to detect conformational changes created by DNA sequence variation as alterations 

in electrophoretic mobility. Alternatively, where a polymorphism creates or destroys a 

recognition site for a restriction endonuclease, the sample is digested with that endonuclease,

and the products size fractionated to determine whether the fragment was digested. 

Fractionation is performed by gel or capillary electrophoresis, particularly acrylamide or 

agarose gels. 
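
The restriction-site approach described above can be illustrated, without limitation, by the following sketch, which predicts the fragment lengths expected when a sequence is cut at each occurrence of a recognition site. The function name, the single-strand search, and the example sequences are assumptions made for illustration; the example uses the EcoRI site GAATTC, which is cut after the first G.

```python
def digest_fragment_lengths(sequence, site, cut_offset=0):
    """Return the fragment lengths produced by cutting at every site.

    cut_offset is the position of the cut within the recognition site
    (e.g., 1 for G^AATTC). Only the given strand is searched, which is
    sufficient for palindromic sites such as GAATTC.
    """
    cuts = []
    pos = sequence.find(site)
    while pos != -1:
        cuts.append(pos + cut_offset)
        pos = sequence.find(site, pos + 1)
    # Fragment boundaries run from the start, through each cut, to the end.
    bounds = [0] + cuts + [len(sequence)]
    return [b - a for a, b in zip(bounds, bounds[1:]) if b > a]
```

A variant that destroys the site yields a single undigested fragment, whereas the sequence retaining the site yields two smaller fragments that can be resolved by electrophoresis.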

Screening for mutations in a differentially expressed gene can be based on the
functional or antigenic characteristics of the protein. Protein truncation assays are useful in
detecting deletions that can affect the biological activity of the protein. Various 
immunoassays designed to detect polymorphisms in proteins can be used in screening. 
Where many diverse genetic mutations lead to a particular disease phenotype, functional 
protein assays have proven to be effective screening tools. The activity of the encoded 

protein can be determined by comparison with the wild-type protein.

Pattern matching in diagnosis using arrays. In another embodiment, the diagnostic 
and/or prognostic methods of the invention involve detection of expression of a selected set 
of genes in a test sample to produce a test expression pattern (TEP). The TEP is compared to 
a reference expression pattern (REP), which is generated by detection of expression of the 

selected set of genes in a reference sample (e.g., a positive or negative control sample). The
selected set of genes includes at least one of the genes of the invention, which genes
correspond to the polynucleotide sequences of SEQ ID NOS: 1-844. Of particular interest is
a selected set of genes that includes genes differentially expressed in the disease for which the
test sample is to be screened. 

"Reference sequences" or "reference polynucleotides" as used herein in the context of
differential gene expression analysis and diagnosis/prognosis refer to a selected set of
polynucleotides, which selected set includes at least one or more of the differentially

expressed polynucleotides described herein. A plurality of reference sequences, preferably 

comprising positive and negative control sequences, can be included as reference sequences. 

Additional suitable reference sequences are found in Genbank, Unigene, and other 

nucleotide sequence databases (including, e.g., expressed sequence tag (EST), partial, and

full-length sequences). 

"Reference array" means an array having reference sequences for use in hybridization 

with a sample, where the reference sequences include all, at least one of, or any subset of the 

differentially expressed polynucleotides described herein. Usually such an array will include 

at least 3 different reference sequences, and can include any one or all of the provided
differentially expressed sequences. Arrays of interest can further comprise sequences, 
including polymorphisms, of other genetic sequences, particularly other sequences of interest 
for screening for a disease or disorder (e.g., cancer, dysplasia, or other related or unrelated 
diseases, disorders, or conditions). The oligonucleotide sequence on the array will usually 

be at least about 12 nt in length, and can be of about the length of the provided sequences, or
can extend into the flanking regions to generate fragments of 100 nt to 200 nt in length or 
more. 

A "reference expression pattern" or "REP" as used herein refers to the relative levels 
of expression of a selected set of genes, particularly of differentially expressed genes, that is 
associated with a selected cell type, e.g., a normal cell, a cancerous cell, a cell exposed to an
environmental stimulus, and the like. A "test expression pattern" or "TEP" refers to relative 
levels of expression of a selected set of genes, particularly of differentially expressed genes, 
in a test sample (e.g., a cell of unknown or suspected disease state, from which mRNA is 
isolated). 

"Diagnosis" as used herein generally includes determination of a subject's
susceptibility to a disease or disorder, determination as to whether a subject is presently
affected by a disease or disorder, as well as the prognosis of a subject affected by a disease
or disorder (e.g., identification of pre-metastatic or metastatic cancerous states, stages of
cancer, or responsiveness of cancer to therapy). The present invention particularly
encompasses diagnosis of subjects in the context of breast cancer (e.g., carcinoma in situ
(e.g., ductal carcinoma in situ), estrogen receptor (ER)-positive breast cancer, ER-negative
breast cancer, or other forms and/or stages of breast cancer), lung cancer (e.g., small cell

carcinoma, non-small cell carcinoma, mesothelioma, and other forms and/or stages of lung 

cancer), and colon cancer (e.g., adenomatous polyp, colorectal carcinoma, and other forms 

and/or stages of colon cancer). 

"Sample" or "biological sample" as used throughout herein are generally meant to refer

to samples of biological fluids or tissues, particularly samples obtained from tissues, 

especially from cells of the type associated with the disease for which the diagnostic 

application is designed (e.g., ductal adenocarcinoma), and the like. "Samples" is also meant 

to encompass derivatives and fractions of such samples (e.g., cell lysates). Where the sample 

is solid tissue, the cells of the tissue can be dissociated or tissue sections can be analyzed.

REPs can be generated in a variety of ways according to methods well known in the 
art. For example, REPs can be generated by hybridizing a control sample to an array having 
a selected set of polynucleotides (particularly a selected set of differentially expressed 
polynucleotides), acquiring the hybridization data from the array, and storing the data in a 

format that allows for ready comparison of the REP with a TEP. Alternatively, all expressed
sequences in a control sample can be isolated and sequenced, e.g., by isolating mRNA from 
a control sample, converting the mRNA into cDNA, and sequencing the cDNA. The 
resulting sequence information roughly or precisely reflects the identity and relative number 
of expressed sequences in the sample. The sequence information can then be stored in a 

format (e.g., a computer-readable format) that allows for ready comparison of the REP with
a TEP. The REP can be normalized prior to or after data storage, and/or can be processed to 
selectively remove sequences of expressed genes that are of less interest or that might 
complicate analysis (e.g., some or all of the sequences associated with housekeeping genes 
can be eliminated from REP data). 
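
The normalization and filtering steps described above can be sketched, by way of non-limiting example, as follows. Normalization to total retained signal is only one possible choice and is an assumption of this example, as are the function name and the gene labels (`ACTB` is used here merely as an illustrative housekeeping gene).

```python
def normalize_pattern(raw_signals, housekeeping=()):
    """Drop housekeeping genes, then scale remaining signals to sum to 1.

    raw_signals: mapping of gene identifier to raw signal or count.
    housekeeping: identifiers to exclude before normalization.
    """
    kept = {g: s for g, s in raw_signals.items() if g not in housekeeping}
    total = sum(kept.values())
    if total <= 0:
        raise ValueError("no positive signal remains after filtering")
    return {g: s / total for g, s in kept.items()}
```

Storing the REP as such a mapping of gene identifier to normalized level is one computer-readable format that allows direct comparison with a TEP prepared in the same way.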

TEPs can be generated in a manner similar to REPs, e.g., by hybridizing a test sample

to an array having a selected set of polynucleotides, particularly a selected set of 
differentially expressed polynucleotides, acquiring the hybridization data from the array, and 
storing the data in a format that allows for ready comparison of the TEP with a REP. The 
REP and TEP to be used in a comparison can be generated simultaneously, or the TEP can 

be compared to previously generated and stored REPs.

In one embodiment of the invention, comparison of a TEP with a REP involves

hybridizing a test sample with a reference array, where the reference array has one or more 

reference sequences for use in hybridization with a sample. The reference sequences include 

all, at least one of, or any subset of the differentially expressed polynucleotides described 

herein. Hybridization data for the test sample is acquired, the data normalized, and the

produced TEP compared with a REP generated using an array having the same or similar 

selected set of differentially expressed polynucleotides. Probes that correspond to sequences 

differentially expressed between the two samples will show decreased or increased 

hybridization efficiency for one of the samples relative to the other. 

Reference arrays can be produced according to any suitable methods known in the

art. For example, methods of producing large arrays of oligonucleotides are described in 
U.S. 5,143,854, and U.S. 5,445,934 using light-directed synthesis techniques. Using a
computer controlled system, a heterogeneous array of monomers is converted, through 
simultaneous coupling at a number of reaction sites, into a heterogeneous array of polymers. 

Alternatively, microarrays are generated by deposition of pre-synthesized oligonucleotides
onto a solid substrate, for example as described in PCT published application no. 
WO 95/35505. 

Methods for collection of data from hybridization of samples with reference arrays
are also well known in the art. For example, the polynucleotides of the reference and test
samples can be generated using a detectable fluorescent label, and hybridization of the
polynucleotides in the samples detected by scanning the microarrays for the presence of the
detectable label. Methods and devices for detecting fluorescently marked targets on devices
are known in the art. Generally, such detection devices include a microscope and light 
source for directing light at a substrate. A photon counter detects fluorescence from the 

substrate, while an x-y translation stage varies the location of the substrate. A confocal
detection device that can be used in the subject methods is described in U.S. Patent no.
5,631,734. A scanning laser microscope is described in Shalon et al., Genome Res. (1996)
6:639. A scan, using the appropriate excitation line, is performed for each fluorophore used.
The digital images generated from the scan are then combined for subsequent analysis. For
any particular array element, the fluorescent signal from one sample (e.g., a test
sample) is compared to the fluorescent signal from another sample (e.g., a reference sample),
and the relative signal intensity determined.
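
As a non-limiting sketch, the relative signal intensity for each array element can be expressed as a log2 ratio of the test signal to the reference signal. The use of a log2 scale, the signal floor that avoids division by zero, and the function name are assumptions made for this example.

```python
import math


def signal_ratios(test_signals, reference_signals, floor=1.0):
    """Return the log2(test / reference) ratio for each array element.

    Signals below `floor` are raised to `floor` so that very weak or
    zero readings do not produce undefined ratios.
    """
    ratios = {}
    for element, test in test_signals.items():
        ref = reference_signals[element]
        ratios[element] = math.log2(max(test, floor) / max(ref, floor))
    return ratios
```

On this scale a value of 0 indicates equal signal in the two samples, positive values indicate stronger signal in the test sample, and negative values indicate stronger signal in the reference sample.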

Methods for analyzing the data collected from hybridization to arrays are well known 

in the art. For example, where detection of hybridization involves a fluorescent label, data 

analysis can include the steps of determining fluorescent intensity as a function of substrate
position from the data collected, removing outliers, i.e., data deviating from a predetermined

statistical distribution, and calculating the relative binding affinity of the targets from the 

remaining data. The resulting data can be displayed as an image with the intensity in each 

region varying according to the binding affinity between targets and probes. 

In general, the test sample is classified as having a gene expression profile

corresponding to that associated with a disease or non-disease state by comparing the TEP 
generated from the test sample to one or more REPs generated from reference samples (e.g., 
from samples associated with cancer or specific stages of cancer, dysplasia, samples affected 
by a disease other than cancer, normal samples, etc.). The criteria for a match or a 

substantial match between a TEP and a REP include expression of the same or substantially
the same set of reference genes, as well as expression of these reference genes at
substantially the same levels (e.g., no significant difference between the samples for a signal
associated with a selected reference sequence after normalization of the samples, or at least
no greater than about 25% to about 40% difference in signal strength for a given reference
sequence). In general, a pattern match between a TEP and a REP includes a match in

expression, preferably a match in qualitative or quantitative expression level, of at least one 
of, all or any subset of the differentially expressed genes of the invention. 
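
The match criteria described above can be sketched as follows. This is an illustrative simplification only: it assumes each TEP and REP is a mapping from reference gene to normalized signal, requires an identical gene set rather than a "substantially the same" set, and measures the difference relative to the REP level. The function names, the gene labels, and the example values are hypothetical.

```python
def is_substantial_match(tep, rep, max_rel_diff=0.25):
    """Return True if tep and rep cover the same genes at similar levels.

    max_rel_diff is the largest tolerated relative difference in signal
    (0.25 to 0.40 corresponds to the "about 25% to about 40%" range).
    """
    if set(tep) != set(rep):
        return False
    for gene, ref_level in rep.items():
        test_level = tep[gene]
        if ref_level == 0:
            if test_level != 0:
                return False
            continue
        if abs(test_level - ref_level) / abs(ref_level) > max_rel_diff:
            return False
    return True


def matching_references(tep, reps, max_rel_diff=0.25):
    """Return the labels of all reference profiles that the test profile matches."""
    return [
        label
        for label, rep in reps.items()
        if is_substantial_match(tep, rep, max_rel_diff)
    ]


# Hypothetical normalized signals for two reference genes.
reps = {
    "normal": {"geneA": 100.0, "geneB": 40.0},
    "tumor": {"geneA": 300.0, "geneB": 10.0},
}
labels = matching_references({"geneA": 110.0, "geneB": 35.0}, reps)
```

Raising `max_rel_diff` toward 0.40 makes the match more permissive; a test profile that matches no reference profile returns an empty list.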

Pattern matching can be performed manually, or can be performed using a computer
program. Methods for preparation of substrate matrices (e.g., arrays), design of
oligonucleotides for use with such matrices, labeling of probes, hybridization conditions,
scanning of hybridized matrices, and analysis of patterns generated, including comparison
analysis, are described in, for example, U.S. 5,800,992.

F. Use of the Polynucleotides of the Invention in Cancer 
Oncogenesis involves the unbridled growth, dedifferentiation and abnormal
migration of cells. Cancerous cells can have the ability to compress, invade, and destroy
normal tissue. Cancerous cells may also metastasize to other parts of the body via the
bloodstream or the lymph system and colonize in these other areas. Different cancers are
classified by the cell from which the cancerous cell is derived and by its cellular
morphology and/or state of differentiation.

Somatic genetic abnormalities cause cancer initiation and progression. Cancer
generally is clonally formed, i.e., gain of function of oncogenes and loss of function of tumor
suppressor genes within a single cell transform the cell to be cancerous, and that single cell
grows and divides to form a cancerous lesion. The genes known to be involved in cancer
initiation and progression are involved in numerous cellular functions, including
developmental differentiation, cell cycle regulation, cell signaling, immunological response,
DNA replication, and DNA repair.

The identification and characterization of genetic or biochemical markers in blood or
tissues that will detect the earliest changes along the carcinogenesis pathway and monitor the
efficacy of various therapies and preventive interventions is a major goal of cancer research.
Scientists have identified genetic changes in stool specimens that indicate the stages of colon
cancer, and other biomarkers such as gene mutations, hormone receptors, proteins that
inhibit metastasis, and enzymes that metabolize drugs are all being used to determine the
severity and predict the course of breast, prostate, lung, and other cancers.

Recent advances in understanding the pathogenesis of certain cancers have been helpful in
determining patient treatment. The level of expression of certain polynucleotides can be
indicative of a poorer prognosis, and therefore warrant more aggressive chemo- or
radiotherapy for a patient. The correlation of novel surrogate tumor specific features with
response to treatment and outcome in patients has defined certain prognostic indicators that
allow the design of tailored therapy based on the molecular profile of the tumor. These
therapies include antibody targeting and gene therapy. Moreover, a promising level of one
or more marker polynucleotides can provide impetus for not aggressively treating a
particular patient, thus sparing the patient the deleterious side effects of aggressive therapy.
Determining expression of certain polynucleotides and comparison of a patient's profile with
known expression in normal tissue and variants of the disease allows a determination of the
best possible treatment for a patient, both in terms of specificity of treatment and in terms of
comfort level of the patient.

Surrogate tumor markers, such as polynucleotide expression, can also be used to
better classify, and thus diagnose and treat, different forms and disease states of cancer.
Two classifications widely used in oncology that can benefit from identification of the
expression levels of the polynucleotides of the invention are staging of the cancerous
disorder, and grading the nature of the cancerous tissue.

Staging. Staging is a process used by physicians to describe how advanced the
cancerous state is in a patient. Staging assists the physician in determining a prognosis,
planning treatment and evaluating the results of such treatment. Different staging systems
are used for different types of cancer, but each generally involves the following
determinations: the type of tumor, indicated by T; whether the cancer has metastasized to
nearby lymph nodes, indicated by N; and whether the cancer has metastasized to more
distant parts of the body, indicated by M. This system of staging is called the TNM system.
Generally, if a cancer is only detectable in the area of the primary lesion without having
spread to any lymph nodes it is called Stage I. If it has spread only to the closest lymph
nodes, it is called Stage II. In Stage III, the cancer has generally spread to the lymph nodes
in near proximity to the site of the primary lesion. Cancers that have spread to a distant part
of the body, such as the liver, bone, brain or another site, are called Stage IV, the most
advanced stage.

Currently, the determination of staging is done using pathological techniques and is
based more on the presence or absence of malignant tissue than on the characteristics of
the tumor type. Presence or absence of malignant tissue is based primarily on the gross
morphology of the cells in the areas biopsied. The polynucleotides of the invention can
facilitate fine-tuning of the staging process by identifying markers for the aggressiveness of a
cancer, e.g., the metastatic potential, as well as the presence in different areas of the body.
Thus, detection of a polynucleotide signifying high metastatic potential can be used to
change a borderline Stage II tumor to a Stage III tumor, justifying more aggressive therapy.
Conversely, the presence of a polynucleotide signifying a lower metastatic potential allows
more conservative staging of a tumor.

Grading of cancers. Grade is a term used to describe how closely a tumor resembles
normal tissue of its same type. Based on the microscopic appearance of a tumor,
pathologists will identify the grade of a tumor using parameters such as cell morphology,
cellular organization, and other markers of differentiation. As a general rule, the grade of a
tumor corresponds to its rate of growth or aggressiveness. That is, undifferentiated or
high-grade tumors grow more quickly than well differentiated or low-grade tumors. Information
about tumor grade is useful in planning treatment and predicting prognosis.

The American Joint Committee on Cancer has recommended the following
guidelines for grading tumors: 1) GX Grade cannot be assessed; 2) G1 Well differentiated;
3) G2 Moderately well differentiated; 4) G3 Poorly differentiated; 5) G4 Undifferentiated.
Although grading is used by pathologists to describe most cancers, it plays a more important
role in treatment planning for certain types than for others. An example is the Gleason
system that is specific for prostate cancer, which uses grade numbers to describe the degree
of differentiation. Lower Gleason scores indicate well-differentiated cells. Intermediate
scores denote tumors with moderately differentiated cells. Higher scores describe poorly
differentiated cells. Grade is also important in some types of brain tumors and soft tissue
sarcomas.

The polynucleotides of the invention can be especially valuable in determining the
grade of the tumor, as they not only can aid in determining the differentiation status of the
cells of a tumor, they can also identify factors other than differentiation that are valuable in
determining the aggressiveness of a tumor, such as metastatic potential.

Familial Cancer Genes. A number of cancer syndromes are linked to Mendelian
inheritance of a predisposition to develop particular cancers. The following table contains a
list of cancer types that can be inherited, and for which the gene or genes responsible have
been identified. Most of the cancer types listed can occur as part of several different genetic
conditions, each caused by alterations in a different gene.



Cancer Type                             Genetic Condition                                 Gene

Brain                                   Li-Fraumeni syndrome                              TP53
                                        Neurofibromatosis 1                               NF1
                                        Neurofibromatosis 2                               NF2
                                        von Hippel-Lindau syndrome                        VHL
                                        Tuberous sclerosis 2                              TSC2

Breast                                  Hereditary breast/ovarian cancer 1                BRCA1
                                        Hereditary breast/ovarian cancer 2                BRCA2
                                        Li-Fraumeni syndrome                              TP53
                                        Ataxia telangiectasia                             ATM

Colon                                   Familial adenomatous polyposis (FAP)              APC
                                        Hereditary non-polyposis colon cancer (HNPCC) 1   hMSH2
                                        Hereditary non-polyposis colon cancer (HNPCC) 2   hMLH1
                                        Hereditary non-polyposis colon cancer (HNPCC) 3   hPMS1
                                        Hereditary non-polyposis colon cancer (HNPCC) 4   hPMS2

Endocrine                               Multiple endocrine neoplasia 1 (MEN1)             MEN1
(parathyroid, pituitary, GI endocrine)

Endocrine                               Multiple endocrine neoplasia 2 (MEN2)             RET
(pheochromocytoma, medullary thyroid)

Endometrial                             Hereditary non-polyposis colon cancer (HNPCC) 1   hMSH2
                                        Hereditary non-polyposis colon cancer (HNPCC) 2   hMLH1
                                        Hereditary non-polyposis colon cancer (HNPCC) 3   hPMS1
                                        Hereditary non-polyposis colon cancer (HNPCC) 4   hPMS2

Eye                                     Hereditary retinoblastoma                         RB1

Hematologic                             Li-Fraumeni syndrome                              TP53
(lymphomas and leukemia)                Ataxia telangiectasia                             ATM

Kidney                                  Hereditary Wilms' tumor                           WT1
                                        von Hippel-Lindau syndrome                        VHL
                                        Tuberous sclerosis 2                              TSC2

Ovary                                   Hereditary breast/ovarian cancer 1                BRCA1
                                        Hereditary breast/ovarian cancer 2                BRCA2

Sarcoma                                 Hereditary retinoblastoma                         RB1
                                        Li-Fraumeni syndrome                              TP53
                                        Neurofibromatosis 1                               NF1

Skin                                    Hereditary melanoma 1                             CDKN2
                                        Hereditary melanoma 2                             CDK4
                                        Basal cell naevus (Gorlin) syndrome               PTCH

Stomach                                 Hereditary non-polyposis colon cancer (HNPCC) 1   hMSH2
                                        Hereditary non-polyposis colon cancer (HNPCC) 2   hMLH1
                                        Hereditary non-polyposis colon cancer (HNPCC) 3   hPMS1
                                        Hereditary non-polyposis colon cancer (HNPCC) 4   hPMS2



The polynucleotides of the invention can be especially useful to monitor patients having any
of the above syndromes to detect potentially malignant events at a molecular level before
they are detectable at a gross morphological level. As can be seen from the table, a number
of genes are involved in multiple forms of cancer. Thus, a polynucleotide of the invention
identified as important for metastatic colon cancer can also have clinical implications for a
patient diagnosed with stomach cancer or endometrial cancer.

Lung Cancer. Lung cancer is one of the most common cancers in the United States,
accounting for about 15 percent of all cancer cases, or 170,000 new cases each year. At this
time, over half of the lung cancer cases in the United States are in men, but the number found
in women is increasing and will soon equal that in men. Today more women die of lung
cancer than of breast cancer. Lung cancer is especially difficult to diagnose and treat
because of the large size of the lungs, which allows cancer to develop for years undetected.
In fact, lung cancer can spread outside the lungs without causing any symptoms. Adding to
the confusion, the most common symptom of lung cancer, a persistent cough, can often be
mistaken for a cold or bronchitis.

Although there are more than a dozen different kinds of lung cancer, the two main
types of lung cancer are small cell and nonsmall cell, which encompass about 90% of all
lung cancer cases. Small cell carcinoma (also called oat cell carcinoma), which usually starts
in one of the larger bronchial tubes, grows fairly rapidly, and is likely to be large by the time
of diagnosis. Nonsmall cell lung cancer (NSCLC) is made up of three general subtypes of
lung cancer. Epidermoid carcinoma (also called squamous cell carcinoma) usually starts in
one of the larger bronchial tubes and grows relatively slowly. The size of these tumors can
range from very small to quite large. Adenocarcinoma starts growing near the outside
surface of the lung and can vary in both size and growth rate. Some slowly growing
adenocarcinomas are described as alveolar cell cancer. Large cell carcinoma starts near the
surface of the lung, grows rapidly, and the growth is usually fairly large when diagnosed.
Other less common forms of lung cancer are carcinoid, cylindroma, mucoepidermoid, and
malignant mesothelioma.

Currently, CT scans, MRIs, X-rays, sputum cytology, and biopsies are used to
diagnose nonsmall cell lung cancer. The form and cellular origin of the lung cancer are
diagnosed primarily through biopsy, by either a surgical biopsy or a needle aspiration of
lung tissue, and usually the biopsy is prompted by an abnormality identified on an X-ray.
In some cases, sputum cytology can reveal lung cancers in patients with normal X-rays or
can determine the type of lung cancer, but because it cannot pinpoint the tumor's location, a
positive sputum cytology test is usually followed by further tests. Since these tests are based
in large part on gross morphology of the tissue, the diagnosis of a particular kind of tumor is
largely subjective, and the diagnosis can vary significantly between clinicians.

The polynucleotides of the invention can be used to distinguish types of lung cancer
as well as to identify traits specific to a certain patient's cancer. For example, if the patient's
biopsy expresses a polynucleotide that is associated with a low metastatic potential, it may
justify leaving a larger portion of the patient's lung in surgery to remove the lesion.
Alternatively, a smaller lesion with expression of a polynucleotide that is associated with
high metastatic potential may justify a more radical removal of lung tissue and/or the
surrounding lymph nodes, even if no metastasis can be identified through pathological
examination.

Similarly, the expression of polynucleotides of the invention can be used in the
diagnosis, prognosis and management of lung cancer. The differential expression of a
polynucleotide in hyperplasia can be used as a diagnostic marker for metastatic lung cancer.
The polynucleotides of the invention that would be especially useful for this purpose are
those that exhibit differential expression between high metastatic versus low metastatic lung
cancer, i.e., SEQ ID NOS: 9, 34, 42, 62, 74, 106, 119, 135, 154, 160, 260, 308, 323, 349,
361, 369, 371, 381, 395, and 400. Detection of malignant lung cancer with a higher
metastatic potential can be accomplished using expression levels of any of these sequences
alone or in combination with the levels of expression of other known genes.

Breast Cancer. The National Cancer Institute (NCI) estimates that about 1 in 8
women in the United States will develop breast cancer during their lifetimes. Clinical breast
examination and mammography are recommended as combined modalities for breast cancer
screening, and the nature of the cancer will often depend upon the location of the tumor and
the cell type from which the tumor is derived. The majority of breast cancers are
adenocarcinoma subtypes, which can be summarized as follows:

Ductal carcinoma in situ (DCIS): Ductal carcinoma in situ is the most common type
of noninvasive breast cancer. In DCIS, the malignant cells have not metastasized through
the walls of the ducts into the fatty tissue of the breast. Comedocarcinoma is a type of DCIS
that is more likely than other types of DCIS to come back in the same area after
lumpectomy. It is more closely linked to eventual development of invasive ductal carcinoma
than other forms of DCIS.

Infiltrating (or invasive) ductal carcinoma (IDC): This type of cancer has metastasized
through the wall of the duct and invaded the fatty tissue of the breast. At this point, it has the
potential to use the lymphatic system and bloodstream for metastasis to more distant parts of
the body. Infiltrating ductal carcinoma accounts for about 80% of breast cancers.

Lobular carcinoma in situ (LCIS): While not a true cancer, LCIS (also called lobular
neoplasia) is sometimes classified as a type of noninvasive breast cancer. It does not
penetrate through the wall of the lobules. Although it does not itself usually become an
invasive cancer, women with this condition have a higher risk of developing an invasive
breast cancer in the same breast, or in the opposite breast.

Infiltrating (or invasive) lobular carcinoma (ILC): ILC is similar to IDC, in that it has
the potential to metastasize elsewhere in the body. About 10% to 15% of invasive breast
cancers are invasive lobular carcinomas. ILC can be more difficult to detect by
mammogram than IDC.

Inflammatory breast cancer: This rare type of invasive breast cancer accounts for
about 1% of all breast cancers and is extremely aggressive. Multiple skin symptoms
associated with this cancer are caused by cancer cells blocking lymph vessels or channels in
the skin over the breast.

Medullary carcinoma: This special type of infiltrating breast cancer has a relatively
well defined, distinct boundary between tumor tissue and normal tissue. It accounts for
about 5% of breast cancers. The prognosis for this kind of breast cancer is better than for
other types of invasive breast cancer.

Mucinous carcinoma: This rare type of invasive breast cancer originates from
mucus-producing cells. The prognosis for mucinous carcinoma is better than for the more
common types of invasive breast cancer.

Paget's disease of the nipple: This type of breast cancer starts in the ducts and spreads
to the skin of the nipple and the areola. It is a rare type of breast cancer, occurring in only
1% of all cases. Paget's disease can be associated with in situ carcinoma, or with infiltrating
breast carcinoma. If no lump can be felt in the breast tissue, and the biopsy shows DCIS but
no invasive cancer, the prognosis is excellent.

Phyllodes tumor: This very rare type of breast tumor forms from the stroma of the
breast, in contrast to carcinomas which develop in the ducts or lobules. Phyllodes (also
spelled phylloides) tumors are usually benign, but are malignant on rare occasions.
Malignant phyllodes tumors are very rare, and fewer than 10 women per year in
the US die of this disease. Benign phyllodes tumors are successfully treated by removing the
mass and a narrow margin of normal breast tissue.

Tubular carcinoma: Accounting for about 2% of all breast cancers, tubular
carcinomas are a special type of infiltrating breast carcinoma. They have a better prognosis
than usual infiltrating ductal or lobular carcinomas.

High-quality mammography combined with clinical breast exam remains the only
screening method clearly tied to reduction in breast cancer mortality. Lower dose x-rays,
digitized computer images rather than film images, and the use of computer programs to
assist diagnosis are almost ready for widespread dissemination. Other technologies also are
being developed, including magnetic resonance imaging and ultrasound. In addition, a very
low radiation exposure technique, positron emission tomography, has the potential for
detecting early breast cancer.

It is also possible to differentiate between non-cancerous breast tissue and malignant
breast tissue by analyzing differential gene expression between tissues. In addition, there
may be several possible alterations that lead to the various possible types of breast cancer.
The different types of breast tumors (e.g., invasive vs. non-invasive, ductal vs. axillary
lymph node) can be differentiated from one another by the identification of the differences in
genes expressed by different types of breast tumor tissues (Porter-Jordan et al., Hematol
Oncol Clin North Am (1994) 5:73). Breast cancer can thus be generally diagnosed by
detection of expression of a gene or genes associated with breast tumors. Where enough
information is available about the differential gene expression between various types of
breast tumor tissues, the specific type of breast tumor can also be diagnosed.

For example, increased estrogen receptor (ER) expression in normal breast
epithelium, while not itself indicative of malignant tissue, is a known risk marker for
development of breast cancer. Khan SA et al., Cancer Res (1994) 54:993. Malignant breast
cancer is often divided into two groups, ER-positive and ER-negative, based on the estrogen
receptor status of the tissue. The ER status correlates with differences in survival length and
response to hormone therapy, and is thought to represent either: 1) an indicator of different
stages of the disease, or 2) an indicator that allows differentiation between two similar but
distinct diseases. K. Zhu et al., Med. Hypoth. (1997) 49:69. A number of other genes are
known to vary in expression between either different stages of cancer or different types of
similar breast cancer.

Similarly, the expression of polynucleotides of the invention can be used in the
diagnosis and management of breast cancer. The differential expression of a polynucleotide
in human breast tumor tissue can be used as a diagnostic marker for human breast cancer.
The polynucleotides of the invention that would be especially useful for this purpose are
those that exhibit differential expression between breast cancer tissue with a high metastatic
potential and a low metastatic potential, i.e., SEQ ID NOS: 9, 42, 52, 62, 65, 66, 68, 114,
123, 144, 172, 178, 214, 219, 223, 258, 317, and 379. Detection of breast cancer can be
accomplished using expression levels of any of these sequences alone or in combination.
The aggressive nature and/or the metastatic potential of a breast cancer can
also be determined by comparing levels of one or more polynucleotides of the invention with
levels of another sequence known to vary in cancerous tissue, e.g., ER expression.
In addition, development of breast cancer can be detected by examining the ratio of SEQ ID
NO: to the levels of steroid hormones (e.g., testosterone or estrogen) or to other hormones
(e.g., growth hormone, insulin). Thus expression of specific marker polynucleotides can
be used to discriminate between normal and cancerous breast tissue, to discriminate between
breast cancers with different cells of origin, to discriminate between breast cancers with
different potential metastatic rates, etc.

Diagnosis of breast cancer can also involve comparing the expression of a
polynucleotide of the invention with the expression of other sequences in non-malignant
breast tissue samples in comparison to one or more forms of the diseased tissue. A
comparison of expression of one or more polynucleotides of the invention between the
samples provides information on relative levels of these polynucleotides as well as the ratio
of these polynucleotides to the expression of other sequences in the tissue of interest
compared to normal.

The risk of breast cancer is elevated significantly by the presence of an inherited risk
for breast cancer, such as a mutation in BRCA-1 or BRCA-2. New diagnostic tools are
being developed to address the needs of higher risk patients to complement mammography
and physical examinations for early detection of breast cancer, particularly among younger
women. The presence of antigen or expression markers in nipple aspirate fluid (NAF)
samples collected from one or both breasts can be useful for risk assessment or
early cancer detection. Breast cytology and biomarkers obtained by random fine needle
aspiration have been used to identify hyperplasia with atypia and overexpression of p53 and
EGFR. The polynucleotides of the invention can be used in multivariate analysis with
expression studies with genes such as p53 and EGFR as risk predictors and as surrogate
endpoint biomarkers for breast cancer.

As well as being used for diagnosis and risk assessment, the expression of certain
genes can also be correlated to prognosis of a disease state. The expression of particular
genes has been used as a prognostic indicator for breast cancer, including increased
expression of c-erbB-2, pS2, ER, progesterone receptor, epidermal growth factor receptor
(EGFR), neu, myc, bcl-2, int2, cytosolic tyrosine kinase, cyclin E, prad-1, hst, uPA, PAI-1,
PAI-2, cathepsin D, as well as the presence of a number of cancer-specific antigens, e.g., CEA,
CA M26, CA M29 and CA 15.3. Davis, Br. J. Biomed. Sci. (1996) 53:157. Poor prognosis has
also been linked to a decrease in expression of certain genes, such as p53, Rb, and nm23. The
expression of the polynucleotides of the invention can be of prognostic value for determining
the metastatic potential of a malignant breast cancer, as these molecules are differentially
expressed between tumor tissues of high and low metastatic potential. The levels of these
polynucleotides in patients with malignant breast cancer can be compared to normal tissue,
malignant tissue with a known high potential metastatic level, and malignant tissue with a
known lower level of metastatic potential to provide a prognosis for a particular patient.
Such a prognosis is predictive of the extent and nature of the cancer. The determined
prognosis is useful in managing a patient with breast cancer, both for
initial treatment of the disease and for longer-term monitoring of the same patient. If
samples are taken from the same individual over a period of time, differences in
polynucleotide expression that are specific to that patient can be identified and closely
watched.
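
The comparison of a patient's marker levels against normal, high metastatic potential, and low metastatic potential reference tissue can be illustrated with a simple nearest-profile assignment. The sketch below is a hypothetical illustration only and is not a validated prognostic method: the Euclidean distance measure, the function name `nearest_reference`, the marker labels, and all numeric values are assumptions.

```python
import math


def nearest_reference(patient, references):
    """Return the label of the reference profile closest to the patient profile.

    patient: mapping of marker name to normalized expression level.
    references: mapping of label to a profile covering the same markers.
    """
    def distance(profile):
        return math.sqrt(sum((patient[m] - profile[m]) ** 2 for m in patient))

    return min(references, key=lambda label: distance(references[label]))


# Hypothetical normalized expression levels for two marker polynucleotides.
references = {
    "normal": {"marker1": 1.0, "marker2": 1.0},
    "low metastatic potential": {"marker1": 2.0, "marker2": 1.5},
    "high metastatic potential": {"marker1": 5.0, "marker2": 4.0},
}
closest = nearest_reference({"marker1": 4.5, "marker2": 3.8}, references)
```

Repeating the comparison on samples taken from the same individual over time would show whether the patient's profile is moving toward or away from a given reference group.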

Colon Cancer. Colorectal cancer is one of the most common neoplasms in humans
and perhaps the most frequent form of hereditary neoplasia. Prevention and early detection
are key factors in controlling and curing colorectal cancer. Indeed, colorectal cancer is the
second most preventable cancer, after lung cancer. Colorectal cancer begins as polyps,
which are small, benign growths of cells that form on the inner lining of the colon. Over a
period of several years, some of these polyps accumulate additional mutations and become
cancerous. About 20 percent of all cases of colon cancer are thought to be related to
heredity. Currently, multiple familial colorectal cancer disorders have been identified, which
are summarized as follows:

Familial adenomatous polyposis (FAP): This condition results in a person having
hundreds or even thousands of polyps in the colon and rectum that usually first appear during
the teenage years. Cancer nearly always develops in one or more of these polyps between
the ages of 30 and 50.

Gardner's syndrome: Like FAP, Gardner's syndrome results in polyps and colorectal
cancers that develop at a young age. It can also cause benign tumors of the skin, soft
connective tissue and bones.

Hereditary nonpolyposis colon cancer (HNPCC): People with this condition tend to
develop colorectal cancer at a young age, without first having many polyps. HNPCC has an
autosomal dominant pattern of inheritance with variable but high penetrance estimated to be
about 90%. HNPCC underlies 0.5%-10% of all cases of colorectal cancer. An understanding
of the mechanisms behind the development of HNPCC is emerging, and genetic
presymptomatic testing, now being conducted in research settings, soon will be available on
a widespread basis for individuals identified at risk for this disease.

Familial colorectal cancer in Ashkenazi Jews: Recent research has found an inherited
tendency to develop colorectal cancer among some Jews of Eastern European descent.
Like people with FAP, Gardner's syndrome, and HNPCC, their increased risk is due to an
inherited mutation present in about 6% of American Jews.

Several tests are currently used to screen for colorectal cancer, including digital rectal
examination, fecal occult blood test, sigmoidoscopy, colonoscopy, virtual colonoscopy and
MRI. Each of these tests identifies potential colorectal cancer lesions, or a risk of
development of these lesions, at a fairly gross morphological level.

The sequential alteration of a number of genes is associated with malignant
adenocarcinoma, including the genes DCC, p53, ras, and FAP. For a review, see, e.g., Fearon
ER, et al., Cell (1990) 61(5):759; Hamilton SR et al., Cancer (1993) 72:957; Bodmer W, et
al., Nat Genet (1994) 4(3):217; Fearon ER, Ann N Y Acad Sci. (1995) 768:101. Molecular
genetic alterations are thus promising as potential diagnostic and prognostic indicators in
colorectal carcinoma, since it is possible to differentiate between different types of
colorectal neoplasias using molecular markers. Colorectal cancer can thus be generally
diagnosed by detection of expression of a gene or genes associated with colorectal tumors.

Similarly, the expression of polynucleotides of the invention can be used in the
diagnosis, prognosis and management of colorectal cancer. The differential expression of a
polynucleotide in hyperplasia can be used as a diagnostic marker for colon cancer. The
polynucleotides of the invention that would be especially useful for this purpose are those
that exhibit differential expression between malignant metastatic colon cancer and normal
patient tissue, i.e., SEQ ID NOS: 52, 119, 172, 288. Detection of malignant colon cancer
can be accomplished using expression levels of any of these sequences alone or in combination
with the levels of expression of other known genes.

The aggressive nature and/or the metastatic potential of a colon
cancer can also be determined by comparing levels of one or more polynucleotides of the
invention with total levels of another sequence known to vary in cancerous tissue,
e.g., p53 expression. In addition, development of colon cancer can be detected by examining
the ratio of any of the polynucleotides of the invention to the levels of oncogenes (e.g., ras)
or tumor suppressor genes (e.g., FAP or p53). Thus expression of specific marker
polynucleotides can be used to discriminate between normal and cancerous colon tissue, to
discriminate between colon cancers with different cells of origin, to discriminate between
colon cancers with different potential metastatic rates, etc.

G. Use of Polynucleotides to Screen for Peptide Analogs and Antagonists 
Polypeptides encoded by the instant polynucleotides and corresponding full length 
genes can be used to screen peptide libraries to identify binding partners, such as receptors, 
from among the encoded polypeptides. 

A library of peptides can be synthesized following the methods disclosed in U.S. Pat.
No. 5,010,175 ('175), and in WO 91/17823. As described below in brief, one prepares a
mixture of peptides, which is then screened to identify the peptides exhibiting the desired
signal transduction and receptor binding activity. In the '175 method, a suitable peptide
synthesis support (e.g., a resin) is coupled to a mixture of appropriately protected, activated
amino acids. The concentration of each amino acid in the reaction mixture is balanced or
adjusted in inverse proportion to its coupling reaction rate so that the product is an equimolar
mixture of amino acids coupled to the starting resin. The bound amino acids are then
deprotected, and reacted with another balanced amino acid mixture to form an equimolar
mixture of all possible dipeptides. This process is repeated until a mixture of peptides of the
desired length (e.g., hexamers) is formed. Note that one need not include all amino acids in
each step: one can include only one or two amino acids in some steps (e.g., where it is
known that a particular amino acid is essential in a given position), thus reducing the
complexity of the mixture. After the synthesis of the peptide library is completed, the
mixture of peptides is screened for binding to the selected polypeptide. The peptides are
then tested for their ability to inhibit or enhance activity. Peptides exhibiting the desired
activity are then isolated and sequenced.
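
The balancing step of the '175 method can be illustrated with a short calculation. The sketch below is illustrative only and assumes that the amount of each amino acid coupled is proportional to the product of its relative coupling rate and its concentration; the function name and the rate values are hypothetical and are not taken from the '175 patent.

```python
def balanced_concentrations(relative_rates, total=1.0):
    """Return concentrations inversely proportional to coupling rate.

    relative_rates maps an amino acid name to a relative coupling rate
    (hypothetical values). Concentrations are scaled to sum to `total`.
    """
    inverse = {aa: 1.0 / rate for aa, rate in relative_rates.items()}
    scale = total / sum(inverse.values())
    return {aa: value * scale for aa, value in inverse.items()}


# Hypothetical relative coupling rates, for illustration only.
rates = {"Gly": 2.0, "Ala": 1.0, "Val": 0.5}
concentrations = balanced_concentrations(rates)

# Under the stated assumption, rate * concentration is the same for every
# amino acid, which corresponds to an equimolar coupled product.
coupled = {aa: rates[aa] * concentrations[aa] for aa in rates}
```

In this sketch the slowest-coupling amino acid receives the highest concentration, which is the sense in which the mixture is "balanced".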

The method described in WO 91/17823 is similar. However, instead of reacting the
synthesis resin with a mixture of activated amino acids, the resin is divided into twenty equal
portions (or into a number of portions corresponding to the number of different amino acids
to be added in that step), and each amino acid is coupled individually to its portion of resin.
The resin portions are then combined, mixed, and again divided into a number of equal
portions for reaction with the second amino acid. In this manner, each reaction can be easily
driven to completion. Additionally, one can maintain separate "subpools" by treating
portions in parallel, rather than combining all resins at each step. This simplifies the process
of determining which peptides are responsible for any observed receptor binding or signal
transduction activity.
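
The divide, couple, and recombine cycle described above can be sketched as follows. This is a simplified model, not the procedure of WO 91/17823 itself: peptides are represented as strings of one-letter codes in order of coupling, every coupling is assumed to go to completion, and the function names are hypothetical.

```python
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the twenty standard one-letter codes


def split_and_mix(monomers_per_step):
    """Enumerate the peptides produced by repeated divide/couple/recombine."""
    pool = [""]
    for monomers in monomers_per_step:
        # Divide: one resin portion per monomer; couple; then recombine.
        pool = [peptide + monomer for monomer in monomers for peptide in pool]
    return pool


def library_size(monomers_per_step):
    """Number of distinct peptides without enumerating them."""
    size = 1
    for monomers in monomers_per_step:
        size *= len(monomers)
    return size


# A small library: two choices, one fixed residue, two choices.
small = split_and_mix(["AG", "K", "DE"])

# Hexamers with all twenty amino acids at every position, and with one
# position fixed to a single amino acid.
full_hexamers = library_size([AMINO_ACIDS] * 6)
one_fixed = library_size([AMINO_ACIDS] * 5 + ["K"])
```

Fixing a single position reduces the hexamer library twenty-fold in this model, which illustrates how restricting the amino acids used in some steps reduces the complexity of the mixture.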

In such cases, the subpools containing, e.g., 1-2,000 candidates each are exposed to
one or more polypeptides of the invention. Each subpool that produces a positive result is
then resynthesized as a group of smaller subpools (sub-subpools) containing, e.g., 20-100
candidates, and reassayed. Positive sub-subpools can be resynthesized as individual
compounds, and assayed finally to determine the peptides that exhibit a high binding
constant. These peptides can be tested for their ability to inhibit or enhance the native
activity. The methods described in WO 91/17823 and U.S. Patent No. 5,194,392 (herein
incorporated by reference) enable the preparation of such pools and subpools by automated
techniques in parallel, such that all synthesis and resynthesis can be performed in a matter of
days.
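
The subpool strategy above amounts to an iterative deconvolution, which can be sketched as below. The sketch assumes an idealized assay in which a pool is positive exactly when it contains at least one binder; the pool sizes are chosen from the ranges given above, and the candidate identifiers, binder set, and function names are hypothetical.

```python
def chunks(items, size):
    """Split a list into consecutive pools of at most `size` items."""
    return [items[i:i + size] for i in range(0, len(items), size)]


def deconvolute(candidates, pool_is_positive, pool_sizes=(2000, 100, 1)):
    """Narrow positive pools round by round; return (hits, assays performed)."""
    active = [candidates]
    assays = 0
    for size in pool_sizes:
        next_active = []
        for pool in active:
            for subpool in chunks(pool, size):
                assays += 1
                if pool_is_positive(subpool):
                    next_active.append(subpool)
        active = next_active
    # The final pool size is 1, so each remaining pool holds one candidate.
    hits = [pool[0] for pool in active]
    return hits, assays


# Hypothetical library of 10,000 candidates with two binders.
library = list(range(10_000))
binders = {42, 7777}
hits, assays = deconvolute(library, lambda pool: any(c in binders for c in pool))
```

In this example the two binders are recovered with 245 assays rather than 10,000 individual assays, which is the practical benefit of maintaining subpools.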

Peptide agonists or antagonists are screened using any available method, such as
signal transduction, antibody binding, receptor binding, mitogenic assays, chemotaxis
assays, etc. The methods described herein are presently preferred. The assay conditions
ideally should resemble the conditions under which the native activity is exhibited in vivo,
that is, under physiologic pH, temperature, and ionic strength. Suitable agonists or
antagonists will exhibit strong inhibition or enhancement of the native activity at
concentrations that do not cause toxic side effects in the subject. Agonists or antagonists that
compete for binding to the native polypeptide can require concentrations equal to or greater
than the native concentration, while inhibitors capable of binding irreversibly to the
polypeptide can be added in concentrations on the order of the native concentration.

The end results of such screening and experimentation will be at least one novel
polypeptide binding partner, such as a receptor, encoded by a gene or a cDNA corresponding
to a polynucleotide of the invention, and at least one peptide agonist or antagonist of the
novel binding partner. Such agonists and antagonists can be used to modulate, enhance, or
inhibit receptor function in cells to which the receptor is native, or in cells that possess the
receptor as a result of genetic engineering. Further, if the novel receptor shares biologically
important characteristics with a known receptor, information about agonist/antagonist
binding can facilitate development of improved agonists/antagonists of the known receptor.

H. Pharmaceutical Compositions and Therapeutic Uses
Pharmaceutical compositions can comprise polypeptides, antibodies, or
polynucleotides of the claimed invention. The pharmaceutical compositions will comprise a
therapeutically effective amount of either polypeptides, antibodies, or polynucleotides of the
claimed invention.

The term "therapeutically effective amount" as used herein refers to an amount of a
therapeutic agent sufficient to treat, ameliorate, or prevent a desired disease or condition, or
to exhibit a detectable therapeutic or preventative effect. The effect can be detected by, for
example, chemical markers or antigen levels. Therapeutic effects also include reduction in
physical symptoms, such as decreased body temperature. The precise effective amount for a
subject will depend upon the subject's size and health, the nature and extent of the condition,
and the therapeutics or combination of therapeutics selected for administration. Thus, it is not
useful to specify an exact effective amount in advance. However, the effective amount for a
given situation is determined by routine experimentation and is within the judgment of the
clinician. For purposes of the present invention, an effective dose will generally be from
about 0.01 mg/kg to 50 mg/kg or 0.05 mg/kg to about 10 mg/kg of the DNA constructs in
the individual to which it is administered.
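
As a worked illustration of the dose ranges stated above, the following sketch converts a per-kilogram range into a total amount for a given body weight. It is arithmetic only; the 70 kg body weight and the function name are assumptions chosen for illustration, and the result is not a dosing recommendation.

```python
def dose_range_mg(body_weight_kg, low_mg_per_kg=0.01, high_mg_per_kg=50.0):
    """Return (low, high) total dose in mg for the given body weight."""
    if body_weight_kg <= 0:
        raise ValueError("body weight must be positive")
    return (low_mg_per_kg * body_weight_kg, high_mg_per_kg * body_weight_kg)


# Hypothetical 70 kg individual.
broad_range = dose_range_mg(70.0)               # about 0.01 to 50 mg/kg
narrow_range = dose_range_mg(70.0, 0.05, 10.0)  # about 0.05 to 10 mg/kg
```

For the assumed 70 kg individual, the broader range corresponds to roughly 0.7 mg to 3,500 mg and the narrower range to roughly 3.5 mg to 700 mg.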

A pharmaceutical composition can also contain a pharmaceutically acceptable carrier.
The term "pharmaceutically acceptable carrier" refers to a carrier for administration of a
therapeutic agent, such as antibodies or a polypeptide, genes, and other therapeutic agents.
The term refers to any pharmaceutical carrier that does not itself induce the production of
antibodies harmful to the individual receiving the composition, and which can be
administered without undue toxicity. Suitable carriers can be large, slowly metabolized
macromolecules such as proteins, polysaccharides, polylactic acids, polyglycolic acids,
polymeric amino acids, amino acid copolymers, and inactive virus particles. Such carriers
are well known to those of ordinary skill in the art.

Pharmaceutically acceptable salts can be used therein, for example, mineral acid salts
such as hydrochlorides, hydrobromides, phosphates, sulfates, and the like; and the salts of
organic acids such as acetates, propionates, malonates, benzoates, and the like. A thorough
discussion of pharmaceutically acceptable excipients is available in Remington's
Pharmaceutical Sciences (Mack Pub. Co., N.J. 1991).

Pharmaceutically acceptable carriers in therapeutic compositions can include liquids
such as water, saline, glycerol and ethanol. Auxiliary substances, such as wetting or
emulsifying agents, pH buffering substances, and the like, can also be present in such
vehicles. Typically, the therapeutic compositions are prepared as injectables, either as liquid
solutions or suspensions; solid forms suitable for solution in, or suspension in, liquid
vehicles prior to injection can also be prepared. Liposomes are included within the
definition of a pharmaceutically acceptable carrier.

Delivery Methods. Once formulated, the compositions of the invention can be
(1) administered directly to the subject (e.g., as polynucleotide or polypeptides); (2)
delivered ex vivo, to cells derived from the subject (e.g., as in ex vivo gene therapy); or (3)
delivered in vitro for expression of recombinant proteins (e.g., polynucleotides). Direct
delivery of the compositions will generally be accomplished by injection, either
subcutaneously, intraperitoneally, intravenously or intramuscularly, or delivered to the
interstitial space of a tissue. The compositions can also be administered into a tumor or
lesion. Other modes of administration include oral and pulmonary administration,
suppositories, and transdermal applications, needles, and gene guns or hyposprays. Dosage
treatment can be a single dose schedule or a multiple dose schedule.

Methods for the ex vivo delivery and reimplantation of transformed cells into a
subject are known in the art and described in, e.g., International Publication No. WO
93/14778. Examples of cells useful in ex vivo applications include, for example, stem cells,
particularly hematopoietic, lymph cells, macrophages, dendritic cells, or tumor cells.
Generally, delivery of nucleic acids for both ex vivo and in vitro applications can be
accomplished by, for example, dextran-mediated transfection, calcium phosphate
precipitation, polybrene mediated transfection, protoplast fusion, electroporation,
encapsulation of the polynucleotide(s) in liposomes, and direct microinjection of the DNA
into nuclei, all well known in the art.

Once a gene corresponding to a polynucleotide of the invention has been found to
correlate with a proliferative disorder, such as neoplasia, dysplasia, and hyperplasia, the
disorder can be amenable to treatment by administration of a therapeutic agent based on the
provided polynucleotide or corresponding polypeptide.

Preparation of antisense polynucleotides is discussed above. Neoplasias that are
treated with the antisense composition include, but are not limited to, cervical cancers,
melanomas, colorectal adenocarcinomas, Wilms' tumor, retinoblastoma, sarcomas,
myosarcomas, lung carcinomas, leukemias, such as chronic myelogenous leukemia,
promyelocytic leukemia, monocytic leukemia, and myeloid leukemia, and lymphomas, such
as histiocytic lymphoma. Proliferative disorders that are treated with the therapeutic
composition include disorders such as anhydric hereditary ectodermal dysplasia, congenital
alveolar dysplasia, epithelial dysplasia of the cervix, fibrous dysplasia of bone, and
mammary dysplasia. Hyperplasias, for example, endometrial, adrenal, breast, prostate, or
thyroid hyperplasias or pseudoepitheliomatous hyperplasia of the skin, are treated with
antisense therapeutic compositions based upon a polynucleotide of the invention. Even in
disorders in which mutations in the corresponding gene are not implicated, downregulation
or inhibition of expression of a gene corresponding to a polynucleotide of the invention can
have therapeutic application. For example, decreasing gene expression can help to suppress
tumors in which enhanced expression of the gene is implicated.

Both the dose of the antisense composition and the means of administration are
determined based on the specific qualities of the therapeutic composition, the condition, age,
and weight of the patient, the progression of the disease, and other relevant factors.
Administration of the therapeutic antisense agents of the invention includes local or systemic
administration, including injection, oral administration, particle gun or catheterized
administration, and topical administration. Preferably, the therapeutic antisense composition
contains an expression construct comprising a promoter and a polynucleotide segment of at
least 12, 22, 25, 30, or 35 contiguous nucleotides of the antisense strand of a polynucleotide
disclosed herein. Within the expression construct, the polynucleotide segment is located
downstream from the promoter, and transcription of the polynucleotide segment initiates at
the promoter.
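
The notion of a segment of contiguous nucleotides of the antisense strand can be illustrated computationally. The sketch below assumes the disclosed sequence is given as the sense strand in the 5' to 3' direction and takes the antisense strand to be its reverse complement; the example sequence and the function names are hypothetical and do not correspond to any SEQ ID NO.

```python
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def antisense_segments(sense_seq, length=12):
    """List every contiguous antisense segment of the given length."""
    antisense = reverse_complement(sense_seq)
    return [antisense[i:i + length]
            for i in range(len(antisense) - length + 1)]


# Hypothetical 21-nucleotide sense sequence, for illustration only.
example_sense = "ATGGCCATTGTAATGGGCCGC"
segments_12 = antisense_segments(example_sense, length=12)
```

A sequence shorter than the requested length yields no segments, so the minimum lengths recited above (12, 22, 25, 30, or 35) can be checked simply by whether the returned list is empty.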

Various methods are used to administer the therapeutic composition directly to a
specific site in the body. For example, a small metastatic lesion is located and the
therapeutic composition injected several times in several different locations within the body
of the tumor. Alternatively, arteries which serve a tumor are identified, and the therapeutic
composition injected into such an artery, in order to deliver the composition directly into the
tumor. A tumor that has a necrotic center is aspirated and the composition injected directly
into the now empty center of the tumor. The antisense composition is directly administered
to the surface of the tumor, for example, by topical application of the composition. X-ray
imaging is used to assist in certain of the above delivery methods.

Receptor-mediated targeted delivery of therapeutic compositions containing an
antisense polynucleotide, subgenomic polynucleotides, or antibodies to specific tissues is
also used. Receptor-mediated DNA delivery techniques are described in, for example,
Findeis et al., Trends Biotechnol. (1993) 11:202; Chiou et al., Gene Therapeutics: Methods
And Applications Of Direct Gene Transfer (J.A. Wolff, ed.) (1994); Wu et al., J. Biol. Chem.
(1988) 263:621; Wu et al., J. Biol. Chem. (1994) 269:542; Zenke et al., Proc. Natl. Acad.
Sci. (USA) (1990) 87:3655; Wu et al., J. Biol. Chem. (1991) 266:338. Preferably,
receptor-mediated targeted delivery of therapeutic compositions containing antibodies of the
invention is used to deliver the antibodies to specific tissue.

Therapeutic compositions containing antisense subgenomic polynucleotides are
administered in a range of about 100 ng to about 200 mg of DNA for local administration in
a gene therapy protocol. Concentration ranges of about 500 ng to about 50 mg, about 1 µg to
about 2 mg, about 5 µg to about 500 µg, and about 20 µg to about 100 µg of DNA can also
be used during a gene therapy protocol. Factors such as method of action and efficacy of
transformation and expression are considerations which will affect the dosage required for
ultimate efficacy of the antisense subgenomic polynucleotides. Where greater expression is
desired over a larger area of tissue, larger amounts of antisense subgenomic polynucleotides
or the same amounts readministered in a successive protocol of administrations, or several
administrations to different adjacent or close tissue portions of, for example, a tumor site,
may be required to effect a positive therapeutic outcome. In all cases, routine
experimentation in clinical trials will determine specific ranges for optimal therapeutic
effect. A more complete description of gene therapy vectors, especially retroviral vectors, is
contained in U.S. Serial No. 08/869,309, which is expressly incorporated herein, and in
section G below.

For polynucleotide-related genes encoding polypeptides or proteins with
anti-inflammatory activity, suitable use, doses, and administration are described in U.S. Patent
No. 5,654,173. Therapeutic agents also include antibodies to proteins and polypeptides
encoded by the polynucleotides of the invention and related genes, as described in U.S.
Patent No. 5,654,173.

I. Gene Therapy

The therapeutic polynucleotides and polypeptides of the present invention can be
utilized in gene delivery vehicles. The gene delivery vehicle can be of viral or non-viral
origin (see generally, Jolly, Cancer Gene Therapy (1994) 1:51; Kimura, Human Gene
Therapy (1994) 5:845; Connelly, Human Gene Therapy (1995) 1:185; and Kaplitt, Nature
Genetics (1994) 6:148). Gene therapy vehicles for delivery of constructs including a coding
sequence of a therapeutic of the invention can be administered either locally or systemically.
These constructs can utilize viral or non-viral vector approaches. Expression of such coding
sequences can be induced using endogenous mammalian or heterologous promoters.
Expression of the coding sequence can be either constitutive or regulated.

The present invention can employ recombinant retroviruses which are constructed to
carry or express a selected nucleic acid molecule of interest. Retrovirus vectors that can be
employed include those described in EP 0 415 731; WO 90/07936; WO 94/03622; WO
93/25698; WO 93/25234; U.S. Patent No. 5,219,740; WO 93/11230; WO 93/10218; Vile
and Hart, Cancer Res. (1993) 53:3860; Vile et al., Cancer Res. (1993) 53:962; Ram et al.,
Cancer Res. (1993) 53:83; Takamiya et al., J. Neurosci. Res. (1992) 33:493; Baba et al., J.
Neurosurg. (1993) 79:729; U.S. Patent No. 4,777,127; GB Patent No. 2,200,651; and EP 0
345 242. Preferred recombinant retroviruses include those described in WO 91/02805.
Packaging cell lines suitable for use with the above-described retroviral vector
constructs can be readily prepared (see, e.g., WO 95/30763 and WO 92/05266), and used to
create producer cell lines (also termed vector cell lines) for the production of recombinant
vector particles. Within particularly preferred embodiments of the invention, packaging cell
lines are made from human (such as HT1080 cells) or mink parent cell lines, thereby
allowing production of recombinant retroviruses that can survive inactivation in human
serum.

The present invention also employs alphavirus-based vectors that can function as
gene delivery vehicles. Such vectors can be constructed from a wide variety of alphaviruses,
including, for example, Sindbis virus vectors, Semliki forest virus (ATCC VR-67; ATCC
VR-1247), Ross River virus (ATCC VR-373; ATCC VR-1246) and Venezuelan equine
encephalitis virus (ATCC VR-923; ATCC VR-1250; ATCC VR 1249; ATCC VR-532).
Representative examples of such vector systems include those described in U.S. Patent Nos.
5,091,309; 5,217,879; and 5,185,440; WO 92/10578; WO 94/21792; WO 95/27069; WO
95/27044; and WO 95/07994. Gene delivery vehicles of the present invention can also
employ parvovirus such as adeno-associated virus (AAV) vectors. Representative examples
include the AAV vectors disclosed by Srivastava in WO 93/09239, Samulski et al., J. Virol.
(1989) 63:3822; Mendelson et al., Virol. (1988) 166:154; and Flotte et al., PNAS (1993)
90:10613.

Representative examples of adenoviral vectors include those described by Berkner,
Biotechniques (1988) 6:616; Rosenfeld et al., Science (1991) 252:431; WO 93/19191; Kolls
et al., PNAS (1994) 91:215; Kass-Eisler et al., PNAS (1993) 90:11498; Guzman et al.,
Circulation (1993) 88:2838; Guzman et al., Cir. Res. (1993) 73:1202; Zabner et al., Cell
(1993) 75:207; Li et al., Hum. Gene Ther. (1993) 4:403; Cailaud et al., Eur. J. Neurosci.
(1993) 5:1287; Vincent et al., Nat. Genet. (1993) 5:130; Jaffe et al., Nat. Genet. (1992)
1:372; and Levrero et al., Gene (1991) 101:195. Exemplary adenoviral gene therapy vectors
employable in this invention also include those described in WO 94/12649, WO 93/03769;
WO 93/19191; WO 94/28938; WO 95/11984 and WO 95/00655. Administration of DNA
linked to killed adenovirus as described in Curiel, Hum. Gene Ther. (1992) 3:147 can be
employed.

Other gene delivery vehicles and methods can be employed, including polycationic
condensed DNA linked or unlinked to killed adenovirus alone, for example Curiel, Hum.
Gene Ther. (1992) 3:147; ligand linked DNA, for example see Wu, J. Biol. Chem. (1989)
264:16985; eukaryotic cell delivery vehicles cells, for example see U.S. Pat. No. 5,814,482;
WO 95/07994; WO 96/17072; WO 95/30763; and WO 97/42338; deposition of
photopolymerized hydrogel materials; hand-held gene transfer particle gun, as described in
U.S. Patent No. 5,149,655; ionizing radiation as described in U.S. Patent No. 5,206,152 and
in WO 92/11033; nucleic charge neutralization or fusion with cell membranes. Additional
approaches are described in Philip, Mol. Cell Biol. (1994) 14:2411, and in Woffendin, Proc.
Natl. Acad. Sci. (1994) 91:1581.

Naked DNA can also be employed. Exemplary naked DNA introduction methods are
described in WO 90/11092 and U.S. Patent No. 5,580,859. Uptake efficiency can be
improved using biodegradable latex beads. DNA coated latex beads are efficiently
transported into cells after endocytosis initiation by the beads. The method can be improved
further by treatment of the beads to increase hydrophobicity and thereby facilitate disruption
of the endosome and release of the DNA into the cytoplasm. Liposomes that can act as gene
delivery vehicles are described in U.S. Patent No. 5,422,120; WO 95/13796; WO 94/23697;
WO 91/14445; and EP 0524968.

Further non-viral delivery suitable for use includes mechanical delivery systems such
as the approach described in Woffendin et al., Proc. Natl. Acad. Sci. USA (1994)
91(24):11581. Moreover, the coding sequence and the product of expression of such can be
delivered through deposition of photopolymerized hydrogel materials. Other conventional
methods for gene delivery that can be used for delivery of the coding sequence include, for
example, use of hand-held gene transfer particle gun, as described in U.S. Patent No.
5,149,655; use of ionizing radiation for activating transferred gene, as described in U.S.
Patent No. 5,206,152 and WO 92/11033.

The present invention will now be illustrated by reference to the following examples
which set forth particularly advantageous embodiments. However, it should be noted that
these embodiments are illustrative and are not to be construed as restricting the invention in
any way.

EXAMPLES

The present invention is now illustrated by reference to the following examples 
which set forth particularly advantageous embodiments. However, these embodiments are 
illustrative and are not meant to be construed as restricting the invention in any way. 

Example 1: Source of Biological Materials and Overview of Novel Polynucleotides
Expressed by the Biological Materials

Human colon cancer cell line KM12L4-A (Morikawa, K. et al., Cancer Research
(1988) 48:6863) was used to construct a cDNA library from mRNA isolated from the cells.
As described in the above overview, a total of 4,693 sequences expressed by the KM12L4-A
cell line were isolated and analyzed; most sequences were about 275-300 nucleotides in
length. The KM12L4-A cell line is derived from the KM12C cell line. The KM12C cell
line, which is poorly metastatic (low metastatic), was established in culture from a Dukes'
stage B2 surgical specimen (Morikawa et al., Cancer Res. (1988) 48:6863). The KM12L4-A is
a highly metastatic subline derived from KM12C (Yeatman et al., Nucl. Acids Res. (1995)
23:4007; Bao-Ling et al., Proc. Annu. Meet. Am. Assoc. Cancer. Res. (1995) 27:3269). The
KM12C and KM12C-derived cell lines (e.g., KM12L4, KM12L4-A, etc.) are
well-recognized in the art as a model cell line for the study of colon cancer (see, e.g., Morikawa
et al., supra; Radinsky et al., Clin. Cancer Res. (1995) 1:19; Yeatman et al. (1995) supra;
Yeatman et al., Clin. Exp. Metastasis (1996) 14:246).

The sequences were first masked to eliminate low complexity sequences using the XBLAST
masking program (Claverie "Effective Large-Scale Sequence Similarity Searches," In:
Computer Methods for Macromolecular Sequence Analysis, Doolittle, ed., Meth. Enzymol.
266:212-227 Academic Press, NY, NY (1996); see particularly Claverie, in "Automated
DNA Sequencing and Analysis Techniques" Adams et al., eds., Chap. 36, p. 267 Academic
Press, San Diego, 1994 and Claverie et al., Comput. Chem. (1993) 17:191). Generally,
masking does not influence the final search results, except to eliminate sequences of
relatively little interest due to their low complexity, and to eliminate multiple "hits" based on
similarity to repetitive regions common to multiple sequences, e.g., Alu repeats. Masking
resulted in the elimination of 43 sequences. The remaining sequences were then used in a
BLASTN vs. Genbank search with search parameters of greater than 70% overlap, 99%
identity, and a p value of less than 1 x 10^-40, which search resulted in the discarding of
1,432 sequences. Sequences from this search also were discarded if the inclusive parameters
were met, but the sequence was ribosomal or vector-derived.

The resulting sequences from the previous search were classified into three groups
(1, 2 and 3 below) and searched in a BLASTX vs. NRP (non-redundant proteins) database
search: (1) unknown (no hits in the Genbank search), (2) weak similarity (greater than 45%
identity and p value of less than 1 x 10^-5), and (3) high similarity (greater than 60% overlap,
greater than 80% identity, and p value less than 1 x 10^-5). This search resulted in discard of
98 sequences as having greater than 70% overlap, greater than 99% identity, and p value of
less than 1 x 10^-40.

The remaining sequences were classified as unknown (no hits), weak similarity, and
high similarity (parameters as above). Two searches were performed on these sequences.
First, a BLAST vs. EST database search resulted in discard of 1771 sequences (sequences
with greater than 99% overlap, greater than 99% similarity and a p value of less than
1 x 10^-40; sequences with a p value of less than 1 x 10^-65 when compared to a database
sequence of human origin were also excluded). Second, a BLASTN vs. Patent GeneSeq
database search resulted in discard of 15 sequences (greater than 99% identity; p value less
than 1 x 10^-40; greater than 99% overlap).
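
The successive discard steps described above can be tallied as follows. This sketch is illustrative only: it assumes the stated discard counts apply sequentially to the 4,693 starting sequences, and it treats every threshold as a strict inequality, which the text does not specify for the 99% identity criterion of the Genbank search. The class and function names are hypothetical and do not describe the software actually used.

```python
from dataclasses import dataclass


@dataclass
class Hit:
    """Best database hit for one query sequence (hypothetical structure)."""
    overlap_pct: float
    identity_pct: float
    p_value: float


def matches_known_sequence(hit, min_overlap=70.0, min_identity=99.0,
                           max_p=1e-40):
    """Discard rule of the Genbank search, with strict inequalities assumed."""
    return (hit.overlap_pct > min_overlap
            and hit.identity_pct > min_identity
            and hit.p_value < max_p)


# Discard counts as stated in the text, in the order the steps are described.
DISCARDS = [
    ("XBLAST masking", 43),
    ("BLASTN vs. Genbank", 1432),
    ("BLASTX vs. NRP", 98),
    ("BLAST vs. EST", 1771),
    ("BLASTN vs. Patent GeneSeq", 15),
]


def remaining_after_each_step(total, discards):
    """Return (step name, sequences remaining) after each discard step."""
    remaining = []
    for name, count in discards:
        total -= count
        remaining.append((name, total))
    return remaining


tally = remaining_after_each_step(4693, DISCARDS)
```

Under these assumptions 1,334 sequences remain after the five itemized steps; any further reduction reflects screening rules that are not itemized here.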

The remaining sequences were subjected to screening using other rules and
redundancies in the dataset. Sequences with a p value of less than 1 x 10^-111 in relation to a
database sequence of human origin were specifically excluded. The final result provided the
404 sequences listed in the accompanying Sequence Listing. The Sequence Listing is
arranged beginning with sequences with no similarity to any sequence in a database
searched, and ending with sequences with the greatest similarity. Each identified
polynucleotide represents sequence from at least a partial mRNA transcript. Polynucleotides
that were determined to be novel were assigned a sequence identification number.

The novel polynucleotides were assigned sequence identification numbers SEQ
ID NOS: 1-404. The DNA sequences corresponding to the novel polynucleotides are
provided in the Sequence Listing. The majority of the sequences are presented in the
Sequence Listing in the 5' to 3' direction. A small number, 25, are listed in the Sequence
Listing in the 5' to 3' direction but the sequence as written is actually 3' to 5'. These
sequences are readily identified with the designation "AR" in the Sequence Name in Table 1
(inserted before the claims). The sequences correctly listed in the 5' to 3' direction in the
Sequence Listing are designated "AF." The Sequence Listing filed herewith therefore
contains 25 sequences listed in the reverse order, namely SEQ ID NOS: 47, 97, 137, 171,
173, 179, 182, 194, 200, 202, 213, 227, 258, 264, 275, 302, 313, 324, 329, 330, 331, 338,
358, 379, and 404.

Because the provided polynucleotides represent partial mRNA transcripts, two or
more polynucleotides of the invention may represent different regions of the same mRNA
transcript and the same gene. Thus, if two or more SEQ ID NOS: are identified as belonging
to the same clone, then either sequence can be used to obtain the full-length mRNA or gene.

In order to confirm the sequences of SEQ ID NOS: 1-404, inserts of the clones
corresponding to these polynucleotides were re-sequenced. These "validation" sequences
are provided in SEQ ID NOS: 405-800. These validation sequences were often longer than
the original polynucleotide sequences they validate, and thus often provide additional
sequence information. Validation sequences can be correlated with the original sequences
they validate by identifying those sequences of SEQ ID NOS: 1-404 and the validation
sequences of SEQ ID NOS: 405-800 that share the same clone name in Table 1.
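
The correlation of original and validation sequences by clone name can be sketched as a simple grouping. The rows below are hypothetical placeholders and are not entries from Table 1; the function name and the row layout (SEQ ID NO, clone name) are likewise assumptions made for illustration.

```python
from collections import defaultdict


def pair_by_clone(table_rows, last_original_id=404):
    """Group SEQ ID NOs by clone name into original and validation lists."""
    by_clone = defaultdict(lambda: {"original": [], "validation": []})
    for seq_id, clone_name in table_rows:
        kind = "original" if seq_id <= last_original_id else "validation"
        by_clone[clone_name][kind].append(seq_id)
    return dict(by_clone)


# Hypothetical rows of (SEQ ID NO, clone name); not taken from Table 1.
example_rows = [
    (1, "cloneA"),
    (2, "cloneB"),
    (3, "cloneA"),
    (405, "cloneA"),
    (406, "cloneB"),
]
pairs = pair_by_clone(example_rows)
```

In this sketch a clone with more than one original sequence, such as the hypothetical "cloneA", illustrates the case in which two SEQ ID NOS belong to the same clone.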

Example 2: Results of Public Database Search to Identify Function of Gene Products

SEQ ID NOS: 1-404, as well as the validation sequences SEQ ID NOS: 405-800, were
translated in all three reading frames to determine the best alignment with the individual
sequences. These amino acid sequences and nucleotide sequences are referred to, generally,
as query sequences, which are aligned with the individual sequences. Query and individual
sequences were aligned using the BLAST programs, available over the world wide web at
http://www.ncbi.nlm.nih.gov/BLAST/. Again the sequences were masked to various extents
to prevent searching of repetitive sequences or poly-A sequences, using the XBLAST
program for masking low complexity as described above in Example 1.
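
Translation of a nucleotide sequence in three reading frames can be sketched as follows. The sketch uses the standard genetic code and translates only the three forward frames of the sequence as written; the example sequence and function names are hypothetical, and the sketch is not the translation performed by the BLAST programs.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(_BASES, repeat=3), _AMINO)}


def translate(seq, frame=0):
    """Translate one forward frame; unknown codons become 'X', stops '*'."""
    seq = seq.upper()
    return "".join(CODON_TABLE.get(seq[i:i + 3], "X")
                   for i in range(frame, len(seq) - 2, 3))


def three_frame_translations(seq):
    """Return the translations of forward frames 0, 1, and 2."""
    return [translate(seq, frame) for frame in range(3)]


# Hypothetical short sequence, for illustration only.
frames = three_frame_translations("ATGGCCTAA")
```

Each of the three translations could then serve as an amino acid query sequence in a protein database search.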

Table 2 (inserted before the claims) shows the results of the alignments. Table 2
refers to each sequence by its SEQ ID NO:, the accession numbers and descriptions of
nearest neighbors from the Genbank and Non-Redundant Protein searches, and the p values
of the search results. Table 1 identifies each SEQ ID NO: by SEQ name, clone ID, and
cluster. As discussed above, a single cluster includes polynucleotides representing the same
gene or gene family, and generally represents sequences encoding the same gene product.

For each of SEQ ID NOS: 1-800, the best alignment to a protein or DNA sequence is
included in Table 2. The activity of the polypeptide encoded by SEQ ID NOS: 1-800 is the
same or similar to the nearest neighbor reported in Table 2. The accession number of the
nearest neighbor is reported, providing a reference to the activities exhibited by the nearest
neighbor. The search program and database used for the alignment also are indicated as well
as a calculation of the p value.

Full length sequences or fragments of the polynucleotide sequences of the nearest
neighbors can be used as probes and primers to identify and isolate the full length sequence
of SEQ ID NOS: 1-800. The nearest neighbors can indicate a tissue or cell type to be used to
construct a library for the full-length sequences of SEQ ID NOS: 1-800.

SEQ ID NOS: 1-800 and the translations thereof may be human homologs of known
genes of other species or novel allelic variants of known human genes. In such cases, these
new human sequences are suitable as diagnostics or therapeutics. As diagnostics, the human
sequences SEQ ID NOS: 1-800 exhibit greater specificity in detecting and differentiating
human cell lines and types than homologs of other species. The human polypeptides
encoded by SEQ ID NOS: 1-800 are likely to be less immunogenic when administered to
humans than homologs from other species. Further, on administration to humans, the
polypeptides encoded by SEQ ID NOS: 1-800 can show greater specificity or can be better
regulated by other human proteins than are homologs from other species.

Example 3: Members of Protein Families

After conducting a profile search as described in the specification above, several of the
polynucleotides of the invention were found to encode polypeptides having characteristics of
a polypeptide belonging to a known protein family (and thus represent new members of
these protein families) and/or comprising a known functional domain (Table 3). Thus the
invention encompasses fragments, fusions, and variants of such polynucleotides that retain
biological activity associated with the protein family and/or functional domain identified
herein.

Table 3. Polynucleotides encoding gene products of a protein family or having a known
functional domain(s).

SEQ ID NO:  Biological Activity (Profile hit)                       Start        Stop         Dir
24          4 transmembrane segments integral membrane proteins     1218         578          rev
41          4 transmembrane segments integral membrane proteins     1086         413          rev
101         4 transmembrane segments integral membrane proteins     1206         544          rev
157         4 transmembrane segments integral membrane proteins     721          33           rev
341         4 transmembrane segments integral membrane proteins     1253         613          rev
395         4 transmembrane segments integral membrane proteins     530          10           for
395         4 transmembrane segments integral membrane proteins     696          17           for
395         4 transmembrane segments integral membrane proteins     471          39           rev
24          7 transmembrane receptor (Secretin family)              1301         491          rev
41          7 transmembrane receptor (Secretin family)              1309         10           rev
101         7 transmembrane receptor (Secretin family)              1330         296          rev
157         7 transmembrane receptor (Secretin family)              1173         249          rev
291         7 transmembrane receptor (Secretin family)              1400         269          rev
291         7 transmembrane receptor (Secretin family)              712          130          for
305         7 transmembrane receptor (Secretin family)              926          4            for
305         7 transmembrane receptor (Secretin family)              753          55           rev
315         7 transmembrane receptor (Secretin family)              1058         270          rev
341         7 transmembrane receptor (Secretin family)              1265         534          rev
116         Ank repeat                                              141          218          for
251         Ank repeat                                              290          207          for
251         Ank repeat                                              467          387          for
63          ATPases Associated with Various Cellular Activities     543          60           for
116         ATPases Associated with Various Cellular Activities     802          313          for
134         ATPases Associated with Various Cellular Activities     525          57           rev
136         ATPases Associated with Various Cellular Activities     712          163          for
151         ATPases Associated with Various Cellular Activities     719          73           for
151         ATPases Associated with Various Cellular Activities     386          13           for
384         ATPases Associated with Various Cellular Activities     664          140          for
404         ATPases Associated with Various Cellular Activities     [illegible]  [illegible]  for
374         Basic region plus leucine zipper transcription factors  [illegible]  [illegible]  for
97          Bromodomain (conserved sequence found in human,         [illegible]  [illegible]  for
            Drosophila and yeast proteins.)
136         EF-hand                                                 [illegible]  [illegible]  for
242         EF-hand                                                 [illegible]  1            for
379         EF-hand                                                 719          [illegible]  for
308         Eukaryotic aspartyl proteases                           [illegible]  [illegible]  rev
213         GATA family of transcription factors                    720          377          for
367         G-protein alpha subunit                                 971          467          rev
188         Phorbol esters/diacylglycerol binding                   91           177          for
251         Phorbol esters/diacylglycerol binding                   133          219          for
202         protein kinase                                          482          1            rev
202         protein kinase                                          970          1            rev
315         protein kinase                                          739          158          for
315         protein kinase                                          [illegible]  [illegible]  for
367         protein kinase                                          [illegible]  [illegible]  rev
397         protein kinase                                          511          6            for
256         Protein phosphatase 2C                                  [illegible]  [illegible]  for
256         Protein phosphatase 2C                                  [illegible]  [illegible]  for
382         Protein Tyrosine Phosphatase                            [illegible]  [illegible]  for
306         SH3 Domain                                              141          296          for
386         SH3 Domain                                              359          209          for
169         Trypsin                                                 764          164          rev
188         WD domain, G-beta repeats                               480          382          for
188         WD domain, G-beta repeats                               206          117          for
335         WD domain, G-beta repeats                               3            92           for
23          wnt family of developmental signaling proteins          1151         335          rev
291         wnt family of developmental signaling proteins          779          89           rev
291         wnt family of developmental signaling proteins          1347         382          rev
324         wnt family of developmental signaling proteins          1180         499          rev
330         wnt family of developmental signaling proteins          1180         499          rev
341         wnt family of developmental signaling proteins          1399         560          rev
353         wnt family of developmental signaling proteins          880          49           rev
188         WW/rsp5/WWP domain containing proteins                  431          354          for
379         WW/rsp5/WWP domain containing proteins                  12           89           for
395         WW/rsp5/WWP domain containing proteins                  153          76           for
395         WW/rsp5/WWP domain containing proteins                  156          64           for
61          Zinc finger, C2H2 type                                  254          192          for
306         Zinc finger, C2H2 type                                  428          367          for
386         Zinc finger, C2H2 type                                  191          253          for
322         Zinc finger, CCHC class                                 553          503          for
306         Zinc-binding metalloprotease domain                     101          60           rev
395         Zinc-binding metalloprotease domain                     28           69           rev



Start and stop indicate the positions within the individual sequences that align with the
query sequence having the indicated SEQ ID NO. The direction (Dir) indicates the
orientation of the query sequence with respect to the individual sequence, where forward
(for) indicates that the alignment is in the same direction (left to right) as the sequence
provided in the Sequence Listing and reverse (rev) indicates that the alignment is with a
sequence complementary to the sequence provided in the Sequence Listing.
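
As an illustration of how the Start, Stop and Dir columns might be applied, the following
minimal Python sketch extracts the aligned region of a sequence. It assumes, which the table
notes do not state, that positions are 1-based and inclusive and that Start and Stop may be
given in either order. The function names and the toy sequence are hypothetical and are not
taken from the Sequence Listing.

```python
# Illustrative sketch only: the coordinate conventions used here are assumptions,
# not conventions stated in the specification.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def aligned_region(seq: str, start: int, stop: int, direction: str) -> str:
    """Extract the region of `seq` bounded by `start` and `stop`.

    Assumes 1-based, inclusive positions given in either order.
    'for' returns the region as listed; 'rev' returns its reverse complement.
    """
    low, high = sorted((start, stop))
    if low < 1 or high > len(seq):
        raise ValueError("coordinates fall outside the sequence")
    region = seq[low - 1:high]
    if direction == "for":
        return region
    if direction == "rev":
        return reverse_complement(region)
    raise ValueError("direction must be 'for' or 'rev'")


if __name__ == "__main__":
    toy = "AACCGGTTAC"  # hypothetical sequence
    print(aligned_region(toy, 3, 6, "for"))
    print(aligned_region(toy, 8, 2, "rev"))
```

Sorting the two coordinates before slicing is a deliberate choice in this sketch, since the
table lists rows in which Start exceeds Stop for both orientations.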



Some polynucleotides exhibited multiple profile hits because, for example, the particular
sequence contains overlapping profile regions, and/or the sequence contains two different
functional domains. These profile hits are described in more detail below.

a) Four Transmembrane Integral Membrane Proteins. SEQ ID NOS: 24, 41, 101,
157, 341, and 395 correspond to sequences encoding polypeptides that are members of the 4
transmembrane segments integral membrane protein family (transmembrane 4 family). The
transmembrane 4 family of proteins includes a number of evolutionarily related eukaryotic
cell surface antigens (Levy et al., J. Biol. Chem. (1991) 266:14597; Tomlinson et al., Eur. J.
Immunol. (1993) 23:136; Barclay et al., The Leucocyte Antigen FactsBook (1993) Academic
Press, London/San Diego). The proteins belonging to this family include: 1) Mammalian
antigen CD9 (MIC3), which is involved in platelet activation and aggregation; 2)
Mammalian leukocyte antigen CD37, expressed on B lymphocytes; 3) Mammalian
leukocyte antigen CD53 (OX-44), which is implicated in growth regulation in hematopoietic
cells; 4) Mammalian lysosomal membrane protein CD63 (melanoma-associated antigen
ME491; antigen AD1); 5) Mammalian antigen CD81 (cell surface protein TAPA-1), which
is implicated in regulation of lymphoma cell growth; 6) Mammalian antigen CD82 (protein
R2; antigen C33; Kangai 1 (KAI1)), which associates with CD4 or CD8 and delivers
costimulatory signals for the TCR/CD3 pathway; 7) Mammalian antigen CD151 (SFA-1;
platelet-endothelial tetraspan antigen 3 (PETA-3)); 8) Mammalian cell surface glycoprotein
A15 (TALLA-1; MXS1); 9) Mammalian novel antigen 2 (NAG-2); 10) Human
tumor-associated antigen CO-029; 11) Schistosoma mansoni and japonicum 23 Kd surface
antigen (SM23 / SJ23).

The members of the 4 transmembrane family share several characteristics. First, they
all are apparently type III membrane proteins, which are integral membrane proteins
containing an N-terminal membrane-anchoring domain that is not cleaved during
biosynthesis and that functions both as a translocation signal and as a membrane anchor.
The family members also contain three additional transmembrane regions and at least seven
conserved cysteine residues, and are of approximately the same size (218 to 284 residues).
These proteins are collectively known as the "transmembrane 4 superfamily" (TM4) because
they span the plasma membrane four times. A schematic diagram of the domain structure of
these proteins is as follows:

+-+-----+-------+-----+-----+-----+---------------+-----+-----+
| | TMa | Extra | TM2 | Cyt | TM3 | Extracellular | TM4 | Cyt |
+-+-----+-------+-----+-C-C-+-----+-CC--C---C-----+--C--+-----+
                   *********

where Cyt is the cytoplasmic domain, TMa is the transmembrane anchor, TM2 to TM4
represent transmembrane regions 2 to 4, 'C' are conserved cysteines, and '*' indicates the
position of the consensus pattern. The consensus pattern spans a conserved region including
two cysteines located in a short cytoplasmic loop between two transmembrane domains:
Consensus pattern: G-x(3)-[LIVMF]-x(2)-[GSA]-[LIVMF](2)-G-C-x-[GA]-[STA]-x(2)-
[EG]-x(2)-[CWN]-[LIVM](2).
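
Consensus patterns of this kind can be checked against a candidate protein sequence by
translating them into regular expressions. The following Python sketch is one way this might
be done. It handles only the element types that appear in this section ('x', bracketed
alternatives, braced exclusions, and repeat counts in parentheses). The function names and the
test sequence are hypothetical, and the test sequence was constructed by hand to satisfy the
pattern rather than taken from any listed protein.

```python
import re

# One pattern element: 'x', a single residue, [alternatives] or {exclusions},
# optionally followed by a repeat count such as (3) or (4,5).
_ELEMENT = re.compile(r"(x|[A-Z]|\[[A-Z]+\]|\{[A-Z]+\})(?:\((\d+)(?:,(\d+))?\))?")


def prosite_to_regex(pattern: str) -> str:
    """Translate a hyphen-separated consensus pattern into a regular expression."""
    cleaned = re.sub(r"\s+", "", pattern).rstrip(".")
    pieces = []
    for element in cleaned.split("-"):
        match = _ELEMENT.fullmatch(element)
        if match is None:
            raise ValueError(f"unsupported pattern element: {element!r}")
        core, low, high = match.groups()
        if core == "x":
            piece = "."                       # any residue
        elif core.startswith("{"):
            piece = "[^" + core[1:-1] + "]"   # any residue except those listed
        else:
            piece = core                      # literal residue or [alternatives]
        if low is not None:
            piece += "{" + low + ("," + high if high else "") + "}"
        pieces.append(piece)
    return "".join(pieces)


def find_motifs(pattern: str, protein: str):
    """Return (1-based start, matched text) for every hit, including overlapping hits."""
    regex = re.compile("(?=(" + prosite_to_regex(pattern) + "))")
    return [(m.start() + 1, m.group(1)) for m in regex.finditer(protein)]


if __name__ == "__main__":
    tm4 = ("G-x(3)-[LIVMF]-x(2)-[GSA]-[LIVMF](2)-G-C-x-[GA]-[STA]-x(2)-"
           "[EG]-x(2)-[CWN]-[LIVM](2).")
    print(prosite_to_regex(tm4))
    print(find_motifs(tm4, "MKGAAALAAGLLGCAGSAAEAACLLRR"))  # hand-built test sequence
```

A lookahead is used in `find_motifs` so that overlapping occurrences are reported, which a
plain left-to-right scan would skip.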

b) Seven Transmembrane Integral Membrane Proteins. SEQ ID NOS: 24, 41, 101,
157, 291, 305, 315, and 341 correspond to sequences encoding polypeptides that are
members of the seven transmembrane receptor family. G-protein coupled receptors
(Strosberg, Eur. J. Biochem. (1991) 196:1; Kerlavage, Curr. Opin. Struct. Biol. (1991)
1:394; Probst et al., DNA Cell Biol. (1992) 11:1; and Savarese et al., Biochem. J. (1992)
293:1) (also called R7G) are an extensive group of hormone, neurotransmitter, odorant
and light receptors which transduce extracellular signals by interaction with guanine
nucleotide-binding (G) proteins. The tertiary structure of these receptors is thought to be
highly similar. They have seven hydrophobic regions, each of which most probably spans
the membrane. The N-terminus is located on the extracellular side of the membrane and is
often glycosylated, while the C-terminus is cytoplasmic and generally phosphorylated.
Three extracellular loops alternate with three intracellular loops to link the seven
transmembrane regions. Most, but not all, of these receptors lack a signal peptide. The most
conserved parts of these proteins are the transmembrane regions and the first two
cytoplasmic loops. A conserved acidic-Arg-aromatic triplet is present in the N-terminal
extremity of the second cytoplasmic loop (Attwood et al., Gene (1991) 98:153) and could
be implicated in the interaction with G proteins.

To detect this widespread family of proteins, a pattern is used that contains the
conserved triplet and that also spans the major part of the third transmembrane helix.
Additional information about the seven transmembrane receptor family, and methods for
their identification and use, is found in U.S. Patent No. 5,759,804. Due in part to their
expression on the cell surface and other attractive characteristics, seven transmembrane
protein family members are of particular interest as drug targets, as surface antigen markers,
and as drug delivery targets (e.g., using antibody-drug complexes and/or use of anti-seven
transmembrane protein antibodies as therapeutics in their own right).

c) Ank Repeats. SEQ ID NOS: 116 and 251 represent polynucleotides encoding Ank
repeat-containing proteins. The ankyrin motif is a 33 amino acid sequence named after the
protein ankyrin, which has 24 tandem 33-amino-acid motifs. Ank repeats were originally
identified in the cell-cycle-control protein cdc10 (Breeden et al., Nature (1987) 329:651).
Proteins containing ankyrin repeats include ankyrin, myotrophin, I-kappaB proteins, cell cycle
protein cdc10, the Notch receptor (Matsuno et al., Development (1997) 124(21):4265); G9a
(or BAT8) of the class III region of the major histocompatibility complex (Biochem J.
290:811-818, 1993), FABP, GABP, 53BP2, Lin12, glp-1, SWI4, and SWI6. The functions
of the ankyrin repeats are compatible with a role in protein-protein interactions (Bork,
Proteins (1993) 17(4):363; Lambert and Bennett, Eur. J. Biochem. (1993) 211:1; Kerr et al.,
Current Op. Cell Biol. (1992) 4:496; Bennett et al., J. Biol. Chem. (1980) 255:6424).

The 90 kD N-terminal domain of ankyrin contains a series of 24 33-amino-acid ank
repeats (Lux et al., Nature (1990) 344:36; Lambert et al., PNAS USA (1990) 87:1730).
The 24 ank repeats form four folded subdomains of 6 repeats each. These four repeat
subdomains mediate interactions with at least 7 different families of membrane proteins.
Ankyrin contains two separate binding sites for anion exchanger dimers. One site utilizes
repeat subdomain two (repeats 7-12) and the other requires both repeat subdomains 3 and 4
(repeats 13-24). Since the anion exchangers exist as dimers, ankyrin binds 4 anion
exchangers at the same time (Michaely and Bennett, J. Biol. Chem. (1995) 270(37):22050).
The repeat motifs are involved in ankyrin interaction with tubulin, spectrin, and other
membrane proteins (Lux et al., Nature (1990) 344:36).

The Rel/NF-kappaB/Dorsal family of transcription factors has activity that is
controlled by sequestration in the cytoplasm in association with inhibitory proteins referred
to as I-kappaB (Gilmore, Cell (1990) 62:841; Nolan and Baltimore, Curr. Opin. Genet. Dev.
(1992) 2:211; Baeuerle, Biochim. Biophys. Acta (1991) 1072:63; Schmitz et al., Trends Cell
Biol. (1991) 1:130). I-kappaB proteins contain 5 to 8 copies of 33 amino acid ankyrin
repeats, and certain NF-kappaB/rel proteins are also regulated by cis-acting ankyrin
repeat-containing domains, including p105 NF-kappaB, which contains a series of ankyrin
repeats (Diehl and Hannink, J. Virol. (1993) 67(12):7161). The I-kappaBs and Cactus (also
containing ankyrin repeats) inhibit activators through differential interactions with the
Rel-homology domain. The gene family includes proto-oncogenes, thus broadly implicating
I-kappaB in the control of both normal gene expression and the aberrant gene expression that
makes cells cancerous (Nolan and Baltimore, Curr. Opin. Genet. Dev. (1992) 2(2):211-220).
In the case of rel/NF-kappaB and pp40/I-kappaBβ, both the ankyrin repeats and the
carboxy-terminal domain are required for inhibiting DNA-binding activity and direct
association of pp40/I-kappaBβ with rel/NF-kappaB protein. The ankyrin repeats and the
carboxy-terminal domain of pp40/I-kappaBβ form a structure that associates with the rel
homology domain to inhibit DNA binding activity (Inoue et al., PNAS USA (1992) 89:4333).

The 4 ankyrin repeats in the amino terminus of the transcription factor subunit
GABPβ are required for its interaction with the GABPα subunit to form a functional high
affinity DNA-binding protein. These repeats can be crosslinked to DNA when GABP is
bound to its target sequence (Thompson et al., Science (1991) 253:762; LaMarco et al.,
Science (1991) 253:789).

Myotrophin, a 12.5 kDa protein having a key role in the initiation of cardiac
hypertrophy, comprises ankyrin repeats. The ankyrin repeats are characterized by a
hairpin-like protruding tip followed by a helix-turn-helix motif. The V-shaped
helix-turn-helix motifs of the repeats stack sequentially in bundles and are stabilized by
compact hydrophobic cores, whereas the protruding tips are less ordered.

d) ATPases Associated with Various Cellular Activities (AAA). SEQ ID NOS: 63,
116, 134, 136, 151, 384, and 404 represent polynucleotides encoding novel members of the
"ATPases Associated with diverse cellular Activities" (AAA) protein family. The AAA
protein family is composed of a large number of ATPases that share a conserved region of
about 220 amino acids that contains an ATP-binding site (Froehlich et al., J. Cell Biol.
(1991) 114:443; Erdmann et al., Cell (1991) 64:499; Peters et al., EMBO J. (1990) 9:1757;
Kunau et al., Biochimie (1993) 75:209-224; Confalonieri et al., BioEssays (1995) 17:639;
http://yeamob.pci.chemie.uni-tuebingen.de/AAA/Description.html). The proteins that
belong to this family contain either one or two AAA domains.

Proteins containing two AAA domains include: 1) Mammalian and Drosophila NSF
(N-ethylmaleimide-sensitive fusion protein) and the fungal homolog, SEC18, which are
involved in intracellular transport between the endoplasmic reticulum and Golgi, as well as
between different Golgi cisternae; 2) Mammalian transitional endoplasmic reticulum
ATPase (previously known as p97 or VCP), which is involved in the transfer of membranes
from the endoplasmic reticulum to the Golgi apparatus. This ATPase forms a ring-shaped
homooligomer composed of six subunits. The yeast homolog, CDC48, plays a role in
spindle pole proliferation; 3) Yeast protein PAS1, essential for peroxisome assembly, and the
related protein PAS1 from Pichia pastoris; 4) Yeast protein AFG2; 5) Sulfolobus
acidocaldarius protein SAV and Halobacterium salinarium cdcH, which may be part of a
transduction pathway connecting light to cell division.

Proteins containing a single AAA domain include: 1) Escherichia coli and other
bacteria ftsH (or hflB) protein. FtsH is an ATP-dependent zinc metallopeptidase that
degrades the heat-shock sigma-32 factor, and is an integral membrane protein with a large
cytoplasmic C-terminal domain that contains both the AAA and the protease domains; 2)
Yeast protein YME1, a protein important for maintaining the integrity of the mitochondrial
compartment. YME1 is also a zinc-dependent protease; 3) Yeast protein AFG3 (or YTA10).
This protein also contains an AAA domain followed by a zinc-dependent protease domain;
4) Subunits from the regulatory complex of the 26S proteasome (Hilt et al., Trends Biochem.
Sci. (1996) 21:96), which is involved in the ATP-dependent degradation of ubiquitinated
proteins, which subunits include: a) Mammalian subunit 4 and homologs in other higher
eukaryotes, in yeast (gene YTA5) and fission yeast (gene mts2); b) Mammalian subunit 6
(TBP7) and homologs in other higher eukaryotes and in yeast (gene YTA2); c) Mammalian
subunit 7 (MSS1) and homologs in other higher eukaryotes and in yeast (gene CIM5 or
YTA3); d) Mammalian subunit 8 (P45) and homologs in other higher eukaryotes and in yeast
(SUG1 or CIM3 or TBY1) and fission yeast (gene let1); e) Other probable subunits include
human TBP1, which influences HIV gene expression by interacting with the virus tat
transactivator protein, and yeast YTA1 and YTA6; 5) Yeast protein BCS1, a mitochondrial
protein essential for the expression of the Rieske iron-sulfur protein; 6) Yeast protein MSP1,
a protein involved in intramitochondrial sorting of proteins; 7) Yeast protein PAS8, and the
corresponding proteins PAS5 from Pichia pastoris and PAY4 from Yarrowia lipolytica; 8)
Mouse protein SKD1 and its fission yeast homolog (SpAC2G11.06); 9) Caenorhabditis
elegans meiotic spindle formation protein mei-1; 10) Yeast protein SAP1; 11) Yeast protein
YTA7; and 12) Mycobacterium leprae hypothetical protein A2126A.

In general, the AAA domains in these proteins act as ATP-dependent protein
clamps (Confalonieri et al. (1995) BioEssays 17:639). In addition to the ATP-binding 'A' and
'B' motifs, which are located in the N-terminal half of this domain, there is a highly
conserved region located in the central part of the domain which was used in the
development of the signature pattern. The consensus pattern is: [LIVMT]-x-[LIVMT]-
[LIVMF]-x-[GATMC]-[ST]-[NS]-x(4)-[LIVM]-D-x-A-[LIFA]-x-R.

e) Basic Region Plus Leucine Zipper Transcription Factors. SEQ ID NO: 374
corresponds to a polynucleotide encoding a novel member of the family of basic region plus
leucine zipper transcription factors. The bZIP superfamily (Hurst, Protein Prof. (1995)
2:105; and Ellenberger, Curr. Opin. Struct. Biol. (1994) 4:12) of eukaryotic DNA-binding
transcription factors encompasses proteins that contain a basic region mediating
sequence-specific DNA-binding followed by a leucine zipper required for dimerization.
Members of the family include transcription factor AP-1, which binds selectively to enhancer
elements in the cis control regions of SV40 and metallothionein IIA. AP-1, also known as
c-jun, is the cellular homolog of the avian sarcoma virus 17 (ASV17) oncogene v-jun.

Other members of this protein family include jun-B and jun-D, probable
transcription factors that are highly similar to jun/AP-1; the fos protein, a proto-oncogene
that forms a non-covalent dimer with c-jun; the fos-related proteins fra-1 and fos B; and
mammalian cAMP response element (CRE) binding proteins CREB, CREM, ATF-1,
ATF-3, ATF-4, ATF-5, ATF-6 and LRF-1. The consensus pattern for this protein family is:
[KR]-x(1,3)-[RKSAQ]-N-x(2)-[SAQ](2)-x-[RKTAENQ]-x-R-x-[RK].

f) Bromodomain. SEQ ID NO: 97 corresponds to a polynucleotide encoding a
polypeptide having a bromodomain region (Haynes et al., 1992, Nucleic Acids Res.
20:2603; Tamkun et al., 1992, Cell 68:561-572; and Tamkun, 1995, Curr. Opin. Genet.
Dev. 5:473-477), which is a conserved region of about 70 amino acids found in the
following proteins: 1) Higher eukaryotes transcription initiation factor TFIID 250 Kd
subunit (TBP-associated factor p250) (gene CCG1); P250 is associated with the TFIID
TATA-box binding protein and seems essential for progression of the G1 phase of the cell
cycle; 2) Human RING3, a protein of unknown function encoded in the MHC class II locus;
3) Mammalian CREB-binding protein (CBP), which mediates cAMP-gene regulation by
binding specifically to phosphorylated CREB protein; 4) Mammalian homologs of brahma,
including three brahma-like human proteins: SNF2a (hBRM), SNF2b, and BRG1; 5) Human
BS69, a protein that binds to adenovirus E1A and inhibits E1A transactivation; 6) Human
peregrin (or Br140).

The bromodomain is thought to be involved in protein-protein interactions and may
be important for the assembly or activity of multicomponent complexes involved in
transcriptional activation. The consensus pattern, which spans a major part of the
bromodomain, is: [STANVF]-x(2)-F-x(4)-[DNS]-x(5,7)-[DENQTF]-Y-[HFY]-x(2)-
[LIVMFY]-x(3)-[LIVM]-x(4)-[LIVM]-x(6,8)-Y-x(12,13)-[LIVM]-x(2)-N-[SACF]-x(2)-
[FY].

g) EF-Hand. SEQ ID NOS: 136, 242, and 379 correspond to polynucleotides
encoding novel proteins in the family of EF-hand proteins. Many calcium-binding proteins
belong to the same evolutionary family and share a type of calcium-binding domain known
as the EF-hand (Kawasaki et al., Protein Prof. (1995) 2:305-490). This type of domain
consists of a twelve residue loop flanked on both sides by a twelve residue alpha-helical
domain. In an EF-hand loop the calcium ion is coordinated in a pentagonal bipyramidal
configuration. The six residues involved in the binding are in positions 1, 3, 5, 7, 9 and 12;
these residues are denoted by X, Y, Z, -Y, -X and -Z. The invariant Glu or Asp at position
12 provides two oxygens for liganding Ca (bidentate ligand).
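
The position labels described above can be made concrete with a short Python sketch that
reads the six calcium-coordinating residues out of a twelve-residue loop. The function names
are hypothetical. The example loop, DKDGDGTITTKE, is used here as an illustrative input and
is assumed to correspond to the first calcium-binding loop of vertebrate calmodulin; it is not
drawn from the Sequence Listing.

```python
# Loop positions (1-based) of the calcium-coordinating residues and their labels,
# as described in the text: X, Y, Z, -Y, -X and -Z at positions 1, 3, 5, 7, 9 and 12.
LIGAND_POSITIONS = {"X": 1, "Y": 3, "Z": 5, "-Y": 7, "-X": 9, "-Z": 12}


def ef_hand_ligands(loop: str) -> dict:
    """Map each coordination label to the residue found at that loop position."""
    if len(loop) != 12:
        raise ValueError("an EF-hand loop is expected to have 12 residues")
    return {label: loop[pos - 1] for label, pos in LIGAND_POSITIONS.items()}


def has_bidentate_ligand(loop: str) -> bool:
    """Check that position 12 (-Z) is Glu or Asp, the residue that supplies two oxygens."""
    return ef_hand_ligands(loop)["-Z"] in ("D", "E")


if __name__ == "__main__":
    loop = "DKDGDGTITTKE"  # illustrative input (assumed calmodulin loop 1)
    print(ef_hand_ligands(loop))
    print(has_bidentate_ligand(loop))
```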

Proteins known to contain EF-hand regions include: calmodulin (Ca=4, except in
yeast where Ca=3) ("Ca=" indicates approximate number of EF-hand regions);
diacylglycerol kinase (EC 2.7.1.107) (DGK) (Ca=2); FAD-dependent glycerol-3-
phosphate dehydrogenase (EC 1.1.99.5) from mammals (Ca=1); guanylate cyclase activating
protein (GCAP) (Ca=3); MIF related proteins 8 (MRP-8 or CFAG) and 14 (MRP-14)
(Ca=2); myosin regulatory light chains (Ca=1); oncomodulin (Ca=2); osteonectin (basement
membrane protein BM-40) (SPARC); and proteins that contain an "osteonectin" domain
(QR1, matrix glycoprotein SC1).

The consensus pattern includes the complete EF-hand loop as well as the first residue
which follows the loop and which seems to always be hydrophobic.

Consensus pattern: D-x-[DNS]-{ILVFYW}-[DENSTG]-[DNQGHRK]-{GP}-
[LIVMC]-[DENQSTAGC]-x(2)-[DE]-[LIVMFYW]

h) Eukaryotic Aspartyl Proteases. SEQ ID NO: 308 corresponds to a gene encoding a
novel eukaryotic aspartyl protease. Aspartyl proteases, also known as acid proteases (EC
3.4.23.-), are a widely distributed family of proteolytic enzymes (Foltmann B., Essays
Biochem. (1981) 17:52; Davies D.R., Annu. Rev. Biophys. Chem. (1990) 19:189; Rao
J.K.M., et al., Biochemistry (1991) 30:4663) known to exist in vertebrates, fungi, plants,
retroviruses and some plant viruses. Aspartyl proteases of eukaryotes are monomeric
enzymes which consist of two domains. Each domain contains an active site centered on a
catalytic aspartyl residue. The two domains most probably evolved from the duplication of
an ancestral gene encoding a primordial domain. Currently known eukaryotic aspartyl
proteases include: 1) Vertebrate gastric pepsins A and C (also known as gastricsin); 2)
Vertebrate chymosin (rennin), involved in digestion and used for making cheese; 3)
Vertebrate lysosomal cathepsins D (EC 3.4.23.5) and E (EC 3.4.23.34); 4) Mammalian renin
(EC 3.4.23.15), whose function is to generate angiotensin I from angiotensinogen in the
plasma; 5) Fungal proteases such as aspergillopepsin A (EC 3.4.23.18), candidapepsin (EC
3.4.23.24), mucoropepsin (EC 3.4.23.23) (mucor rennin), endothiapepsin (EC 3.4.23.22),
polyporopepsin (EC 3.4.23.29), and rhizopuspepsin (EC 3.4.23.21); 6) Yeast
saccharopepsin (EC 3.4.23.25) (proteinase A) (gene PEP4). PEP4 is implicated in
posttranslational regulation of vacuolar hydrolases; 7) Yeast barrierpepsin (EC 3.4.23.35)
(gene BAR1), a protease that cleaves alpha-factor and thus acts as an antagonist of the
mating pheromone; and 8) Fission yeast sxa1, which is involved in degrading or processing
the mating pheromones.

Most retroviruses and some plant viruses, such as badnaviruses, encode an
aspartyl protease which is a homodimer of a chain of about 95 to 125 amino acids. In most
retroviruses, the protease is encoded as a segment of a polyprotein which is cleaved during
the maturation process of the virus. It is generally part of the pol polyprotein and, more
rarely, of the gag polyprotein. Because the sequence around the two aspartates of eukaryotic
aspartyl proteases and around the single active site of the viral proteases is conserved, a
single signature pattern can be used to identify members of both groups of proteases. The
consensus pattern is: [LIVMFGAC]-[LIVMTADN]-[LIVFSA]-D-[ST]-G-[STAV]-
[STAPDENQ]-x-[LIVMFSTNC]-x-[LIVMFGTA], where D is the active site residue.

i) GATA Family of Transcription Factors. SEQ ID NO: 213 corresponds to a novel
member of the GATA family of transcription factors. The GATA family of transcription
factors are proteins that bind to DNA sites with the consensus sequence (A/T)GATA(A/G),
found within the regulatory region of a number of genes. Proteins currently known to belong
to this family are: 1) GATA-1 (Trainor, C.D., et al., Nature (1990) 343:92) (also known as
Eryf1, GF-1 or NF-E1), which binds to the GATA region of globin genes and other genes
expressed in erythroid cells. It is a transcriptional activator which probably serves as a
general 'switch' factor for erythroid development; 2) GATA-2 (Lee, M.E., et al., J. Biol.
Chem. (1991) 266:16188), a transcriptional activator which regulates endothelin-1 gene
expression in endothelial cells; 3) GATA-3 (Ho, I.-C., et al., EMBO J. (1991) 10:1187), a
transcriptional activator which binds to the enhancer of the T-cell receptor alpha and delta
genes; 4) GATA-4 (Spieth, J., et al., Mol. Cell. Biol. (1991) 11:4651), a transcriptional
activator expressed in endodermally derived tissues and heart; 5) Drosophila protein pannier
(or DGATAa) (gene pnr), which acts as a repressor of the achaete-scute complex (as-c); 6)
Bombyx mori BCFI (Drevet, J.R., et al., J. Biol. Chem. (1994) 269:10660), which regulates
the expression of chorion genes; 7) Caenorhabditis elegans elt-1 and elt-2, transcriptional
activators of genes containing the GATA region, including vitellogenin genes (Hawkins,
M.G., et al., J. Biol. Chem. (1995) 270:14666); 8) Ustilago maydis urbs1 (Voisard, C.P.O.,
et al., Mol. Cell. Biol. (1993) 13:7091), a protein involved in the repression of the
biosynthesis of siderophores; 9) Fission yeast protein GAF2.
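
The DNA binding-site consensus (A/T)GATA(A/G) mentioned above can be located in a
nucleotide sequence with a simple scan of both strands. The Python sketch below is one
possible approach and is offered only as an illustration. The function name and the test
sequence are hypothetical, and the reverse-strand expression is simply the reverse complement
of the forward consensus.

```python
import re

# Forward-strand consensus (A/T)GATA(A/G) and its reverse complement (C/T)TATC(A/T).
# Lookaheads are used so that overlapping sites are all reported.
FORWARD_SITE = re.compile(r"(?=([AT]GATA[AG]))")
REVERSE_SITE = re.compile(r"(?=([CT]TATC[AT]))")


def find_gata_sites(dna: str):
    """Return (1-based position, strand, matched text) for each candidate site.

    Positions refer to the given (forward) strand; '-' marks sites whose
    consensus reads on the complementary strand.
    """
    dna = dna.upper()
    hits = [(m.start() + 1, "+", m.group(1)) for m in FORWARD_SITE.finditer(dna)]
    hits += [(m.start() + 1, "-", m.group(1)) for m in REVERSE_SITE.finditer(dna)]
    return sorted(hits)


if __name__ == "__main__":
    print(find_gata_sites("CCAGATAAGGCTTATCTGG"))  # hypothetical test sequence
```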

All these transcription factors contain a pair of highly similar 'zinc finger' type
domains with the consensus sequence C-x2-C-x17-C-x2-C. Some other proteins contain a
single zinc finger motif highly related to those of the GATA transcription factors. These
proteins are: 1) Drosophila box A-binding factor (ABF) (also known as protein serpent
(gene srp)), which may function as a transcriptional activator protein and may play a key role
in the organogenesis of the fat body; 2) Emericella nidulans areA (Arst, H.N., Jr., et al.,
Trends Genet. (1989) 5:291), a transcriptional activator which mediates nitrogen metabolite
repression; 3) Neurospora crassa nit-2 (Fu, Y.-H., et al., Mol. Cell. Biol. (1990) 10:1056), a
transcriptional activator which turns on the expression of genes coding for enzymes required
for the use of a variety of secondary nitrogen sources during conditions of nitrogen
limitation; 4) Neurospora crassa white collar proteins 1 and 2 (WC-1 and WC-2), which
control expression of light-regulated genes; 5) Saccharomyces cerevisiae DAL81 (or
UGA43), a negative nitrogen regulatory protein; 6) Saccharomyces cerevisiae GLN3, a
positive nitrogen regulatory protein; 7) Saccharomyces cerevisiae GAT1; 8) Saccharomyces
cerevisiae GZF3.

The consensus pattern for the GATA family is: C-x-[DN]-C-x(4,5)-[ST]-x(2)-W-
[HR]-[RK]-x(3)-[GN]-x(3,4)-C-N-[AS]-C, where the four C's are zinc ligands.

j) G-Protein Alpha Subunit. SEQ ID NO: 367 corresponds to a gene encoding a novel
polypeptide of the G-protein alpha subunit family. Guanine nucleotide binding proteins
(G-proteins) are a family of membrane-associated proteins that couple extracellularly-activated
integral-membrane receptors to intracellular effectors, such as ion channels and enzymes that
vary the concentration of second messenger molecules. G-proteins are composed of 3
subunits (alpha, beta and gamma) which, in the resting state, associate as a trimer at the inner
face of the plasma membrane. The alpha subunit has a molecule of guanosine diphosphate
(GDP) bound to it. Stimulation of the G-protein by an activated receptor leads to exchange
of the bound GDP for GTP (guanosine triphosphate). This results in the separation of the
alpha from the beta and gamma subunits, which always remain tightly associated as a dimer.
Both the alpha and beta-gamma subunits are then able to interact with effectors, either
individually or in a cooperative manner. The intrinsic GTPase activity of the alpha subunit
hydrolyses the bound GTP to GDP. This returns the alpha subunit to its inactive conformation
and allows it to reassociate with the beta-gamma subunit, thus restoring the system to its
resting state.

G-protein alpha subunits are 350-400 amino acids in length and have molecular
weights in the range 40-45 kDa. Seventeen distinct types of alpha subunit have been
identified in mammals. These fall into 4 main groups on the basis of both sequence
similarity and function: alpha-s, alpha-q, alpha-i and alpha-12 (Simon et al., Science (1993)
252:802). Many alpha subunits are substrates for ADP-ribosylation by cholera or pertussis
toxins. They are often N-terminally acylated, usually with myristate and/or palmitate,
and these fatty acid modifications are probably important for membrane association and
high-affinity interactions with other proteins. The atomic structure of the alpha subunit of
the G-protein involved in mammalian vision, transducin, has been elucidated in both GTP-
and GDP-bound forms, and shows considerable similarity in both primary and tertiary
structure in the nucleotide-binding regions to other guanine nucleotide binding proteins, such
as p21-ras and EF-Tu.

k) Phorbol Esters/Diacylglycerol Binding. SEQ ID NOS:188 and 251 represent
polynucleotides encoding a protein belonging to the family including phorbol
esters/diacylglycerol binding proteins. Diacylglycerol (DAG) is an important second
messenger. Phorbol esters (PE) are analogues of DAG and potent tumor promoters that
cause a variety of physiological changes when administered to both cells and tissues. DAG
activates a family of serine/threonine protein kinases, collectively known as protein kinase C
(PKC) (Azzi et al., Eur. J. Biochem. (1992) 208:547). Phorbol esters can directly stimulate
PKC. The N-terminal region of PKC, known as C1, has been shown (Ono et al., Proc. Natl.

WO 99/33982 PCT/US98/27610

Acad. Sci. USA (1989) 86:4868) to bind PE and DAG in a phospholipid- and zinc-dependent
fashion. The C1 region contains one or two copies (depending on the isozyme of PKC) of a
cysteine-rich domain about 50 amino-acid residues long and essential for DAG/PE-binding.
Such a domain has also been found in, for example, the following proteins:

(1) Diacylglycerol kinase (EC 2.7.1.107) (DGK) (Sakane et al., Nature (1990)
344:345), the enzyme that converts DAG into phosphatidate. It contains two copies of the
DAG/PE-binding domain in its N-terminal section. At least five different forms of DGK are
known in mammals; and

(2) N-chimaerin, a brain-specific protein which shows sequence similarities with the
BCR protein at its C-terminal part and contains a single copy of the DAG/PE-binding
domain at its N-terminal part. It has been shown (Ahmed et al., Biochem. J. (1990) 272:767,
and Ahmed et al., Biochem. J. (1991) 280:233) to be able to bind phorbol esters.

The DAG/PE-binding domain binds two zinc ions; the ligands of these metal ions are
probably the six cysteines and two histidines that are conserved in this domain. The
signature pattern completely spans the DAG/PE domain. The consensus pattern is: H-x-
[LIVMFYW]-x(8,11)-C-x(2)-C-x(3)-[LIVMFC]-x(5,10)-C-x(2)-C-x(4)-[HD]-x(2)-C-x(5,9)-
C. All the C and H are probably involved in binding zinc.

l) Protein Kinase. SEQ ID NOS:202, 315, 367, and 397 represent polynucleotides
encoding protein kinases. Protein kinases catalyze phosphorylation of proteins in a variety of
pathways, and are implicated in cancer. Eukaryotic protein kinases (Hanks S.K., et al.,
FASEB J. (1995) 9:576; Hunter T., Meth. Enzymol. (1991) 200:3; Hanks S.K., et al., Meth.
Enzymol. (1991) 200:38; Hanks S.K., Curr. Opin. Struct. Biol. (1991) 7:369; Hanks S.K., et
al., Science (1988) 241:42) are enzymes that belong to a very extensive family of proteins
which share a conserved catalytic core common to both serine/threonine and tyrosine protein
kinases. There are a number of conserved regions in the catalytic domain of protein kinases.
Two of the conserved regions are the basis for the signature pattern in the protein kinase
profile. The first region, which is located in the N-terminal extremity of the catalytic
domain, is a glycine-rich stretch of residues in the vicinity of a lysine residue, which has
been shown to be involved in ATP binding. The second region, which is located in the
central part of the catalytic domain, contains a conserved aspartic acid residue which is
important for the catalytic activity of the enzyme (Knighton D.R., et al., Science (1991)
255:407). The protein kinase profile includes two signature patterns for this second region:
one specific for serine/threonine kinases and the other for tyrosine kinases. A third profile is
based on the alignment in Hanks S.K., et al., FASEB J. (1995) 9:576, and covers the entire
catalytic domain. The consensus patterns are as follows:

1) Consensus pattern: [LIV]-G-{P}-G-{P}-[FYWMGSTNH]-[SGA]-{PW}-
[LIVCAT]-{PD}-x-[GSTACLIVMFY]-x(5,18)-[LIVMFYWCSTAR]-[AIVP]-
[LIVMFAGCKR]-K, where K binds ATP. The majority of known protein kinases are
detected by this pattern. Protein kinases that are not detected by this consensus include
viral kinases, which are quite divergent in this region and are completely missed by this
pattern.

2) Consensus pattern: [LIVMFYC]-x-[HY]-x-D-[LIVMFY]-K-x(2)-N-
[LIVMFYCT](3), where D is an active site residue. This consensus sequence identifies most
serine/threonine-specific protein kinases with only 10 exceptions. Half of the exceptions are
viral kinases, while the other exceptions include Epstein-Barr virus BGLF4 and Drosophila
ninaC, which have Ser and Arg, respectively, instead of the conserved Lys. These latter two
protein kinases are detected by the tyrosine kinase specific pattern described below.

3) Consensus pattern: [LIVMFYC]-x-[HY]-x-D-[LIVMFY]-[RSTAC]-x(2)-N-
[LIVMFYC], where D is an active site residue. All tyrosine-specific protein kinases are
detected by this consensus pattern, with the exception of human ERBB3 and mouse blk.
This pattern also detects most bacterial aminoglycoside phosphotransferases (Benner S.,
Nature (1987) 329:21; Kirby R., J. Mol. Evol. (1992) 30:489) and herpesvirus ganciclovir
kinases (Littler E., et al., Nature (1992) 358:160), which are structurally and evolutionarily
related to protein kinases.

The protein kinase profile also detects receptor guanylate cyclases and 2-5A-
dependent ribonucleases. Sequence similarities between these two families and the
eukaryotic protein kinase family have been noticed previously. The profile also detects
Arabidopsis thaliana kinase-like protein TMKL1, which seems to have lost its catalytic
activity.

If a protein analyzed includes two of the above protein kinase signatures, the
probability of it being a protein kinase is close to 100%. Eukaryotic-type protein kinases
have also been found in prokaryotes such as Myxococcus xanthus (Munoz-Dorado J., et al.,
Cell (1991) 67:995) and Yersinia pseudotuberculosis. The patterns shown above have been
updated since their publication in Bairoch A., et al., Nature (1988) 331:22.
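
The two-signature rule stated above can be sketched as follows. This Python fragment is
illustrative only and is not part of the original disclosure. The regular expressions are
hand translations of consensus patterns 1) to 3); the function name kinase_signatures and
the three toy fragments are hypothetical; and the sketch assumes that "two of the above
signatures" means the ATP-binding signature together with either active-site signature.

```python
import re

# Hand-translated from consensus patterns 1), 2) and 3) in the text.
SIGNATURES = {
    "atp_binding": re.compile(
        r"[LIV]G[^P]G[^P][FYWMGSTNH][SGA][^PW][LIVCAT][^PD]."
        r"[GSTACLIVMFY].{5,18}[LIVMFYWCSTAR][AIVP][LIVMFAGCKR]K"
    ),
    "ser_thr_active_site": re.compile(r"[LIVMFYC].[HY].D[LIVMFY]K.{2}N[LIVMFYCT]{3}"),
    "tyr_active_site": re.compile(r"[LIVMFYC].[HY].D[LIVMFY][RSTAC].{2}N[LIVMFYC]"),
}

def kinase_signatures(protein: str) -> dict:
    """Report which signatures occur and whether the two-signature rule is met."""
    hits = {name: bool(rx.search(protein)) for name, rx in SIGNATURES.items()}
    # Assumed reading of the rule: ATP-binding site plus one active-site signature.
    hits["likely_kinase"] = hits["atp_binding"] and (
        hits["ser_thr_active_site"] or hits["tyr_active_site"]
    )
    return hits

# Made-up fragments constructed to satisfy one pattern each.
ATP_TOY = "LGTGSFGRVMLVKHKETGNHYAMK"
ST_TOY = "LIYRDLKPENLLI"
TY_TOY = "FIHRDLAARNCLV"

print(kinase_signatures("MA" + ATP_TOY + "AAAA" + ST_TOY + "GG"))
print(kinase_signatures("AAA" + TY_TOY + "AAA"))
```

A sequence carrying only one signature is reported but not flagged, which mirrors the
exceptions noted above, such as divergent viral kinases.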

m) Protein Phosphatase 2C. SEQ ID NO:256 corresponds to a polynucleotide
encoding a novel protein phosphatase 2C (PP2C), which is one of the four major classes of
mammalian serine/threonine specific protein phosphatases. PP2C (Wenk et al., FEBS Lett.

(1992) 297:135) is a monomeric enzyme of about 42 kDa which shows broad substrate
specificity and is dependent on divalent cations (mainly manganese and magnesium) for its
activity. Three isozymes are currently known in mammals: PP2C-alpha, -beta and -gamma.

n) Protein Tyrosine Phosphatase. SEQ ID NO:382 represents a polynucleotide
encoding a protein tyrosine phosphatase. Tyrosine specific protein phosphatases (EC 3.1.3.48)
(PTPase) (Fischer et al., Science (1991) 255:401; Charbonneau et al., Annu. Rev. Cell Biol.
(1992) 5:463; Trowbridge, J. Biol. Chem. (1991) 266:23517; Tonks et al., Trends Biochem.
Sci. (1989) 74:497; and Hunter, Cell (1989) 55:1013) catalyze the removal of a phosphate
group attached to a tyrosine residue. These enzymes are very important in the control of cell
growth, proliferation, differentiation and transformation. Multiple forms of PTPase have
been characterized and can be classified into two categories: soluble PTPases and
transmembrane receptor proteins that contain PTPase domain(s).

Soluble PTPases include PTPN3 (H1) and PTPN4 (MEG), enzymes that contain an
N-terminal band 4.1-like domain and could act at junctions between the membrane and
cytoskeleton; and PTPN6 (PTP-1C; HCP; SHP) and PTPN11 (PTP-2C; SH-PTP3; Syp),
enzymes that contain two copies of the SH2 domain at their N-terminal extremity.

Dual specificity PTPases include DUSP1 (PTPN10; MAP kinase phosphatase-1;
MKP-1), which dephosphorylates MAP kinase on both Thr-183 and Tyr-185; and DUSP2
(PAC-1), a nuclear enzyme that dephosphorylates MAP kinases ERK1 and ERK2 on both
Thr and Tyr residues.

Structurally, all known receptor PTPases are made up of a variable length
extracellular domain, followed by a transmembrane region and a C-terminal catalytic
cytoplasmic domain. Some of the receptor PTPases contain fibronectin type III (FN-III)
repeats, immunoglobulin-like domains, MAM domains or carbonic anhydrase-like domains
in their extracellular region. The cytoplasmic region generally contains two copies of the
PTPase domain. The first seems to have enzymatic activity, while the second is inactive but
seems to affect substrate specificity of the first. In these domains, the catalytic cysteine is
generally conserved but some other, presumably important, residues are not.

PTPase domains consist of about 300 amino acids. There are two conserved
cysteines and the second one has been shown to be absolutely required for activity.
Furthermore, a number of conserved residues in its immediate vicinity have also been shown
to be important. The consensus pattern for PTPases is: [LIVMF]-H-C-x(2)-G-x(3)-[STC]-
[STAGP]-x-[LIVMFY]; C is the active site residue.

o) SH3 Domain. SEQ ID NOS:306 and 386 represent polynucleotides encoding SH3
domain proteins. The Src homology 3 (SH3) domain is a small protein domain of about 60
amino acid residues first identified as a conserved sequence in the non-catalytic part of
several cytoplasmic protein tyrosine kinases (e.g. Src, Abl, Lck) (Mayer et al., Nature (1988)
532:272). The domain has also been found in a variety of intracellular or membrane-
associated proteins (Musacchio et al., FEBS Lett. (1992) 307:55; Pawson et al., Curr. Biol.
(1993) 3:434; Mayer et al., Trends Cell Biol. (1993) 3:8; and Pawson et al., Nature (1995)
373:573).

The SH3 domain has a characteristic fold that consists of five or six beta-strands
arranged as two tightly packed anti-parallel beta sheets. The linker regions may contain
short helices (Kuriyan et al., Curr. Opin. Struct. Biol. (1993) 3:828). It is believed that SH3
domain-containing proteins mediate assembly of specific protein complexes via binding to
proline-rich peptides (Morton et al., Curr. Biol. (1994) 4:615). In general, SH3 domains are
found as single copies in a given protein, but there is a significant number of proteins with
two SH3 domains and a few with 3 or 4 copies.

SH3 domains have been identified in, for example, protein tyrosine kinases, such as
the Src, Abl, Bkt, Csk and ZAP70 families of kinases; mammalian phosphatidylinositol-
specific phospholipase C-gamma-1 and -2; mammalian phosphatidylinositol 3-kinase
regulatory p85 subunit; mammalian Ras GTPase-activating protein (GAP); mammalian Vav
oncoprotein, a guanine nucleotide exchange factor of the CDC24 family; Drosophila
lethal(1)discs large-1 tumor suppressor protein (gene Dlg1); mammalian tight junction
protein ZO-1; vertebrate erythrocyte membrane protein p55; Caenorhabditis elegans protein
lin-2; rat protein CASK; and mammalian synaptic proteins SAP90/PSD-95, CHAPSYN-
110/PSD-93, SAP97/DLG1 and SAP102. Novel SH3-domain containing polypeptides will
facilitate elucidation of the role of such proteins in important biological pathways, such as
ras activation.

p) Trypsin. SEQ ID NO:169 corresponds to a novel serine protease of the trypsin
family. The catalytic activity of the serine proteases from the trypsin family is provided by a
charge relay system involving an aspartic acid residue hydrogen-bonded to a histidine, which
itself is hydrogen-bonded to a serine. The sequences in the vicinity of the active site serine
and histidine residues are well conserved in this family of proteases (Brenner S., Nature
(1988) 334:528). Proteases known to belong to the trypsin family include: 1) Acrosin; 2)
Blood coagulation factors VII, IX, X, XI and XII, thrombin, plasminogen, and protein C; 3)
Cathepsin G; 4) Chymotrypsins; 5) Complement components C1r, C1s, C2, and complement
factors B, D and I; 6) Complement-activating component of RA-reactive factor; 7) Cytotoxic
cell proteases (granzymes A to H); 8) Duodenase I; 9) Elastases 1, 2, 3A, 3B (protease E),
leukocyte (medullasin); 10) Enterokinase (EC 3.4.21.9) (enteropeptidase); 11) Hepatocyte
growth factor activator; 12) Hepsin; 13) Glandular (tissue) kallikreins (including EGF-
binding protein types A, B, and C, NGF-gamma chain, gamma-renin, prostate specific
antigen (PSA) and tonin); 14) Plasma kallikrein; 15) Mast cell proteases (MCP) 1 (chymase)
to 8; 16) Myeloblastin (proteinase 3) (Wegener's autoantigen); 17) Plasminogen activators
(urokinase-type, and tissue-type); 18) Trypsins I, II, III, and IV; 19) Tryptases; 20) Snake
venom proteases such as ancrod, batroxobin, cerastobin, flavoxobin, and protein C activator;
21) Collagenase from common cattle grub and collagenolytic protease from Atlantic sand
fiddler crab; 22) Apolipoprotein(a); 23) Blood fluke cercarial protease; 24) Drosophila
trypsin-like proteases: alpha, easter, snake-locus; 25) Drosophila protease stubble (gene sb);
and 26) Major mite fecal allergen Der p III. All the above proteins belong to family S1 in
the classification of peptidases (Rawlings N.D., et al., Meth. Enzymol. (1994) 244:19;
http://www.expasy.ch/cgi-bin/lists?peptidas.txt) and originate from eukaryotic species. It
should be noted that bacterial proteases that belong to family S2A are similar enough in the
regions of the active site residues that they can be picked up by the same patterns.

The consensus patterns for this trypsin protein family are: 1) [LIVM]-[ST]-A-
[STAG]-H-C, where H is the active site residue. All sequences known to belong to this class
are detected by the pattern, except for complement components C1r and C1s, pig plasminogen,
bovine protein C, rodent urokinase, ancrod, gyroxin and two insect trypsins; 2)
[DNSTAGC]-[GSTAPIMVQH]-x(2)-G-[DE]-S-G-[GS]-[SAPHV]-[LIVMFYWH]-
[LIVMFYSTANQH], where S is the active site residue. All sequences known to belong to
this family are detected by the above consensus sequences, except for 18 different proteases
which have lost the first conserved glycine. If a protein includes both the serine and the
histidine active site signatures, the probability of it being a trypsin family serine protease is
100%.

q) WD Domain, G-Beta Repeats. SEQ ID NOS:188 and 335 represent novel
members of the WD domain/G-beta repeat family. Beta-transducin (G-beta) is one of the
three subunits (alpha, beta, and gamma) of the guanine nucleotide-binding proteins (G
proteins) which act as intermediaries in the transduction of signals generated by
transmembrane receptors (Gilman, Annu. Rev. Biochem. (1987) 56:615). The alpha subunit
binds to and hydrolyzes GTP; the functions of the beta and gamma subunits are less clear but
they seem to be required for the replacement of GDP by GTP as well as for membrane
anchoring and receptor recognition.

In higher eukaryotes, G-beta exists as a small multigene family of highly conserved
proteins of about 340 amino acid residues. Structurally, G-beta consists of eight tandem
repeats of about 40 residues, each containing a central Trp-Asp motif (this type of repeat is
sometimes called a WD-40 repeat). Such a repetitive segment has been shown to exist in a
number of other proteins including: human LIS1, a neuronal protein involved in type-1
lissencephaly; and mammalian coatomer beta' subunit (beta'-COP), a component of a
cytosolic protein complex that reversibly associates with Golgi membranes to form vesicles
that mediate biosynthetic protein transport.

The consensus pattern for the WD domain/G-Beta repeat family is: [LIVMSTAC]-
[LIVMFYWSTAGC]-[LIMSTAG]-[LIVMSTAGC]-x(2)-[DN]-x(2)-[LIVMWSTAC]-x-
[LIVMFSTAG]-W-[DEN]-[LIVMFSTAGCN].

r) wnt Family of Developmental Signaling Proteins. SEQ ID NOS:23, 291, 324, 330,
341, and 353 correspond to novel members of the wnt family of developmental signaling
proteins. Wnt-1 (previously known as int-1), the seminal member of this family (Nusse R.,
Trends Genet. (1988) 4:291), is a proto-oncogene induced by the integration of the mouse
mammary tumor virus. It is thought to play a role in intercellular communication and seems
to be a signaling molecule important in the development of the central nervous system
(CNS). The sequence of wnt-1 is highly conserved in mammals, fish, and amphibians. Wnt-1
was found to be a member of a large family of related proteins (Nusse R., et al., Cell
(1992) 69:1073; McMahon A.P., Trends Genet. (1992) 5:1; Moon R.T., BioEssays (1993)
75:91) that are all thought to be developmental regulators. These proteins are known as wnt-2
(also known as irp), wnt-3, -3A, -4, -5A, -5B, -6, -7A, -7B, -8, -8B, -9 and -10. At least
four members of this family are present in Drosophila; one of them, wingless (wg), is
implicated in segmentation polarity. All these proteins share the following features
characteristic of secretory proteins: a signal peptide, several potential N-glycosylation sites
and 22 conserved cysteines that are probably involved in disulfide bonds. The Wnt proteins
seem to adhere to the plasma membrane of the secreting cells and are therefore likely to
signal over only a few cell diameters. The consensus pattern, which is based upon a highly
conserved region including three cysteines, is as follows: C-K-C-H-G-[LIVMT]-S-G-x-C.
All sequences known to belong to this family are detected by the provided consensus pattern.

s) WW/rsp5/WWP Domain-Containing Proteins. SEQ ID NOS:188, 379, and 395
represent polynucleotides encoding a polypeptide in the family of WW/rsp5/WWP domain-
containing proteins. The WW domain (Bork et al., Trends Biochem. Sci. (1994) 19:531;
Andre et al., Biochem. Biophys. Res. Commun. (1994) 205:1201; Hofmann et al., FEBS Lett.
(1995) 555:153; and Sudol et al., FEBS Lett. (1995) 369:67), also known as rsp5 or WWP,
was originally discovered as a short conserved region in a number of unrelated proteins,
among them dystrophin, the product of the gene responsible for Duchenne muscular
dystrophy. The domain, which spans about 35 residues, is repeated up to 4 times in some
proteins. It has been shown (Chen et al., Proc. Natl. Acad. Sci. USA (1995) 92:7819) to bind
proteins with particular proline motifs, [AP]-P-P-[AP]-Y, and thus somewhat resembles SH3
domains. It appears to contain beta-strands grouped around four conserved aromatic
positions, generally Trp. The name WW or WWP derives from the presence of these Trp as
well as that of a conserved Pro. It is frequently associated with other domains typical for
proteins in signal transduction processes.

Proteins containing the WW domain include:

1. Dystrophin, a multidomain cytoskeletal protein. Its longest alternatively
spliced form consists of an N-terminal actin-binding domain, followed by 24 spectrin-like
repeats, a cysteine-rich calcium-binding domain and a C-terminal globular domain.
Dystrophin forms tetramers and is thought to have multiple functions including involvement
in membrane stability, transduction of contractile forces to the extracellular environment and
organization of membrane specialization. Mutations in the dystrophin gene lead to muscular
dystrophy of Duchenne or Becker type. Dystrophin contains one WW domain C-terminal of
the spectrin repeats.

2. Vertebrate YAP protein, which is a substrate of an unknown serine kinase. It
binds to the SH3 domain of the Yes oncoprotein via a proline-rich region. This protein
appears in alternatively spliced isoforms, containing either one or two WW domains.

3. IQGAP, which is a human GTPase activating protein acting on ras. It
contains an N-terminal domain similar to fly muscle mp20 protein and a C-terminal ras
GTPase activator domain.

For the sensitive detection of WW domains, a profile that spans the whole homology
region is used in addition to a pattern. The consensus for this family is: W-x(9,11)-[VFY]-[FYW]-
x(6,7)-[GSTNE]-[GSTQCR]-[FYW]-x(2)-P.

t) Zinc Finger, C2H2 Type. SEQ ID NOS:61, 306, and 386 correspond to
polynucleotides encoding novel members of the C2H2 type zinc finger protein family.
Zinc finger domains (Klug et al., Trends Biochem. Sci. (1987) 72:464; Evans et al., Cell
(1988) 52:1; Payre et al., FEBS Lett. (1988) 254:245; Miller et al., EMBO J. (1985) 4:1609;
and Berg, Proc. Natl. Acad. Sci. USA (1988) 55:99) are nucleic acid-binding protein
structures first identified in the Xenopus transcription factor TFIIIA. These domains have
since been found in numerous nucleic acid-binding proteins. A zinc finger domain is
composed of 25 to 30 amino acid residues. Two cysteine or histidine residues are positioned
at both extremities of the domain, which are involved in the tetrahedral coordination of a
zinc atom. It has been proposed that such a domain interacts with about five nucleotides.

Many classes of zinc fingers are characterized according to the number and positions
of the histidine and cysteine residues involved in the zinc atom coordination. In the first
class to be characterized, called C2H2, the first pair of zinc coordinating residues are
cysteines, while the second pair are histidines. A number of experimental reports have
demonstrated the zinc-dependent DNA or RNA binding property of some members of this
class.

Mammalian proteins having a C2H2 zinc finger include (number in parentheses indicates
number of zinc finger regions in the protein): basonuclin (6), BCL-6/LAZ-3 (6), erythroid
krueppel-like transcription factor (3), transcription factors Sp1 (3), Sp2 (3), Sp3 (3) and
Sp4 (3), transcriptional repressor YY1 (4), Wilms' tumor protein (4), EGR1/Krox24 (3),
EGR2/Krox20 (3), EGR3/Pilot (3), EGR4/AT133 (4), Evi-1 (10), GLI (5), GLI2 (4+),
GLI3 (3+), HIV-EP1/ZNF40 (4), HIV-EP2 (2), KR1 (9+), KR2 (9), KR3 (15+), KR4 (14+),
KR5 (11+), HF.12 (6+), REX-1 (4), ZfX (13), ZfY (13), Zfp-35 (18), ZNF7 (15), ZNF8 (7),
ZNF35 (10), ZNF42/MZF-1 (13), ZNF43 (22), ZNF46/Kup (2), ZNF76 (7), ZNF91 (36),
ZNF133 (3).

In addition to the conserved zinc ligand residues, it has been shown that a number of
other positions are also important for the structural integrity of the C2H2 zinc fingers
(Rosenfeld et al., J. Biomol. Struct. Dyn. (1993) 77:557). The best conserved position is
found four residues after the second cysteine; it is generally an aromatic or aliphatic residue.

The consensus pattern for C2H2 zinc fingers is: C-x(2,4)-C-x(3)-[LIVMFYWC]-
x(8)-H-x(3,5)-H. The two C's and two H's are zinc ligands.
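
Because the proteins listed above differ mainly in their number of finger regions, one
practical use of the C2H2 consensus pattern is to count non-overlapping occurrences in a
translated sequence. The Python sketch below is illustrative only and is not part of the
original disclosure: the regular expression is a hand translation of the consensus pattern,
the function name find_c2h2_fingers is hypothetical, and the three-finger test protein is
made up for the example.

```python
import re

# Hand translation of C-x(2,4)-C-x(3)-[LIVMFYWC]-x(8)-H-x(3,5)-H.
C2H2 = re.compile(r"C.{2,4}C.{3}[LIVMFYWC].{8}H.{3,5}H")

def find_c2h2_fingers(protein: str) -> list:
    """Return (start, matched_text) for each non-overlapping C2H2 match."""
    return [(m.start(), m.group()) for m in C2H2.finditer(protein)]

# Made-up finger and linker, repeated three times; not a real protein.
FINGER = "YKCPECGKSFSQKSNLQKHQRTH"
LINKER = "TGEKP"
toy_protein = (FINGER + LINKER) * 3

hits = find_c2h2_fingers(toy_protein)
print(len(hits), [start for start, _ in hits])
```

Counting with finditer reports each finger once; fingers that overlap or deviate from the
consensus would be missed, so such a count is a lower bound rather than a definitive number.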

u) Zinc Finger, CCHC Class. SEQ ID NO:322 corresponds to a polynucleotide
encoding a novel member of the zinc finger CCHC family. The CCHC zinc finger protein
family to date has been mostly composed of retroviral gag proteins (nucleocapsid). The
prototype structure of this family is from HIV. The family also contains members involved
in eukaryotic gene regulation, such as C. elegans GLH-1. The consensus sequence of this
family is based upon the common structure of an 18-residue zinc finger.

v) Zinc-Binding Metalloprotease Domain. SEQ ID NOS:306 and 395 represent
polynucleotides encoding novel members of the zinc-binding metalloprotease domain
protein family. The majority of zinc-dependent metallopeptidases (with the notable
exception of the carboxypeptidases) share a common pattern of primary structure (Jongeneel
et al., FEBS Lett. (1989) 242:211; Murphy et al., FEBS Lett. (1991) 259:4; and Bode et al.,
Zoology (1996) 99:237) in the part of their sequence involved in the binding of zinc, and can
be grouped together as a superfamily, known as the metzincins, on the basis of this sequence
similarity. Examples of these proteins include: 1) Angiotensin-converting enzyme (EC
3.4.15.1) (dipeptidyl carboxypeptidase I) (ACE), the enzyme responsible for hydrolyzing
angiotensin I to angiotensin II; 2) Mammalian extracellular matrix metalloproteinases
(known as matrixins) (Woessner, FASEB J. (1991) 5:2145): MMP-1 (EC 3.4.24.7)
(interstitial collagenase), MMP-2 (EC 3.4.24.24) (72 kDa gelatinase), MMP-9 (EC 3.4.24.35)
(92 kDa gelatinase), MMP-7 (EC 3.4.24.23) (matrilysin), MMP-8 (EC 3.4.24.34) (neutrophil
collagenase), MMP-3 (EC 3.4.24.17) (stromelysin-1), MMP-10 (EC 3.4.24.22)
(stromelysin-2), MMP-11 (stromelysin-3), and MMP-12 (EC 3.4.24.65) (macrophage
metalloelastase); and 3) Endothelin-converting enzyme 1 (EC 3.4.24.71) (ECE-1), which
processes the precursor of endothelin to release the active peptide.

A signature pattern which includes the two histidine and the glutamic acid residues is
sufficient to detect this superfamily of proteins, having the consensus pattern: [GSTALIVN]-
x(2)-H-E-[LIVMFYW]-{DEHRKP}-H-x-[LIVMFYWGSPQ]. The two H's are zinc
ligands, and E is the active site residue.

Example 4: Differential Expression of Polynucleotides of the Invention: Description of
Libraries and Detection of Differential Expression

The relative expression levels of the polynucleotides of the invention were assessed in
several libraries prepared from various sources, including cell lines and patient tissue
samples. Table 4 provides a summary of these libraries, including the shortened library
name (used hereafter), the mRNA source used to prepare the cDNA library, the "nickname"
of the library that is used in the tables below (in quotes), and the approximate number of
clones in the library.

Table 4 Description of cDNA Libraries

Library (lib #) | Description | Number of Clones in this Clustering
1 | Km12L4; Human Colon Cell Line, High Metastatic Potential (derived from Km12C); "High Colon" | 307133
2 | Km12C; Human Colon Cell Line, Low Metastatic Potential; "Low Colon" | 284755
3 | MDA-MB-231; Human Breast Cancer Cell Line, High Metastatic Potential; micro-metastases in lung; "High Breast" | 326937
4 | MCF7; Human Breast Cancer Cell, Non Metastatic; "Low Breast" | 318979
8 | MV-522; Human Lung Cancer Cell Line, High Metastatic Potential; "High Lung" | 223620
9 | UCP-3; Human Lung Cancer Cell Line, Low Metastatic Potential; "Low Lung" | 312503
12 | Human microvascular endothelial cells (HMEC) - Untreated; PCR (OligodT) cDNA library | 41938
13 | Human microvascular endothelial cells (HMEC) - bFGF treated; PCR (OligodT) cDNA library | 42100
14 | Human microvascular endothelial cells (HMEC) - VEGF treated; PCR (OligodT) cDNA library | 42825
15 | Normal Colon - UC#2 Patient; PCR (OligodT) cDNA library; "Normal Colon Tumor Tissue" | 34285
16 | Colon Tumor - UC#2 Patient; PCR (OligodT) cDNA library; "Normal Colon Tumor Tissue" | 35625
17 | Liver Metastasis from Colon Tumor of UC#2 Patient; PCR (OligodT) cDNA library; "High Colon Metastasis Tissue" | 36984
18 | Normal Colon - UC#3 Patient; PCR (OligodT) cDNA library; "Normal Colon Tumor Tissue" | 36216
19 | Colon Tumor - UC#3 Patient; PCR (OligodT) cDNA library; "High Colon Tumor Tissue" | 41388
20 | Liver Metastasis from Colon Tumor of UC#3 Patient; PCR (OligodT) cDNA library; "High Colon Metastasis Tissue" | 30956

The KM12L4 and KM12C cell lines are described in Example 1 above. The MDA-
MB-231 cell line was originally isolated from pleural effusions (Cailleau, J. Natl. Cancer
Inst. (1974) 55:661), is of high metastatic potential, and forms poorly differentiated
adenocarcinoma grade II in nude mice consistent with breast carcinoma. The MCF7 cell line
was derived from a pleural effusion of a breast adenocarcinoma and is non-metastatic. The
MV-522 cell line is derived from a human lung carcinoma and is of high metastatic
potential. The UCP-3 cell line is a low metastatic human lung carcinoma cell line; the
MV-522 is a high metastatic variant of UCP-3. These cell lines are well-recognized in the art as
models for the study of human breast and lung cancer (see, e.g., Chandrasekaran et al.,
Cancer Res. (1979) 39:870 (MDA-MB-231 and MCF-7); Gastpar et al., J. Med. Chem. (1998)
47:4965 (MDA-MB-231 and MCF-7); Ranson et al., Br. J. Cancer (1998) 77:1586 (MDA-
MB-231 and MCF-7); Kuang et al., Nucleic Acids Res. (1998) 26:1116 (MDA-MB-231 and
MCF-7); Varki et al., Int. J. Cancer (1987) 40:46 (UCP-3); Varki et al., Tumour Biol. (1990)
77:327 (MV-522 and UCP-3); Varki et al., Anticancer Res. (1990) 70:637 (MV-522);
Kelner et al., Anticancer Res. (1995) 75:867 (MV-522); and Zhang et al., Anticancer Drugs
(1997) 5:696 (MV522)). The samples of libraries 15-20 are derived from two different
patients (UC#2 and UC#3).

Each of the libraries is composed of a collection of cDNA clones that in turn are
representative of the mRNAs expressed in the indicated mRNA source. In order to facilitate
the analysis of the millions of sequences in each library, the sequences were assigned to
clusters. The concept of a "cluster of clones" is derived from a sorting/grouping of cDNA
clones based on their hybridization pattern to a panel of roughly 300 7-bp oligonucleotide
probes (see Drmanac et al., Genomics (1996) 57(1):29). Random cDNA clones from a
tissue library are hybridized at moderate stringency to the 300 7-bp oligonucleotides. Each
oligonucleotide has some measure of specific hybridization to a given clone. The
combination of these 300 measures of hybridization, one per probe, constitutes the
"hybridization signature" for a specific clone. Clones with similar sequence will have
similar hybridization signatures. By developing a sorting/grouping algorithm to analyze
these signatures, groups of clones in a library can be identified and brought together
computationally. These groups of clones are termed "clusters". Depending on the
stringency of the selection in the algorithm (similar to the stringency of hybridization in a
classic cDNA library screening protocol), the "purity" of each cluster can be controlled. For
example, artifacts of clustering may occur in computational clustering just as artifacts can
occur in "wet-lab" screening of a cDNA library with 400 bp cDNA fragments, at even the
highest stringency. The stringency used in the implementation of clustering herein provides
groups of clones that are in general from the same cDNA or closely related cDNAs. Closely
related clones can be a result of different length clones of the same cDNA, closely related
clones from highly related gene families, or splice variants of the same cDNA.
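
By way of illustration only, the grouping of clones by hybridization signature can be
sketched as follows. This is not the sorting/grouping algorithm of Drmanac et al.; the
Pearson-correlation similarity measure, the greedy assignment to the first matching
cluster, the default stringency of 0.9, and the function names are all assumptions
introduced for this sketch, and the example uses four probes rather than roughly 300.

```python
from math import sqrt


def pearson(a, b):
    """Pearson correlation between two equal-length hybridization signatures."""
    n = len(a)
    mean_a = sum(a) / n
    mean_b = sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return 0.0  # a flat signature carries no pattern to compare
    return cov / sqrt(var_a * var_b)


def cluster_clones(signatures, stringency=0.9):
    """Greedily group clones whose signatures correlate at or above `stringency`.

    `signatures` maps a clone name to its list of per-probe hybridization
    measures. A higher stringency gives smaller, "purer" clusters.
    """
    clusters = []  # each entry: (representative signature, member clone names)
    for name, signature in signatures.items():
        for representative, members in clusters:
            if pearson(representative, signature) >= stringency:
                members.append(name)
                break
        else:
            clusters.append((signature, [name]))
    return [members for _, members in clusters]


# Hypothetical signatures: clones "a" and "b" hybridize to the same probes.
example = {
    "a": [1.0, 0.1, 0.9, 0.0],
    "b": [0.9, 0.2, 1.0, 0.1],
    "c": [0.0, 1.0, 0.1, 0.9],
}
print(cluster_clones(example))
```

Raising the stringency argument plays the role described above for the selection
stringency: fewer clones are admitted to each cluster, at the cost of splitting related
clones into separate clusters.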

Differential expression for a selected cluster was assessed by first determining the
number of cDNA clones corresponding to the selected cluster in the first library (Clones in
1st), and then determining the number of cDNA clones corresponding to the selected cluster in
the second library (Clones in 2nd). Differential expression of the selected cluster in the first
library relative to the second library is expressed as a "ratio" of percent expression between
the two libraries. In general, the "ratio" is calculated by: 1) calculating the percent
expression of the selected cluster in the first library by dividing the number of clones
corresponding to a selected cluster in the first library by the total number of clones analyzed
from the first library; 2) calculating the percent expression of the selected cluster in the
second library by dividing the number of clones corresponding to a selected cluster in a
second library by the total number of clones analyzed from the second library; and 3) dividing
the calculated percent expression from the first library by the calculated percent expression
from the second library. If the "number of clones" corresponding to a selected cluster in a
library is zero, the value is set at 1 to aid in calculation. The formula used in calculating the
ratio takes into account the "depth" of each of the libraries being compared, i.e., the total
number of clones analyzed in each library.
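
The three-step calculation above can be sketched as follows. The function name and the
library totals used in the example are hypothetical; the actual numbers of clones
analyzed per library are those given in Tables 5 to 7.

```python
def expression_ratio(clones_1st, clones_2nd, total_1st, total_2nd):
    """Ratio of percent expression of a cluster in the first vs. second library.

    A clone count of zero is replaced by 1, as described in the text, so that
    the ratio is always defined.
    """
    count_1st = clones_1st if clones_1st > 0 else 1
    count_2nd = clones_2nd if clones_2nd > 0 else 1
    percent_1st = count_1st / total_1st  # step 1: depth-normalized, first library
    percent_2nd = count_2nd / total_2nd  # step 2: depth-normalized, second library
    return percent_1st / percent_2nd     # step 3: ratio of the two


# Hypothetical library depths of 1,000 and 2,000 analyzed clones.
print(expression_ratio(6, 1, 1000, 2000))
print(expression_ratio(5, 0, 1000, 2000))
```

Because a zero count is set to 1, a cluster with 6 clones versus 0 clones receives the same
ratio as a cluster with 6 clones versus 1 clone in the same pair of libraries.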

In general, a polynucleotide is said to be significantly differentially expressed
between two samples when the ratio value is greater than at least about 2, preferably greater
than at least about 3, more preferably greater than at least about 5, where the ratio value is
calculated using the method described above. The significance of differential expression is
determined using a z score test (Zar, Biostatistical Analysis, Prentice Hall, Inc., USA,
"Differences between Proportions," pp 296-298 (1974)).

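As a minimal sketch of a z score test for the difference between two proportions, the
standard pooled two-proportion statistic can be computed as shown below. The text does not
state which variant of the test was applied (for example, whether a continuity correction
was used), so the pooled form, the two-sided p-value, and the function name are assumptions
of this sketch; the clone counts in the example are hypothetical.

```python
from math import erfc, sqrt


def two_proportion_z(clones_1st, total_1st, clones_2nd, total_2nd):
    """Pooled z statistic and two-sided p-value for two library proportions."""
    p_1st = clones_1st / total_1st
    p_2nd = clones_2nd / total_2nd
    pooled = (clones_1st + clones_2nd) / (total_1st + total_2nd)
    std_err = sqrt(pooled * (1.0 - pooled) * (1.0 / total_1st + 1.0 / total_2nd))
    if std_err == 0.0:
        return 0.0, 1.0  # no clones (or only clones) of this cluster: no evidence
    z = (p_1st - p_2nd) / std_err
    p_value = erfc(abs(z) / sqrt(2.0))  # two-sided tail of the standard normal
    return z, p_value


# Hypothetical counts: 30 of 1,000 clones versus 10 of 1,000 clones.
z, p_value = two_proportion_z(30, 1000, 10, 1000)
print(round(z, 3), p_value < 0.01)
```
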
Tables 5 to 7 (inserted before the claims) show the number of clones in each of the
above libraries that were analyzed for differential expression. Examples of differentially
expressed polynucleotides of particular interest are described in more detail below.

Example 5: Polynucleotides Differentially Expressed in High Metastatic Potential Breast
Cancer Cells Versus Low Metastatic Breast Cancer Cells

A number of polynucleotide sequences have been identified that are differentially
expressed between cells derived from high metastatic potential breast cancer tissue and low
metastatic breast cancer cells. Expression of these sequences in breast cancer can be
valuable in determining diagnostic, prognostic and/or treatment information. For example,
sequences that are highly expressed in the high metastatic potential cells can be indicative of
increased expression of genes or regulatory sequences involved in the metastatic process. A
patient sample displaying an increased level of one or more of these polynucleotides may
thus warrant more aggressive treatment. In another example, sequences that display higher
expression in the low metastatic potential cells can be associated with genes or regulatory
sequences that inhibit metastasis, and thus the expression of these polynucleotides in a
sample may warrant a more positive prognosis than the gross pathology would suggest.

The differential expression of these polynucleotides can be used as a diagnostic 
marker, a prognostic marker, for risk assessment, patient treatment and the like. These 
polynucleotide sequences can also be used in combination with other known molecular 
and/or biochemical markers. 

The following table summarizes identified polynucleotides with differential
expression between high metastatic potential breast cancer cells and low metastatic potential
breast cancer cells.

Table 8. Differentially expressed polynucleotides: High metastatic potential breast
cancer vs. low metastatic breast cancer cells

Example 6: Polynucleotides Differentially Expressed in High Metastatic Potential Lung
Cancer Cells Versus Low Metastatic Lung Cancer Cells

A number of polynucleotide sequences have been identified that are differentially
expressed between cells derived from high metastatic potential lung cancer tissue and low
metastatic lung cancer cells. Expression of these sequences in lung cancer tissue can be
valuable in determining diagnostic, prognostic and/or treatment information. For example,
sequences that are highly expressed in the high metastatic potential cells can
be indicative of increased expression of genes or regulatory sequences involved in the
metastatic process. A patient sample displaying an increased level of one or more of these
polynucleotides may thus warrant more aggressive treatment. In another example,
sequences that display higher expression in the low metastatic potential cells can be
associated with genes or regulatory sequences that inhibit metastasis, and thus the expression
of these polynucleotides in a sample may warrant a more positive prognosis than the gross
pathology would suggest.

The differential expression of these polynucleotides can be used as a diagnostic 
marker, a prognostic marker, for risk assessment, patient treatment and the like. These 
polynucleotide sequences can also be used in combination with other known molecular 
and/or biochemical markers. 

The following table summarizes identified polynucleotides with differential 
expression between high metastatic potential lung cancer cells and low metastatic potential 
lung cancer cells: 



Table 9: Differentially expressed polynucleotides: High metastatic potential lung
cancer vs. low metastatic lung cancer cells

SEQ ID NO. | Differential Expression | Cluster ID | Clones in 1st Library | Clones in 2nd Library | Ratio
400 | High Lung > Low Lung (Lib8 > Lib9) | 14929 | 23 | 16 | 2.008868
9 | High Lung > Low Lung (Lib8 > Lib9) | 2623 | 6 | 1 | 8.384840
34 | High Lung > Low Lung (Lib8 > Lib9) | 5832 | 5 | 0 | 6.987366
42 | High Lung > Low Lung (Lib8 > Lib9) | 307 | 79 | 27 | 4.088903
62 | High Lung > Low Lung (Lib8 > Lib9) | 2623 | 6 | 1 | 8.384840
74 | High Lung > Low Lung (Lib8 > Lib9) | 6268 | 5 | 0 | 6.987366
106 | High Lung > Low Lung (Lib8 > Lib9) | 10717 | 8 | b | 11.17978
119 | High Lung > Low Lung (Lib8 > Lib9) | 8 | 1355 | 122 | 15.52111
361 | High Lung > Low Lung (Lib8 > Lib9) | 1120 | 5 | 0 | 6.987366
369 | High Lung > Low Lung (Lib8 > Lib9) | 2790 | 6 | 0 | 8.384840
371 | High Lung > Low Lung (Lib8 > Lib9) | 8847 | 6 | 1 | 8.384840
379 | High Lung > Low Lung (Lib8 > Lib9) | 260 | 15 | 0 | 20.96210
395 | High Lung > Low Lung (Lib8 > Lib9) | 13538 | 9 | 1 | 12.57726
135 | Low Lung > High Lung (Lib9 > Lib8) | 36313 | 30 | 1 | 21.46731
154 | Low Lung > High Lung (Lib9 > Lib8) | 5345 | 27 | 6 | 3.220097
160 | Low Lung > High Lung (Lib9 > Lib8) | 4386 | 21 | 3 | 5.009039
260 | Low Lung > High Lung (Lib9 > Lib8) | 4141 | 27 | 4 | 4.830145
308 | Low Lung > High Lung (Lib9 > Lib8) | 15855 | 213 | 12 | 12.70149
323 | Low Lung > High Lung (Lib9 > Lib8) | 5257 | 25 | 5 | 3.577885
349 | Low Lung > High Lung (Lib9 > Lib8) | 2797 | 14 | 1 | 10.01807
381 | Low Lung > High Lung (Lib9 > Lib8) | 2428 | 19 | 2 | 6.797982



Example 7: Polynucleotides Differentially Expressed in High Metastatic Potential Colon
Cancer Cells Versus Low Metastatic Colon Cancer Cells

A number of polynucleotide sequences have been identified that are differentially
expressed between cells derived from high metastatic potential colon cancer tissue and low
metastatic colon cancer cells. Expression of these sequences in colon cancer tissue can be
valuable in determining diagnostic, prognostic and/or treatment information. For example,
sequences that are highly expressed in the high metastatic potential cells can be indicative of
increased expression of genes or regulatory sequences involved in the metastatic process. A
patient sample displaying an increased level of one or more of these polynucleotides may
thus warrant more aggressive treatment. In another example, sequences that display higher
expression in the low metastatic potential cells can be associated with genes or regulatory
sequences that inhibit metastasis, and thus the expression of these polynucleotides in a
sample may warrant a more positive prognosis than the gross pathology would suggest.

The differential expression of these polynucleotides can be used as a diagnostic 
marker, a prognostic marker, for risk assessment, patient treatment and the like. These 
polynucleotide sequences can also be used in combination with other known molecular 
and/or biochemical markers. 

The following table summarizes identified polynucleotides with differential 
expression between high metastatic potential colon cancer cells and low metastatic potential 
colon cancer cells: 



Table 10: Differentially expressed polynucleotides: High metastatic potential colon
cancer vs. low metastatic colon cancer cells

SEQ ID NO. | Differential Expression | Cluster ID | Clones in 1st Library | Clones in 2nd Library | Ratio
1 | High Colon > Low Colon (Lib1 > Lib2) | 6660 | 7 | 0 | 6.489973
176 | High Colon > Low Colon (Lib1 > Lib2) | 3765 | 19 | 6 | 2.935940
241 | High Colon > Low Colon (Lib1 > Lib2) | 4275 | 11 | 2 | 5.099264
362 | High Colon > Low Colon (Lib1 > Lib2) | 6420 | 8 | 0 | 7.417112
374 | High Colon > Low Colon (Lib1 > Lib2) | 6420 | 8 | 0 | 7.417112
39 | Low Colon > High Colon (Lib2 > Lib1) | 4016 | 14 | 5 | 3.020043
97 | Low Colon > High Colon (Lib2 > Lib1) | 945 | 21 | 9 | 2.516702
134 | Low Colon > High Colon (Lib2 > Lib1) | 2464 | 19 | 5 | 4.098630
317 | Low Colon > High Colon (Lib2 > Lib1) | 1577 | 40 | 12 | 3.595289
357 | Low Colon > High Colon (Lib2 > Lib1) | 4309 | 13 | 4 | 3.505407



Example 8: Polynucleotides Differentially Expressed at Higher Levels in High Metastatic
Potential Colon Cancer Patient Tissue Versus Normal Patient Tissue

A number of polynucleotide sequences have been identified that are differentially
expressed between cells derived from high metastatic potential colon cancer tissue and
normal tissue. Expression of these sequences in colon cancer tissue can be valuable in
determining diagnostic, prognostic and/or treatment information. For example, sequences
that are highly expressed in the high metastatic potential cells can be
indicative of increased expression of genes or regulatory sequences involved in the advanced
disease state, which involves processes such as angiogenesis, dedifferentiation, cell
replication, and metastasis. A patient sample displaying an increased level of one or more of
these polynucleotides may thus warrant more aggressive treatment.

The differential expression of these polynucleotides can be used as a diagnostic 
marker, a prognostic marker, for risk assessment, patient treatment and the like. These 
polynucleotide sequences can also be used in combination with other known molecular 
and/or biochemical markers. 

The following table summarizes identified polynucleotides with differential 
expression between high metastatic potential colon cancer cells and normal colon cells: 



Table 11: Differentially expressed polynucleotides: High metastatic potential colon
tissue vs. normal colon tissue

SEQ ID NO. | Differential Expression | Cluster ID | Clones in 1st Library | Clones in 2nd Library | Ratio
52 | High Colon Metastasis Tissue > Normal Colon Tissue of UC#3 (Lib20 > Lib18) | 19 | 10 | 0 | 11.69918
52 | High Colon Metastasis Tissue > Normal Tissue in UC#2 (Lib17 > Lib15) | 19 | 13 | 2 | 6.025646
172 | High Colon Metastasis Tissue > Normal Tissue in UC#2 (Lib17 > Lib15) | 102 | 65 | 22 | 2.738930



Example 9: Polynucleotides Differentially Expressed at Higher Levels in High Colon 
Tumor Potential Patient Tissue Versus Metastasized Colon Cancer Patient 
Tissue 

A number of polynucleotide sequences have been identified that are differentially 
expressed between cells derived from high tumor potential colon cancer tissue and cells 
derived from high metastatic potential colon cancer cells. Expression of these sequences in 
colon cancer tissue can be valuable in determining diagnostic, prognostic and/or treatment 
information associated with the transformation of precancerous tissue to malignant tissue. 
This information can be useful in the prevention of achieving the advanced malignant state 
in these tissues, and can be important in risk assessment for a patient. 

The following table summarizes identified polynucleotides with differential 
expression between high tumor potential colon cancer tissue and cells derived from high 
metastatic potential colon cancer cells: 






WO 99/33982 PCT/US98/27610 
Table 12: Differentially expressed polynucleotides: High tumor potential colon tissue
vs. metastatic colon tissue

SEQ ID NO. | Differential Expression | Cluster ID | Clones in 1st Library | Clones in 2nd Library | Ratio
52 | High Colon Tumor Tissue > Metastasis Tissue of UC#3 (Lib19 > Lib20) | 19 | 69 | 10 | 5.160829
119 | High Colon Tumor Tissue > Metastasis Tissue of UC#3 (Lib19 > Lib20) | 8 | 14 | 1 | 10.47124
172 | High Colon Tumor Tissue > Metastasis Tissue of UC#3 (Lib19 > Lib20) | 102 | 43 | 10 | 3.216168

Example 10: Polynucleotides Differentially Expressed at Higher Levels in High Tumor
Potential Colon Cancer Patient Tissue Versus Normal Patient Tissue

A number of polynucleotide sequences have been identified that are differentially
expressed between cells derived from high tumor potential colon cancer tissue and normal
tissue. Expression of these sequences in colon cancer tissue can be valuable in determining
diagnostic, prognostic and/or treatment information associated with the prevention of
achieving the malignant state in these tissues, and can be important in risk assessment for a
patient. For example, sequences that are highly expressed in the potential colon cancer cells
are associated with or can be indicative of increased expression of genes or regulatory
sequences involved in early tumor progression. A patient sample displaying an increased
level of one or more of these polynucleotides may thus warrant closer attention or more
frequent screening procedures to catch the malignant state as early as possible.

The following table summarizes identified polynucleotides with differential
expression between high tumor potential colon cancer cells and normal colon cells:

Table 13: Differentially expressed polynucleotides: High tumor potential colon tissue
vs. normal colon tissue

SEQ ID NO. | Differential Expression | Cluster ID | Clones in 1st Library | Clones in 2nd Library | Ratio
52 | High Colon Tumor Tissue > Normal Tissue of UC#2 (Lib16 > Lib15) | 19 | 13 | 2 | 6.255508
288 | High Colon Tumor Tissue > Normal Tissue of UC#2 (Lib16 > Lib15) | 1267 | 7 | 0 | 6.125253
52 | High Colon Tumor Tissue > Normal Tissue of UC#3 (Lib19 > Lib18) | 19 | 69 | 0 | 60.37750
119 | High Colon Tumor Tissue > Normal Tissue of UC#3 (Lib19 > Lib18) | 8 | 14 | 1 | 12.25050
172 | High Colon Tumor Tissue > Normal Tissue of UC#3 (Lib19 > Lib18) | 102 | 43 | 7 | 5.375222

Example 11: Polynucleotides Differentially Expressed Across Multiple Libraries

A number of polynucleotide sequences have been identified that are differentially
expressed between cancerous cells and normal cells across all three tissue types tested (i.e.,
breast, colon, and lung). Expression of these sequences in a tissue of any origin can be
valuable in determining diagnostic, prognostic and/or treatment information associated with
the prevention of achieving the malignant state in these tissues, and can be important in risk
assessment for a patient. These polynucleotides can also serve as non-tissue specific
markers of, for example, risk of metastasis of a tumor. The following table summarizes
identified polynucleotides that were differentially expressed but without tissue
type-specificity in the breast, colon, and lung libraries tested.



Table 14: Polynucleotides Differentially Expressed Across Multiple Library Comparisons

SEQ ID NO. | Differential Expression | Cluster ID | Clones in 1st Library | Clones in 2nd Library | Ratio
9 | High Breast > Low Breast (Lib3 > Lib4) | 2623 | 31 | 4 | 7.561356
9 | High Lung > Low Lung (Lib8 > Lib9) | 2623 | 6 | 1 | 8.384840
39 | Low Breast > High Breast (Lib4 > Lib3) | 4016 | 6 | 0 | 6.149690
39 | Low Colon > High Colon (Lib2 > Lib1) | 4016 | 14 | 5 | 3.020043
42 | High Breast > Low Breast (Lib3 > Lib4) | 307 | 196 | 75 | 2.549721
42 | High Lung > Low Lung (Lib8 > Lib9) | 307 | 79 | 27 | 4.088903
52 | High Breast > Low Breast (Lib3 > Lib4) | 19 | 1364 | 525 | 2.534854
52 | High Colon Metastasis Tissue > Normal Colon Tissue of UC#3 (Lib20 > Lib18) | 19 | 10 | 0 | 11.69918
52 | High Colon Metastasis Tissue > Normal Tissue in UC#2 (Lib17 > Lib15) | 19 | 13 | 2 | 6.025646
52 | High Colon Tumor Tissue > Metastasis Tissue of UC#3 (Lib19 > Lib20) | 19 | 69 | 10 | 5.160829
52 | High Colon Tumor Tissue > Normal Tissue of UC#2 (Lib16 > Lib15) | 19 | 13 | 2 | 6.255508
52 | High Colon Tumor Tissue > Normal Tissue of UC#3 (Lib19 > Lib18) | 19 | 69 | 0 | 60.37750
62 | High Breast > Low Breast (Lib3 > Lib4) | 2623 | 31 | 4 | 7.561356
62 | High Lung > Low Lung (Lib8 > Lib9) | 2623 | 6 | 1 | 8.384840
74 | High Lung > Low Lung (Lib8 > Lib9) | 6268 | 5 | 0 | 6.987366
74 | Low Breast > High Breast (Lib4 > Lib3) | 6268 | 18 | 3 | 6.149690
119 | High Colon Tumor Tissue > Metastasis Tissue of UC#3 (Lib19 > Lib20) | 8 | 14 | 1 | 10.47124
119 | High Colon Tumor Tissue > Normal Tissue of UC#3 (Lib19 > Lib18) | 8 | 14 | 1 | 12.25050
119 | High Lung > Low Lung (Lib8 > Lib9) | 8 | 1355 | 122 | 15.52111
172 | High Breast > Low Breast (Lib3 > Lib4) | 102 | 278 | 116 | 2.338217
172 | High Colon Metastasis Tissue > Normal Tissue in UC#2 (Lib17 > Lib15) | 102 | 65 | 22 | 2.738930
172 | High Colon Tumor Tissue > Metastasis Tissue of UC#3 (Lib19 > Lib20) | 102 | 43 | 10 | 3.216168
172 | High Colon Tumor Tissue > Normal Tissue of UC#3 (Lib19 > Lib18) | 102 | 43 | 7 | 5.375222
317 | High Breast > Low Breast (Lib3 > Lib4) | 1577 | 25 | 3 | 8.130490
317 | Low Colon > High Colon (Lib2 > Lib1) | 1577 | 40 | 12 | 3.595289
379 | High Breast > Low Breast (Lib3 > Lib4) | 260 | 27 | 2 | 13.17139
379 | High Lung > Low Lung (Lib8 > Lib9) | 260 | 15 | 0 | 20.96210



Example 12: Polynucleotides Exhibiting Colon-Specific Expression

The cDNA libraries described herein were also analyzed to identify those
polynucleotides that were specifically expressed in colon cells or tissue, i.e., the
polynucleotides were identified in libraries prepared from colon cell lines or tissue, but not
in libraries of breast or lung origin. The polynucleotides that were expressed in a colon cell
line and/or in colon tissue, but were not present in the breast or lung cDNA libraries described
herein, are shown in Table 15.



Table 15: Polynucleotides specifically expressed in colon cells.

SEQ ID NO. | Cluster | Clones in 1st Library | Clones in 2nd Library
5 | 36535 | 2 | 0
13 | 27250 | 2 | 0
19 | 16283 | 3 | 0
24 | 16918 | 4 | 0
26 | 40108 | 2 | 0
32 | 32663 | 1 | 1
43 | 39833 | 2 | 0
47 | 18957 | 3 | 0
48 | 39508 | 2 | 0
56 | 7005 | 8 | 2
58 | 18957 | 3 | 0
59 | 18957 | 3 | 0
60 | 16283 | 3 | 0
64 | 13238 | 4 | 1
70 | 39442 | 2 | 0
71 | 17036 | 4 | 0
73 | 7005 | 8 | 2
83 | 11476 | 6 | 0
86 | 39425 | 2 | 0
94 | 21847 | 2 | 1
100 | 16731 | 3 | 1
101 | 12439 | 4 | 0
113 | 17055 | 4 | 0
120 | 67907 | 1 | 0
121 | 12081 | 4 | 0
124 | 39174 | 2 | 0
126 | 8210 | 2 | 6
128 | 40455 | 2 | 0
139 | 22195 | 3 | 0
143 | 86859 | 1 | 0
150 | 8672 | 4 | 4
153 | 16977 | 4 | 0
156 | 17036 | 4 | 0
159 | 40044 | 2 | 0
161 | 40044 | 2 | 0
163 | 22155 | 3 | 0
166 | 15066 | 4 | 0
170 | 11465 | 5 | 0
176 | 3765 | 19 | 6
181 | 86110 | 1 | 0
182 | 39648 | 2 | 0
185 | 17076 | 4 | 0
186 | 22794 | 2 | 0
187 | 39171 | 2 | 0
194 | 40455 | 2 | 0
199 | 16317 | 3 | 0
210 | 39186 | 2 | 0
211 | 40122 | 2 | 0
218 | 26295 | 2 | 0
222 | 4665 | 5 | 9
226 | 82498 | 1 | 0
227 | 35702 | 2 | 0
229 | 39648 | 2 | 0
231 | 85064 | 1 | 0
234 | 39391 | 2 | 0
236 | 39498 | 2 | 0
242 | 22113 | 3 | 0
247 | 19255 | 2 | 0
252 | 22814 | 3 | 0
253 | 39563 | 2 | 0
254 | 39420 | 2 | 0
257 | 39412 | 2 | 0
261 | 38085 | 2 | 0
265 | 40054 | 1 | 0
266 | 39423 | 2 | 0
267 | 39453 | 2 | 0
270 | 78091 | 1 | 0
276 | 39168 | 2 | 0
277 | 39458 | 2 | 0
278 | 14391 | 3 | 1
279 | 39195 | 2 | 0
282 | 12977 | 5 | 0
284 | 14391 | 3 | 1
290 | 16347 | 4 | 0
293 | 39478 | 2 | 0
294 | 39392 | 2 | 0
297 | 39180 | 2 | 0
299 | 6867 | 7 | 3
301 | 41633 | 1 | 1
302 | 23218 | 3 | 0
303 | 39380 | 2 | 0
309 | 84328 | 1 | 0
314 | 14367 | 3 | 0
320 | 39886 | 2 | 0
324 | 9061 | 5 | 2
327 | 16653 | 3 | 1
328 | 16985 | 4 | 0
329 | 12977 | 5 | 0
330 | 9061 | 5 | 2
333 | 16392 | 3 | 0
342 | 39486 | 2 | 0
344 | 6874 | 6 | 3
345 | 6874 | 6 | 3
353 | 11494 | 4 | 0
354 | 17062 | 3 | 0
355 | 16245 | 4 | 0
356 | 83103 | 1 | 0
358 | 13072 | 4 | 1
366 | 14364 | 1 | 0
368 | 84182 | 1 | 0
372 | 56020 | 1 | 0
389 | 7514 | 5 | 3
391 | 7570 | 5 | 3
393 | 23210 | 3 | 0



In addition to the above, SEQ ID NOS:159 and 161 were each present in one clone in
each of Lib16 (Normal Colon Tumor Tissue), and SEQ ID NOS:344 and 345 were each
present in one clone in Lib17 (High Colon Metastasis Tissue). No clones corresponding to
the colon-specific polynucleotides in the table above were present in any of Libraries 3, 4, 8,
or 9. The polynucleotides provided above can be used as markers of cells of colon origin, and
find particular use in reference arrays, as described above.
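
The selection rule of this Example (present in at least one colon library, absent from
every breast and lung library) can be sketched as follows. The function name, the data
layout, and the clone counts are hypothetical; the grouping of library numbers into colon
(Libraries 1, 2 and 15-20) and breast or lung (Libraries 3, 4, 8 and 9) follows the
library labels used in the Examples above.

```python
def colon_specific(clone_counts, colon_libs, other_libs):
    """Return sequence identifiers found in a colon library and in no other library.

    `clone_counts` maps a sequence identifier to a dict of
    {library number: number of clones}; a missing library counts as zero.
    """
    selected = []
    for seq_id, per_library in clone_counts.items():
        in_colon = any(per_library.get(lib, 0) > 0 for lib in colon_libs)
        in_other = any(per_library.get(lib, 0) > 0 for lib in other_libs)
        if in_colon and not in_other:
            selected.append(seq_id)
    return sorted(selected)


COLON_LIBS = {1, 2, 15, 16, 17, 18, 19, 20}
BREAST_LUNG_LIBS = {3, 4, 8, 9}

# Hypothetical clone counts per library for three sequences.
example_counts = {
    "seq_a": {1: 2, 2: 0},  # colon only
    "seq_b": {1: 3, 3: 1},  # colon and breast
    "seq_c": {8: 4},        # lung only
}
print(colon_specific(example_counts, COLON_LIBS, BREAST_LUNG_LIBS))
```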

Example 13: Identification of Contiguous Sequences Having a Polynucleotide of the 
Invention 

The novel polynucleotides were used to screen publicly available and proprietary
databases to determine if any of the polynucleotides of SEQ ID NOS:1-404 would facilitate
identification of a contiguous sequence, e.g., the polynucleotides would provide sequence
that would result in 5' extension of another DNA sequence, resulting in production of a
longer contiguous sequence composed of the provided polynucleotide and the other DNA
sequence(s). Contiging was performed using the AssemblyLign program with the following
parameters: 1) Overlap: Minimum Overlap Length: 30; % Stringency: 50; Minimum
Repeat Length: 30; Alignment: gap creation penalty: 1.00, gap extension penalty: 1.00; 2)
Consensus: % Base designation threshold: 80.

Using these parameters, 44 polynucleotides provided contiged sequences. These
contiged sequences are provided as SEQ ID NOS:801-844. The contiged sequences can be
correlated with the sequences of SEQ ID NOS:1-404 upon which the contiged sequences are
based by identifying those sequences of SEQ ID NOS:1-404 and the contiged sequences of
SEQ ID NOS:801-844 that share the same clone name in Table 1. It should be noted that of
these 44 sequences that provided a contiged sequence, the following members of that group
of 44 did not contig using the overlap settings indicated in parentheses (Stringency/Overlap):
SEQ ID NO:804 (30%/10); SEQ ID NO:810 (20%/20); SEQ ID NO:812 (30%/10); SEQ ID
NO:814 (40%/20); SEQ ID NO:816 (30%/10); SEQ ID NO:832 (30%/10); SEQ ID NO:840
(20%/20); SEQ ID NO:841 (40%/20). To generalize, the indicated polynucleotides did not
contig using a minimum 20% stringency, 10 overlap. There was a corresponding increase in
the number of degenerate codons in these sequences.
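
To illustrate how a minimum overlap length and a percent stringency can jointly control
whether two sequences are joined, a minimal ungapped sketch is given below. It is not the
AssemblyLign implementation: treating "% Stringency" as the percent identity required
across the overlapping region is an assumption, gap penalties and the consensus threshold
are ignored, and the function names and example sequences are hypothetical.

```python
def find_overlap(left, right, min_overlap=30, stringency=50.0):
    """Length of the longest suffix of `left` that matches a prefix of `right`.

    A candidate overlap is accepted when its ungapped percent identity is at
    least `stringency`. Returns 0 when no overlap of at least `min_overlap`
    bases qualifies.
    """
    if min_overlap < 1:
        raise ValueError("min_overlap must be at least 1")
    for length in range(min(len(left), len(right)), min_overlap - 1, -1):
        suffix, prefix = left[-length:], right[:length]
        matches = sum(1 for a, b in zip(suffix, prefix) if a == b)
        if 100.0 * matches / length >= stringency:
            return length
    return 0


def merge_contig(left, right, min_overlap=30, stringency=50.0):
    """Join `left` and `right` across their overlap, or return None if none exists."""
    length = find_overlap(left, right, min_overlap, stringency)
    if length == 0:
        return None
    return left + right[length:]


# Two hypothetical sequences sharing an exact 30-base overlap.
core = ("GATTACA" * 5)[:30]
left_seq = "A" * 10 + core
right_seq = core + "C" * 10
print(merge_contig(left_seq, right_seq, min_overlap=30, stringency=100.0))
```

Lowering either parameter admits more joins, which is consistent with the observation
above that relaxed settings were accompanied by more ambiguous positions.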

The contiged sequences (SEQ ID NOS:801-844) thus represent longer sequences that
encompass a polynucleotide sequence of the invention. The contiged sequences were then
translated in all three reading frames to determine the best alignment with individual
sequences using the BLAST programs as described above for SEQ ID NOS:1-404 and the
validation sequences SEQ ID NOS:405-800. Again the sequences were masked using the
XBLAST program for masking low complexity as described above in Example 1 (Table 2).
Several of the contiged sequences were found to encode polypeptides having characteristics
of a polypeptide belonging to known protein families (and thus represent new members of
these protein families) and/or comprising a known functional domain (Table 16). Thus the
invention encompasses fragments, fusions, and variants of such polynucleotides that retain
biological activity associated with the protein family and/or functional domain identified
herein.
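
Translation of a nucleotide sequence in the three forward reading frames can be sketched
as follows, using the standard genetic code. The function name and the example sequence
are hypothetical, and the BLAST comparison and XBLAST masking steps are not reproduced.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_three_frames(dna):
    """Translate `dna` in forward reading frames 1, 2 and 3.

    Stop codons are written as '*'; codons containing an ambiguous base are
    written as 'X'. Trailing bases that do not fill a codon are ignored.
    """
    dna = dna.upper().replace("U", "T")
    frames = []
    for offset in range(3):
        codons = (dna[i:i + 3] for i in range(offset, len(dna) - 2, 3))
        frames.append("".join(CODON_TABLE.get(codon, "X") for codon in codons))
    return frames


print(translate_three_frames("ATGGCCTAA"))
```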






Table 16: Profile hits using contiged sequences

SEQ ID NO. | Sequence Name | Profile | Start (Stop) | Score
809 | Contig_RTA00000177AF.n.18.3. ieq_ 1 hie J l JlD 1 | ATPases | 778 (1612) | 6040
824 | Contig_RTA00000187AF.g.24.1.Seq_THC168636 | homeobox | 531 (707) | 12080
824 | Contig_RTA00000187AF.g.24.1.Seq_THC168636 | MAP kinase kinase | 769 (1494) | 5784
833 | Contig_RTA00000190AF.j.4.1.Seq_THC228776 | protein kinase | 170 (1010) | 5027
833 | Contig_RTA00000190AF.j.4.1.Seq_THC228776 | protein kinase | 170 (1010) | 5027



All stop/start sequences are provided in the forward direction. 



The profiles for the ATPases (AAA) and protein kinase families are described above 
in Example 2. The homeobox and MAP kinase kinase protein families are described further 
below. 

Homeobox domain. The 'homeobox' is a protein domain of 60 amino acids (Gehring
In: Guidebook to the Homeobox Genes, Duboule D., Ed., pp1-10, Oxford University Press,
Oxford, (1994); Buerglin In: Guidebook to the Homeobox Genes, pp25-72, Oxford
University Press, Oxford, (1994); Gehring Trends Biochem. Sci. (1992) 77:277-280; Gehring
et al. Annu. Rev. Genet. (1986) 20:147-173; Schofield Trends Neurosci. (1987) 70:3-6;
http://copan.bioz.unibas.ch/homeo.html) first identified in a number of Drosophila homeotic
and segmentation proteins. It is extremely well conserved in many other animals, including
vertebrates. This domain binds DNA through a helix-turn-helix type of structure. Several
proteins that contain a homeobox domain play an important role in development. Most of
these proteins are sequence-specific DNA-binding transcription factors. The homeobox
domain is also very similar to a region of the yeast mating type proteins. These are
sequence-specific DNA-binding proteins that act as master switches in yeast differentiation
by controlling gene expression in a cell type-specific fashion.

A schematic representation of the homeobox domain is shown below. The
helix-turn-helix region is shown by the symbols 'H' (for helix), and 't' (for turn).

xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxHHHHHHHHtttHHHHHHHHHxxxxxxxxxx
1                                                         60

The pattern detects homeobox sequences 24 residues long and spans positions 34 to 57 of the
homeobox domain. The consensus pattern is as follows: [LIVMFYG]-[ASLVR]-x(2)-
[LIVMSTACN]-x-[LIVM]-x(4)-[LIV]-[RKNQESTAIY]-[LIVFSTNKH]-W-[FYVC]-x-
[NDQTAH]-x(5)-[RKNAIMW].
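
A consensus pattern written in this notation can be converted to a regular expression and
used to scan a protein sequence, as sketched below. The converter and its function name
are hypothetical and cover only the notation elements used here. The pattern string
includes the two positions [FYVC]-x between W and [NDQTAH] on the assumption that they
belong to the pattern, since it is stated to be 24 residues long. The test sequence is
assumed to be the Drosophila Antennapedia homeodomain and serves only as example input.

```python
import re

# Consensus pattern for positions 34 to 57 of the homeobox domain
# (the [FYVC]-x segment is an assumption; see the paragraph above).
HOMEOBOX_PATTERN = (
    "[LIVMFYG]-[ASLVR]-x(2)-[LIVMSTACN]-x-[LIVM]-x(4)-[LIV]-"
    "[RKNQESTAIY]-[LIVFSTNKH]-W-[FYVC]-x-[NDQTAH]-x(5)-[RKNAIMW]"
)

ELEMENT = re.compile(r"(x|[A-Z]|\[[A-Z]+\]|\{[A-Z]+\})(?:\((\d+)(?:,(\d+))?\))?")


def pattern_to_regex(pattern):
    """Convert a dash-separated consensus pattern into a regular expression."""
    parts = []
    for element in pattern.strip(". ").split("-"):
        match = ELEMENT.fullmatch(element)
        if match is None:
            raise ValueError(f"unsupported pattern element: {element!r}")
        core, low, high = match.groups()
        if core == "x":
            core = "."                         # any residue
        elif core.startswith("{"):
            core = "[^" + core[1:-1] + "]"     # any residue except those listed
        if low is not None:
            core += "{" + low + ("," + high if high else "") + "}"
        parts.append(core)
    return "".join(parts)


# 60-residue example input (assumed Antennapedia homeodomain).
TEST_HOMEODOMAIN = "RKRGRQTYTRYQTLELEKEFHFNRYLTRRRRIEIAHALCLTERQIKIWFQNRRMKWKKEN"

hit = re.search(pattern_to_regex(HOMEOBOX_PATTERN), TEST_HOMEODOMAIN)
print(hit.start() + 1, hit.end())  # 1-based first and last matched positions
```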

MAP kinase kinase (MAPKK). MAP kinases (MAPK) are involved in signal
transduction, and are important in cell cycle and cell growth controls. The MAP kinase
kinases (MAPKK) are dual-specificity protein kinases which phosphorylate and activate
MAP kinases. MAPKK homologues have been found in yeast, invertebrates, amphibians,
and mammals. Moreover, the MAPKK/MAPK phosphorylation switch constitutes a basic
module activated in distinct pathways in yeast and in vertebrates. MAPKK regulation
studies have led to the discovery of at least four MAPKK convergent pathways in higher
organisms. One of these is similar to the yeast pheromone response pathway which includes
the ste11 protein kinase. Two other pathways require the activation of either one or both of
the serine/threonine kinase-encoded oncogenes c-Raf-1 and c-Mos. Additionally, several
studies suggest a possible effect of the cell cycle control regulator cyclin-dependent kinase 1
(cdc2) on MAPKK activity. Finally, MAPKKs are apparently essential transducers through
which signals must pass before reaching the nucleus. For review, see, e.g., Biologique Biol
Cell (1993) 79:193-207; Nishida et al., Trends Biochem Sci (1993) 75:128-31; Ruderman
Curr Opin Cell Biol (1993) 5:207-13; Dhanasekaran et al., Oncogene (1998) 77:1447-55;
Kiefer et al., Biochem Soc Trans (1997) 25:491-8; and Hill, Cell Signal (1996) 5:533-44.

Those skilled in the art will recognize, or be able to ascertain, using not more than
routine experimentation, many equivalents to the specific embodiments of the invention
described herein. Such specific embodiments and equivalents are intended to be
encompassed by the following claims.

All publications and patent applications cited in this specification are herein
incorporated by reference as if each individual publication or patent application were
specifically and individually indicated to be incorporated by reference. The citation of any
publication is for its disclosure prior to the filing date and should not be construed as an
admission that the present invention is not entitled to antedate such publication by virtue of
prior invention.

Although the foregoing invention has been described in some detail by way of
illustration and example for purposes of clarity of understanding, it is readily apparent to
those of ordinary skill in the art in light of the teachings of this invention that certain changes
and modifications may be made thereto without departing from the spirit or scope of the
appended claims.

Deposit Information:

The following materials were deposited with the American Type Culture Collection (ATCC).
CMCC = Chiron Master Culture Collection.



Cell Lines Deposited with ATCC

Cell Line     Deposit Date       ATCC Accession No.   CMCC Accession No.
KM12L4-A      March 19, 1998     CRL-12496            11606
KM12C         May 15, 1998       CRL-12533            11611
MDA-MB-231    May 15, 1998       CRL-12532            10583
MCF-7         October 9, 1998    CRL-12584            10377

cDNA Library ES1 - ATCC#
Deposit Date - December 22, 1998

Clone Name        Cluster ID   Sequence Name
M00001395A:C03    4016         79.A1.sp6:130016.Seq
M00001395A:C03    4016         RTA00000118A.c.4.1
M00001449A:D12    3681         RTA00000131A.g.15.2
M00001449A:D12    3681         79.E1.sp6:130064.Seq
M00001452A:D08    1120         79.C2.sp6:130041.Seq
M00001452A:D08    1120         RTA00000118A.p.15.3
M00001513A:B06    4568         79.D4.sp6:130055.Seq
M00001513A:B06    4568         RTA00000122A.d.15.3
M00001517A:B07    4313         79.F4.sp6:130079.Seq
M00001517A:B07    4313         RTA00000122A.n.3.1
M00001533A:C11    2428         RTA00000123A.1.21.1
M00001533A:C11    2428         79.A5.sp6:130020.Seq
M00001533A:C11    2428         RTA00000123A.1.21.1.Seq_THC205063
M00001542A:A09    22113        79.F5.sp6:130080.Seq
M00001542A:A09    22113        RTA00000125A.c.7.1



cDNA Library ES2 - ATCC#
Deposit Date - December 22, 1998

Clone Name        Cluster ID   Sequence Name
M00001343C:F10    2790         80.E1.sp6:130256.Seq
M00001343C:F10    2790         RTA00000177AF.e.2.1.Seq_THC229461
M00001343C:F10    2790         RTA00000177AF.e.2.1
M00001343D:H07    23255        100.C1.sp6:131446.Seq
M00001343D:H07    23255        RTA00000177AF.e.14.3.Seq_THC228776
M00001343D:H07    23255        80.F1.sp6:130268.Seq
M00001343D:H07    23255        RTA00000177AF.e.14.3
M00001345A:E01    6420         172.E1.sp6:133925.Seq
M00001345A:E01    6420         RTA00000177AF.f.10.3
M00001345A:E01    6420         RTA00000177AF.f.10.3.Seq_THC226443
M00001345A:E01    6420         80.G1.sp6:130280.Seq
M00001347A:B10    13576        80.D2.sp6:130245.Seq
M00001347A:B10    13576        100.E1.sp6:131470.Seq
M00001347A:B10    13576        RTA00000177AF.g.16.1
M00001353A:G12    8078         80.E3.sp6:130258.Seq
M00001353A:G12    8078         RTA00000177AR.l.13.1
M00001353A:G12    8078         172.C3.sp6:133903.Seq
M00001353D:D10    14929        RTA00000177AF.m.1.2
M00001353D:D10    14929        80.F3.sp6:130270.Seq
M00001353D:D10    14929        172.D3.sp6:133915.Seq
M00001361A:A05    4141         80.B4.sp6:130223.Seq
M00001361A:A05    4141         RTA00000177AF.p.20.3
M00001362B:D10    5622         80.D4.sp6:130247.Seq
M00001362B:D10    5622         RTA00000178AF.a.11.1

Clone Name        Cluster ID   Sequence Name
M00001362C:H11    945          RTA00000178AR.a.20.1
M00001362C:H11    945          100.E4.sp6:131473.Seq
M00001362C:H11    945          80.E4.sp6:130259.Seq
M00001362C:H11    945          180.C2.sp6:135940.Seq
M00001376B:G06    17732        RTA00000178AR.i.2.2
M00001376B:G06    17732        80.B5.sp6:130224.Seq
M00001387A:C05    2464         80.D6.sp6:130249.Seq
M00001387A:C05    2464         RTA00000178AF.n.18.1
M00001412B:B10    8551         RTA00000179AF.p.21.1
M00001412B:B10    8551         80.G7.sp6:130286.Seq
M00001415A:H06    13538        80.B8.sp6:130227.Seq
M00001415A:H06    13538        RTA00000180AF.a.24.1
M00001416B:H11    8847         80.C8.sp6:130239.Seq
M00001416B:H11    8847         RTA00000180AF.b.16.1
M00001429D:D07    40392        RTA00000180AF.j.8.1
M00001429D:D07    40392        80.H9.sp6:130300.Seq
M00001448D:H01    36313        80.A11.sp6:130218.Seq
M00001448D:H01    36313        RTA00000181AF.e.23.1



cDNA Library ES4 - ATCC#
Deposit Date - December 22, 1998

Clone Name        Cluster ID   Sequence Name
M00001463C:B11    19           RTA00000182AF.b.7.1
M00001463C:B11    19           89.D1.sp6:130703.Seq
M00001470A:B10    1037         89.F2.sp6:130728.Seq
M00001470A:B10    1037         RTA00000121A.f.8.1
M00001497A:G02    2623         89.F3.sp6:130729.Seq
M00001497A:G02    2623         RTA00000183AF.a.6.1
M00001500A:E11    2623         RTA00000183AF.b.14.1
M00001500A:E11    2623         89.A4.sp6:130670.Seq
M00001501D:C02    9685         RTA00000183AF.c.11.1.Seq_THC109544
M00001501D:C02    9685         RTA00000183AF.c.11.1
M00001501D:C02    9685         89.C4.sp6:130694.Seq
M00001504C:H06    6974         89.F4.sp6:130730.Seq
M00001504C:H06    6974         RTA00000183AF.d.9.1
M00001504C:H06    6974         RTA00000183AF.d.9.1.Seq_THC223129
M00001504D:G06    6420         173.F5.SP6:134133.Seq
M00001504D:G06    6420         89.G4.sp6:130742.Seq
M00001504D:G06    6420         RTA00000183AF.d.11.1.Seq_THC226443
M00001504D:G06    6420         RTA00000183AF.d.11.1
M00001528A:C04    35555        89.B6.sp6:130684.Seq
M00001528A:C04    7337         RTA00000123A.b.17.1
M00001528A:C04    35555        184.A5.sp6:135530.Seq

WO 99/33982 PCT/US98/27610

cDNA Library ES5 - ATCC#
Deposit Date - December 22, 1998

Clone Name        Cluster ID   Sequence Name
M00001537B:G07    3389         RTA00000183AF.m.19.1
M00001537B:G07    3389         89.A8.sp6:130674.Seq
M00001541A:D02    3765         89.C8.sp6:130698.Seq
M00001541A:D02    3765         RTA00000135A.d.1.1
M00001544B:B07    6974         89.A9.sp6:130675.Seq
M00001544B:B07    6974         RTA00000184AF.a.15.1
M00001546A:G11    1267         89.D9.sp6:130711.Seq
M00001546A:G11    1267         RTA00000125A.o.5.1
M00001549B:F06    4193         89.G9.sp6:130747.Seq
M00001549B:F06    4193         RTA00000184AF.e.13.1
M00001556A:F11    1577         173.C9.SP6:134101.Seq
M00001556A:F11    1577         89.F11.sp6:130737.Seq
M00001556A:F11    1577         RTA00000184AF.i.23.1
M00001556B:C08    4386         RTA00000184AF.j.4.1
M00001556B:C08    4386         89.H11.sp6:130761.Seq


cDNA Library ES6 - ATCC#
Deposit Date - December 22, 1998

Clone Name        Cluster ID   Sequence Name
M00001563B:F06    102          RTA00000184AF.o.5.1
M00001563B:F06    102          90.B1.sp6:130871.Seq
M00001571C:H06    5749         90.E1.sp6:130907.Seq
M00001571C:H06    5749         RTA00000185AF.a.19.1
M00001594B:H04    260          90.D2.sp6:130896.Seq
M00001594B:H04    260          RTA00000185AR.i.12.2
M00001597C:H02    4837         90.E2.sp6:130908.Seq
M00001597C:H02    4837         RTA00000185AR.k.3.2
M00001624C:F01    4309         90.C4.sp6:130886.Seq
M00001624C:F01    4309         RTA00000186AF.e.22.1
M00001679A:A06    6660         90.F6.sp6:130924.Seq
M00001679A:A06    6660         122.B5.sp6:132089.Seq
M00001679A:A06    6660         RTA00000187AF.h.15.1
M00003759B:B09    697          90.G8.sp6:130938.Seq
M00003759B:B09    697          RTA00000188AF.d.6.1
M00003759B:B09    697          RTA00000188AF.d.6.1.Seq_THC178884
M00003844C:B11    6539         176.D9.sp6:134556.Seq
M00003844C:B11    6539         RTA00000189AF.d.22.1
M00003844C:B11    6539         90.B10.sp6:130880.Seq
M00003857A:G10    3389         90.A11.sp6:130869.Seq
M00003857A:G10    3389         RTA00000189AF.g.3.1

cDNA Library ES7 - ATCC#
Deposit Date - December 22, 1998

Clone Name        Cluster ID   Sequence Name
M00003914C:F05    3900         99.E1.sp6:131278.Seq
M00003914C:F05    3900         RTA00000190AF.g.13.1
M00003922A:E06    23255        RTA00000190AF.j.4.1
M00003922A:E06    23255        99.F1.sp6:131290.Seq
M00003922A:E06    23255        RTA00000190AF.j.4.1.Seq_THC228776
M00003983A:A05    9105         99.C3.sp6:131256.Seq
M00003983A:A05    9105         RTA00000191AF.a.21.2
M00004028D:A06    6124         RTA00000191AR.e.2.3
M00004028D:A06    6124         99.D3.sp6:131268.Seq
M00004031A:A12    9061         RTA00000191AR.e.11.2
M00004031A:A12    9061         RTA00000191AR.e.11.3
M00004087D:A01    6880         RTA00000191AF.m.20.1
M00004087D:A01    6880         99.A5.sp6:131234.Seq
M00004108A:E06    4937         99.E5.sp6:131282.Seq
M00004108A:E06    4937         RTA00000191AF.p.21.1
M00004114C:F11    13183        123.D5.sp6:132305.Seq
M00004114C:F11    13183        RTA00000192AF.a.24.1
M00004114C:F11    13183        99.G5.sp6:131306.Seq



cDNA Library ES8 - ATCC#
Deposit Date - December 22, 1998

Clone Name        Cluster ID   Sequence Name
M00004146C:C11    5257         99.B6.sp6:131247.Seq
M00004146C:C11    5257         177.F5.sp6:134768.Seq
M00004146C:C11    5257         RTA00000192AF.f.3.1
M00004146C:C11    5257         RTA00000192AF.f.3.1.Seq_THC213833
M00004157C:A09    6455         RTA00000192AF.g.23.1
M00004157C:A09    6455         99.D6.sp6:131271.Seq
M00004157C:A09    6455         123.E7.sp6:132319.Seq
M00004172C:D08    11494        RTA00000192AF.j.6.1
M00004172C:D08    11494        99.G6.sp6:131307.Seq
M00004172C:D08    11494        177.E6.sp6:134757.Seq
M00004229B:F08    6455         RTA00000193AF.b.9.1
M00004229B:F08    6455         99.C8.sp6:131261.Seq


cDNA Library ES9 - ATCC#
Deposit Date - December 22, 1998

Clone Name        Cluster ID   Sequence Name
M00001466A:E07    4275         RTA00000120A.j.14.1
M00001531A:H11                 89.F6.sp6:130732.Seq
M00001531A:H11                 RTA00000123A.g.19.1
M00001551A:B10    6268         79.G9.sp6:130096.Seq
M00001551A:B10    6268         184.C12.sp6:135561.Seq
M00001551A:B10    6268         RTA00000126A.o.23.1
M00001552A:B12    307          RTA00000136A.o.4.2
M00001552A:B12    307          79.C7.sp6:130046.Seq
M00001556A:H01    15855        RTA00000184AF.j.1.1
M00001586C:C05    4623         RTA00000185AF.f.4.1
M00001604A:B10    1399         79.G8.sp6:130095.Seq
M00001604A:B10    1399         RTA00000129A.o.10.1
M00003879B:C11    5345         RTA00000189AF.1.19.1
M00003879B:C11    5345         90.B12.sp6:130882.Seq






cDNA Library ES10 - ATCC#
Deposit Date - December 22, 1998

Clone Name        Cluster ID   Sequence Name
M00001358C:C06                 RTA00000177AF.o.4.3
M00001388D:G05    5832         80.F6.sp6:130273.Seq
M00001388D:G05    5832         RTA00000178AF.o.23.1
M00001394A:F01    6583         RTA00000179AF.d.13.1
M00001394A:F01    6583         172.B8.sp6:133896.Seq
M00001394A:F01    6583         80.H6.sp6:130297.Seq
M00001429A:H04    2797         RTA00000180AF.U9.1
M00001447A:G03    10717        RTA00000181AF.d.10.1
M00001448D:C09    8            80.H10.sp6:130301.Seq
M00001448D:C09    8            RTA00000181AF.e.17.1
M00001448D:C09    8            100.B11.sp6:131444.Seq
M00001454D:G03    689          RTA00000181AR.1.22.1


cDNA Library ES11 - ATCC#
Deposit Date - December 22, 1998

Clone Name        Cluster ID   Sequence Name
M00003975A:G11    12439        RTA00000190AF.o.24.1
M00003978B:G05    5693         RTA00000190AF.p.17.2.Seq_THC173318
M00003978B:G05    5693         RTA00000190AF.p.17.2
M00004059A:D06    5417         RTA00000191AF.h.19.1
M00004068B:A01    3706         99.C4.sp6:131257.Seq
M00004068B:A01    3706         RTA00000191AF.i.17.2
M00004205D:F06                 99.E7.sp6:131284.Seq
M00004205D:F06                 177.G7.sp6:134782.Seq
M00004205D:F06                 RTA00000192AF.o.11.1
M00004212B:C07    2379         RTA00000192AF.p.8.1
M00004223A:G10    16918        RTA00000193AF.a.16.1


cDNA Library ES12 - ATCC#
Deposit Date - December 22, 1998

Clone Name        Cluster ID   Sequence Name
M00004223B:D09    7899         RTA00000193AF.a.17.1
M00004249D:G12                 RTA00000193AF.c.22.1
M00004251C:G07                 RTA00000193AF.d.2.1
M00004372A:A03    2030         RTA00000193AF.m.20.1



cDNA Library ES13 - ATCC#
Deposit Date - December 22, 1998

Clone Name        Cluster ID   Sequence Name
M00001340B:A06    17062        80.A1.sp6:130208.Seq
M00001340B:A06    17062        RTA00000177AF.b.8.4
M00001340D:F10    11589        80.B1.sp6:130220.Seq
M00001340D:F10    11589        RTA00000177AF.b.17.4
M00001341A:E12    4443         80.C1.sp6:130232.Seq
M00001341A:E12    4443         RTA00000177AF.b.20.4
M00001342B:E06    39805        80.D1.sp6:130244.Seq
M00001342B:E06    39805        RTA00000177AF.c.21.3
M00001346A:F09    5007         RTA00000177AF.g.2.1
M00001346A:F09    5007         80.H1.sp6:130292.Seq
M00001346D:G06    5779         RTA00000177AF.g.14.3
M00001346D:G06    5779         RTA00000177AF.g.14.1
M00001348B:B04    16927        80.E2.sp6:130257.Seq
M00001348B:B04    16927        RTA00000177AF.h.9.3
M00001348B:G06    16985        RTA00000177AF.h.10.1
M00001348B:G06    16985        80.F2.sp6:130269.Seq
M00001349B:B08    3584         RTA00000177AF.h.20.1
M00001349B:B08    3584         80.G2.sp6:130281.Seq
M00001350A:H01    7187         100.C2.sp6:131447.Seq
M00001350A:H01    7187         80.A3.sp6:130210.Seq
M00001350A:H01    7187         RTA00000177AF.i.8.2
M00001352A:E02    16245        RTA00000177AF.k.9.3
M00001352A:E02    16245        172.D2.sp6:133914.Seq
M00001352A:E02    16245        80.D3.sp6:130246.Seq
M00001355B:G10    14391        RTA00000177AF.m.17.3
M00001355B:G10    14391        80.G3.sp6:130282.Seq
M00001355B:G10    14391        172.H3.sp6:133963.Seq
M00001355B:G10    14391        100.E3.sp6:131472.Seq
M00001361D:F08    2379         80.C4.sp6:130235.Seq
M00001361D:F08    2379         RTA00000178AF.a.6.1
M00001365C:C10    40132        RTA00000178AF.c.7.1
M00001365C:C10    40132        80.F4.sp6:130271.Seq
M00001368D:E03                 80.G4.sp6:130283.Seq
M00001368D:E03                 RTA00000178AF.d.20.1
M00001370A:C09    6867         80.H4.sp6:130295.Seq
M00001370A:C09    6867         RTA00000178AF.e.12.1
M00001371C:E09    7172         100.A5.sp6:131426.Seq
M00001371C:E09    7172         RTA00000178AF.f.9.1
M00001371C:E09    7172         80.A5.sp6:130212.Seq
M00001378B:B02    39833        80.C5.sp6:130236.Seq
M00001378B:B02    39833        RTA00000178AF.L23.1
M00001379A:A05    1334         80.D5.sp6:130248.Seq
M00001379A:A05    1334         RTA00000178AF.j.7.1
M00001380D:B09    39886        RTA00000178AF.j.24.1
M00001380D:B09    39886        80.E5.sp6:130260.Seq
M00001381D:E06                 80.F5.sp6:130272.Seq
M00001381D:E06                 RTA00000178AF.k.16.1
M00001382C:A02    22979        80.G5.sp6:130284.Seq
M00001382C:A02    22979        RTA00000178AF.k.22.1
M00001384B:A11                 80.B6.sp6:130225.Seq
M00001384B:A11                 RTA00000178AF.m.13.1
M00001386C:B12    5178         80.C6.sp6:130237.Seq

cDNA Library ES13 - ATCC#
Deposit Date - December 22, 1998

Clone Name        Cluster ID   Sequence Name
M00001386C:B12    5178         RTA00000178AF.n.10.1
M00001387B:G03    7587         80.E6.sp6:130261.Seq
M00001387B:G03    7587         RTA00000178AF.n.24.1
M00001389A:C08    16269        RTA00000178AF.p.1.1
M00001389A:C08    16269        80.G6.sp6:130285.Seq
M00001396A:C03    4009         172.D8.sp6:133920.Seq
M00001396A:C03    4009         80.A7.sp6:130214.Seq
M00001396A:C03    4009         RTA00000179AF.e.20.1
M00001400B:H06                 172.B9.sp6:133897.Seq
M00001400B:H06                 80.B7.sp6:130226.Seq
M00001400B:H06                 RTA00000179AF.j.13.1
M00001400B:H06                 RTA00000179AF.j.13.1.Seq_THC105720
M00001402A:E08    39563        80.C7.sp6:130238.Seq
M00001402A:E08    39563        RTA00000179AF.k.20.1
M00001407B:D11    5556         RTA00000179AF.n.10.1
M00001407B:D11    5556         80.D7.sp6:130250.Seq
M00001410A:D07    7005         180.H5.sp6:136003.Seq
M00001410A:D07    7005         RTA00000179AF.o.22.1
M00001410A:D07    7005         80.F7.sp6:130274.Seq
M00001414A:B01                 RTA00000180AF.a.9.1
M00001414A:B01                 80.H7.sp6:130298.Seq
M00001414C:A07    7674         80.A8.sp6:130215.Seq
M00001414C:A07    7674         RTA00000180AF.a.11.1
M00001416A:H01    36393        79.C1.sp6:130040.Seq
M00001416A:H01    36393        RTA00000118A.g.9.1
M00001417A:E02    15066        RTA00000180AF.c.2.1
M00001417A:E02    15066        80.D8.sp6:130251.Seq
M00001423B:E07    10470        RTA00000180AF.e.24.1
M00001423B:E07    10470        80.H8.sp6:130299.Seq
M00001424B:G09    22195        80.A9.sp6:130216.Seq
M00001424B:G09    22195        RTA00000180AF.f.18.1
M00001425B:H08                 RTA00000180AF.g.7.1
M00001425B:H08                 80.B9.sp6:130228.Seq
M00001426B:D12                 RTA00000180AF.g.22.1
M00001426B:D12                 80.C9.sp6:130240.Seq
M00001426D:C08    4261         80.D9.sp6:130252.Seq
M00001426D:C08    4261         RTA00000180AF.h.5.1
M00001428A:H10    84182        100.G9.sp6:131502.Seq
M00001428A:H10    84182        RTA00000180AF.h.19.1
M00001428A:H10    84182        80.E9.sp6:130264.Seq
M00001449A:A12    5857         80.B11.sp6:130230.Seq
M00001449A:A12    5857         RTA00000118A.g.14.1
M00001449A:B12    41633        80.C11.sp6:130242.Seq
M00001449A:B12    41633        RTA00000118A.g.16.1
M00001449A:G10    36535        RTA00000181AF.f.5.1
M00001449A:G10    36535        80.D11.sp6:130254.Seq
M00001449A:G10    36535        100.D11.sp6:131468.Seq
M00001449C:D06    86110        RTA00000181AF.f.12.1
M00001449C:D06    86110        80.E11.sp6:130266.Seq
M00001450A:A02    39304        RTA00000118A.j.21.1.Seq_THC151859
M00001450A:A02    39304        RTA00000118A.j.21.1
M00001450A:A02    39304        79.F1.sp6:130076.Seq

cDNA Library ES13 - ATCC#
Deposit Date - December 22, 1998

Clone Name        Cluster ID   Sequence Name
M00001450A:A02    39304        180.G9.sp6:135995.Seq
M00001450A:A11    32663        80.F11.sp6:130278.Seq
M00001450A:A11    32663        RTA00000118A.1.8.1
M00001450A:B12    82498        100.F11.sp6:131492.Seq
M00001450A:B12    82498        RTA00000118A.m.10.1
M00001450A:B12    82498        79.G1.sp6:130088.Seq
M00001450A:D08    27250        80.G11.sp6:130290.Seq
M00001450A:D08    27250        180.B10.sp6:135936.Seq
M00001450A:D08    27250        RTA00000181AF.g.10.1
M00001452A:B04    84328        RTA00000118A.p.10.1
M00001452A:B04    84328        79.A2.sp6:130017.Seq
M00001452A:B12    86859        RTA00000118A.p.8.1
M00001452A:B12    86859        79.B2.sp6:130029.Seq
M00001452A:F05    85064        RTA00000131A.m.23.1
M00001452A:F05    85064        79.D2.sp6:130053.Seq
M00001452C:B06    16970        80.H11.sp6:130302.Seq
M00001452C:B06    16970        100.C12.sp6:131457.Seq
M00001452C:B06    16970        RTA00000181AR.1.18.2
M00001453A:E11    16130        80.A12.sp6:130219.Seq
M00001453A:E11    16130        100.D12.sp6:131469.Seq
M00001453A:E11    16130        RTA00000119A.c.13.1
M00001453C:F06    16653        80.B12.sp6:130231.Seq
M00001453C:F06    16653        RTA00000181AF.k.5.3
M00001454A:A09    83103        RTA00000119A.e.24.2
M00001454A:A09    83103        79.G2.sp6:130089.Seq
M00001454B:C12    7005         121.D1.sp6:131917.Seq
M00001454B:C12    7005         RTA00000181AF.k.24.1
M00001454B:C12    7005         80.C12.sp6:130243.Seq
M00001455B:E12    13072        80.F12.sp6:130279.Seq
M00001455B:E12    13072        RTA00000181AR.m.5.2
M00001460A:F06    2448         89.A1.sp6:130667.Seq
M00001460A:F06    2448         RTA00000119A.j.21.1
M00001461A:D06    1531         89.C1.sp6:130691.Seq
M00001461A:D06    1531         RTA00000119A.o.3.1
M00001465A:B11    10145        79.F3.sp6:130078.Seq
M00001465A:B11    10145        RTA00000120A.g.12.1
M00001467A:B07    38759        89.F1.sp6:130727.Seq
M00001467A:B07    38759        RTA00000120A.m.12.3
M00001467A:D04    39508        RTA00000120A.o.2.1
M00001467A:D04    39508        89.G1.sp6:130739.Seq
M00001467A:E10    39442        89.A2.sp6:130668.Seq
M00001467A:E10    39442        RTA00000120A.o.21.1
M00001468A:F05    7589         RTA00000120A.p.23.1
M00001468A:F05    7589         89.B2.sp6:130680.Seq
M00001469A:A01                 RTA00000121A.c.10.1
M00001469A:A01                 89.C2.sp6:130692.Seq
M00001469A:C10    12081        89.D2.sp6:130704.Seq
M00001469A:C10    12081        RTA00000133A.d.14.2
M00001469A:H12    19105        89.E2.sp6:130716.Seq
M00001469A:H12    19105        RTA00000133A.e.15.1
M00001470A:C04    39425        89.G2.sp6:130740.Seq
M00001470A:C04    39425        RTA00000133A.f.1.1

Clone Name

M00001471A:B01 

M00001471A:B01 

M00001487B:H06 

M00001487B:H06

M00001488B:F12

M00001488B:F12 

M00001494D:F06 

M00001494D:F06 

M00001499B:A11 

M00001499B:A11 

M00001499B:A11 

M00001500A:C05

M00001500A:C05

M00001504A:E01 

M00001504A:E01 

M00001504A:E01 

M00001504C:A07 

M00001504C:A07 

M00001505C:C05 

M00001505C:C05 

M00001506D:A09 

M00001506D:A09 

M00001506D:A09 

M00001507A:H05 

M00001507A:H05 

M00001535A:F10 

M00001535A:F10 

M00001541A:H03 

M00001541A:H03 

M00001544A:G02 

M00001544A:G02 

M00001545A:D08 

M00001545A:D08 

M00001551A:F05

M00001551A:F05

M00001552A:D11

M00001552A:D11

M00001557A:F03 



7206 

7206 

10539 

10539 

10539 

5336 

5336 



10185 
10185 



Cluster ID   Sequence Name
39478        89.H2.sp6:130752.Seq
39478        RTA00000133A.L5.1

RTA00000182AF.I.15.1
89.B3.sp6:130681.Seq
RTA00000182AF.I.20.1
89.C3.sp6:130693.Seq
RTA00000182AF.o.15.1
89.E3.sp6:130717.Seq
RTA00000183AF.a.24.1
89.G3.sp6:130741.Seq
173.B5.SP6:134085.Seq
RTA00000183AF.b.13.1
89.H3.sp6:130753.Seq
RTA00000183AF.c.24.1
89.D4.sp6:130706.Seq
RTA00000183AF.c.24.1.Seq_THC125912
RTA00000183AF.d.5.1
89.E4.sp6:130718.Seq
89.H4.sp6:130754.Seq
RTA00000183AF.e.1.1
89.A5.sp6:130671.Seq
RTA00000183AF.e.23.1
121.G6.sp6:131958.Seq
39168        RTA00000121A.1.10.1
39168        89.B5.sp6:130683.Seq

39423        79.C5.sp6:130044.Seq
39423        RTA00000134A.k.22.1
39174        79.E5.sp6:130068.Seq
39174        RTA00000124A.n.13.1
19829        79.H5.sp6:130104.Seq
19829        RTA00000125A.h.24.4
13864        RTA00000125A.m.9.1
13864        79.B6.sp6:130033.Seq
39180        RTA00000126A.n.8.2
39180        79.A7.sp6:130022.Seq
39458        RTA00000126A.p.15.2
39458        79.D7.sp6:130058.Seq
39490        RTA00000128A.b.4.1

Clone Name        Cluster ID   Sequence Name
M00001511A:H06    39412        RTA00000133A.k.17.1
M00001511A:H06    39412        89.C5.sp6:130695.Seq
M00001512A:A09    39186        89.D5.sp6:130707.Seq
M00001512A:A09    39186        RTA00000121A.p.15.1
M00001512D:G09    3956         89.E5.sp6:130719.Seq
M00001512D:G09    3956         173.H5.SP6:134157.Seq
M00001512D:G09    3956         RTA00000183AF.g.3.1
M00001513B:G03                 RTA00000183AF.g.9.1
M00001513B:G03                 89.F5.sp6:130731.Seq
M00001513B:G03                 RTA00000183AF.g.9.1.Seq_THC198280
M00001513C:E08    14364        RTA00000183AF.g.12.1
M00001513C:E08    14364        89.G5.sp6:130743.Seq
M00001514C:D11    40044        RTA00000183AF.g.22.1
M00001514C:D11    40044        RTA00000183AF.g.22.1.Seq_THC232899
M00001514C:D11    40044        89.H5.sp6:130755.Seq
M00001518C:B11    8952         89.A6.sp6:130672.Seq
M00001518C:B11    8952         RTA00000183AF.h.15.1
M00001528B:H04    8358         89.D6.sp6:130708.Seq
M00001528B:H04    8358         RTA00000183AF.i.5.1
M00001531A:D01    38085        RTA00000123A.e.15.1
M00001531A:D01    38085        89.E6.sp6:130720.Seq
M00001534A:C04    16921        RTA00000183AF.k.6.1
M00001534A:C04    16921        89.H6.sp6:130756.Seq
M00001534A:D09    5097         RTA00000134A.k.1.1
M00001534A:D09    5097         RTA00000134A.k.1.1.Seq_THC215869
M00001534C:A01    4119         RTA00000183AF.k.16.1
M00001534C:A01    4119         89.C7.sp6:130697.Seq
M00001535A:C06    20212        89.E7.sp6:130721.Seq
M00001535A:C06    20212        RTA00000134A.1.22.1.Seq_THC128232
M00001535A:C06    20212        RTA00000134A.1.22.1
M00001536A:B07    2696         RTA00000134A.m.13.1
M00001536A:B07    2696         89.F7.sp6:130733.Seq
M00001537A:F12    39420        89.H7.sp6:130757.Seq
M00001537A:F12    39420        RTA00000134A.o.23.1
M00001540A:D06    8286         89.B8.sp6:130686.Seq
M00001540A:D06    8286         RTA00000183AF.o.1.1
M00001542A:E06    39453        89.E8.sp6:130722.Seq
M00001542A:E06    39453        RTA00000135A.g.11.1
M00001544A:E06                 RTA00000184AF.a.8.1
M00001544A:E06                 173.G7.SP6:134147.Seq
M00001544A:E06                 89.H8.sp6:130758.Seq
M00001545A:B02                 89.B9.sp6:130687.Seq
M00001545A:B02                 RTA00000135A.I.2.2
M00001548A:E10    5892         89.E9.sp6:130723.Seq
M00001548A:E10    5892         RTA00000184AF.d.11.1
M00001548A:E10    5892         RTA00000184AF.d.11.1.Seq_THC161896
M00001549C:E06    16347        89.H9.sp6:130759.Seq
M00001549C:E06    16347        RTA00000184AF.e.15.1
M00001550A:A03    7239         89.A10.sp6:130676.Seq
M00001550A:A03    7239         RTA00000126A.m.4.2
M00001550A:G01    5175         RTA00000184AF.f.3.1
M00001550A:G01    5175         89.B10.sp6:130688.Seq

Clone Name        Cluster ID   Sequence Name
M00001551A:G06    22390        RTA00000136A.j.13.1
M00001551A:G06    22390        89.C10.sp6:130700.Seq
M00001551C:G09    3266         RTA00000184AR.g.1.1
M00001551C:G09    3266         89.D10.sp6:130712.Seq
M00001553A:H06    8298         RTA00000127A.d.19.1
M00001553A:H06    8298         89.G10.sp6:130748.Seq
M00001553B:F12    4573         89.H10.sp6:130760.Seq
M00001553B:F12    4573         RTA00000184AF.h.9.1
M00001555A:B02    39539        RTA00000127A.i.21.1
M00001555A:B02    39539        89.B11.sp6:130689.Seq
M00001555A:C01    39195        89.C11.sp6:130701.Seq
M00001555A:C01    39195        RTA00000137A.c.16.1
M00001555D:G10    4561         RTA00000184AF.i.21.1
M00001555D:G10    4561         89.D11.sp6:130713.Seq
M00001556A:C09    9244         89.E11.sp6:130725.Seq
M00001556A:C09    9244         RTA00000127A.I.3.1
M00001556B:G02    11294        RTA00000184AF.j.6.1
M00001556B:G02    11294        89.A12.sp6:130678.Seq
M00001557B:H10    5192         173.E9.SP6:134125.Seq
M00001557B:H10    5192         RTA00000184AF.k.2.1
M00001557B:H10    5192         89.D12.sp6:130714.Seq
M00001557D:D09    8761         RTA00000184AF.k.12.1
M00001557D:D09    8761         89.E12.sp6:130726.Seq
M00001558B:H11    7514         RTA00000184AF.k.21.1
M00001558B:H11    7514         89.G12.sp6:130750.Seq
M00001559B:F01                 89.H12.sp6:130762.Seq
M00001559B:F01                 RTA00000184AF.1.11.1
M00001560D:F10    6558         90.A1.sp6:130859.Seq
M00001560D:F10    6558         RTA00000184AF.m.21.1
M00001566B:D11                 RTA00000184AF.p.3.1
M00001566B:D11                 90.D1.sp6:130895.Seq
M00001583D:A10    6293         RTA00000185AF.e.11.1
M00001583D:A10    6293         90.A2.sp6:130860.Seq
M00001590B:F03                 RTA00000185AF.g.11.1
M00001590B:F03                 90.C2.sp6:130884.Seq
M00001597D:C05    10470        RTA00000185AF.k.6.1
M00001597D:C05    10470        90.F2.sp6:130920.Seq
M00001598A:G03    16999        90.G2.sp6:130932.Seq
M00001598A:G03    16999        RTA00000185AF.k.9.1
M00001601A:D08    22794        RTA00000138A.b.5.1
M00001601A:D08    22794        90.H2.sp6:130944.Seq
M00001607A:E11    11465        RTA00000185AF.m.19.1
M00001607A:E11    11465        90.A3.sp6:130861.Seq
M00001608A:B03    7802         RTA00000185AF.n.5.1
M00001608A:B03    7802         90.B3.sp6:130873.Seq
M00001608B:E03    22155        RTA00000185AF.n.9.1
M00001608B:E03    22155        90.C3.sp6:130885.Seq
M00001608D:A11                 RTA00000185AF.n.12.1
M00001608D:A11                 90.D3.sp6:130897.Seq
M00001614C:F10    13157        RTA00000186AF.a.6.1
M00001614C:F10    13157        90.E3.sp6:130909.Seq
M00001617C:E02    17004        RTA00000186AF.b.21.1

Clone Name        Cluster ID   Sequence Name
M00001617C:E02    17004        90.F3.sp6:130921.Seq
M00001619C:F12    40314        90.G3.sp6:130933.Seq
M00001619C:F12    40314        RTA00000186AF.c.15.1
M00001621C:C08    40044        RTA00000186AF.d.1.1
M00001621C:C08    40044        RTA00000186AF.d.1.1.Seq_THC232899
M00001621C:C08    40044        90.H3.sp6:130945.Seq
M00001621C:C08    40044        122.E1.sp6:132121.Seq
M00001623D:F10    13913        RTA00000186AF.e.6.1
M00001623D:F10    13913        90.A4.sp6:130862.Seq
M00001632D:H07                 RTA00000186AF.h.14.1.Seq_THC112525
M00001632D:H07                 RTA00000186AF.h.14.1
M00001632D:H07                 90.E4.sp6:130910.Seq
M00001632D:H07                 176.A3.sp6:134514.Seq
M00001644C:B07    39171        RTA00000186AF.1.7.1
M00001644C:B07    39171        90.F4.sp6:130922.Seq
M00001644C:B07    39171        217.A12.sp6:119169.Seq
M00001645A:C12    19267        RTA00000186AF.1.12.1.Seq_THC178183
M00001645A:C12    19267        176.G3.sp6:134586.Seq
M00001645A:C12    19267        RTA00000186AF.1.12.1
M00001645A:C12    19267        90.G4.sp6:130934.Seq
M00001648C:A01    4665         90.H4.sp6:130946.Seq
M00001648C:A01    4665         RTA00000186AF.m.3.1
M00001657D:C03    23201        RTA00000187AF.a.14.1
M00001657D:C03    23201        90.B5.sp6:130875.Seq
M00001657D:F08    76760        90.C5.sp6:130887.Seq
M00001657D:F08    76760        RTA00000187AF.a.15.1
M00001662C:A09    23218        RTA00000187AR.c.5.2
M00001662C:A09    23218        90.D5.sp6:130899.Seq
M00001663A:E04    35702        90.E5.sp6:130911.Seq
M00001663A:E04    35702        RTA00000187AR.c.15.2
M00001669B:F02    6468         90.F5.sp6:130923.Seq
M00001669B:F02    6468         RTA00000187AF.d.15.1
M00001670C:H02    14367        90.G5.sp6:130935.Seq
M00001670C:H02    14367        RTA00000187AF.e.8.1
M00001673C:H02    7015         90.H5.sp6:130947.Seq
M00001673C:H02    7015         RTA00000187AF.f.18.1
M00001675A:C09    8773         RTA00000187AF.f.24.1
M00001675A:C09    8773         90.A6.sp6:130864.Seq
M00001675A:C09    8773         RTA00000187AF.f.24.1.Seq_THC220002
M00001676B:F05    11460        RTA00000187AF.g.12.1
M00001676B:F05    11460        90.B6.sp6:130876.Seq
M00001676B:F05    11460        219.F2.sp6:139035.Seq
M00001677D:A07    7570         90.D6.sp6:130900.Seq
M00001677D:A07    7570         RTA00000187AF.g.24.1
M00001677D:A07    7570         RTA00000187AF.g.24.1.Seq_THC168636
M00001678D:F12    4416         90.E6.sp6:130912.Seq
M00001678D:F12    4416         RTA00000187AF.h.13.1
M00001679A:F10    26875        RTA00000187AF.i.1.1
M00001679A:F10    26875        90.A7.sp6:130865.Seq
M00001679B:F01    6298         90.B7.sp6:130877.Seq
M00001679B:F01    6298         RTA00000187AR.i.10.2
M00001680D:F08    10539        90.F7.sp6:130925.Seq

Clone Name        Cluster ID   Sequence Name
M00001680D:F08    10539        219.F6.sp6:139039.Seq
M00001680D:F08    10539        RTA00000187AF.1.7.1
M00001682C:B12    17055        90.G7.sp6:130937.Seq
M00001682C:B12    17055        RTA00000187AF.m.3.1
M00001682C:B12    17055        176.D6.sp6:134553.Seq
M00001688C:F09    5382         90.A8.sp6:130866.Seq
M00001688C:F09    5382         RTA00000187AF.m.23.2
M00001693C:G01    4393         RTA00000187AF.n.17.1
M00001693C:G01    4393         90.B8.sp6:130878.Seq
M00001716D:H05    67252        RTA00000187AF.o.6.1
M00001716D:H05    67252        90.C8.sp6:130890.Seq
M00003741D:C09    40108        90.D8.sp6:130902.Seq
M00003741D:C09    40108        RTA00000187AF.o.24.1
M00003747D:C05    11476        RTA00000187AF.p.19.1
M00003747D:C05    11476        90.E8.sp6:130914.Seq
M00003747D:C05    11476        RTA00000187AF.p.19.1.Seq_THC108482
M00003747D:C05    11476        219.H8.sp6:139065.Seq
M00003754C:E09                 90.F8.sp6:130926.Seq
M00003754C:E09                 RTA00000188AF.b.12.1
M00003761D:A09                 RTA00000188AF.d.11.1
M00003761D:A09                 90.H8.sp6:130950.Seq
M00003761D:A09                 RTA00000188AF.d.11.1.Seq_THC212094
M00003762C:B08    17076        RTA00000188AF.d.21.1.Seq_THC208760
M00003762C:B08    17076        90.A9.sp6:130867.Seq
M00003762C:B08    17076        RTA00000188AF.d.21.1
M00003763A:F06    3108         RTA00000188AF.d.24.1
M00003763A:F06    3108         90.B9.sp6:130879.Seq
M00003774C:A03    67907        RTA00000188AF.g.11.1.Seq_THC123222
M00003774C:A03    67907        RTA00000188AF.g.11.1
M00003774C:A03    67907        90.C9.sp6:130891.Seq
M00003784D:D12                 RTA00000188AF.i.8.1
M00003784D:D12                 90.D9.sp6:130903.Seq
M00003839A:D08    7798         RTA00000189AF.c.18.1
M00003839A:D08    7798         90.A10.sp6:130868.Seq
M00003851B:D08                 90.D10.sp6:130904.Seq
M00003851B:D08                 RTA00000189AF.f.7.1
M00003851B:D10    13595        90.E10.sp6:130916.Seq
M00003851B:D10    13595        RTA00000189AF.f.8.1
M00003853A:D04    5619         90.F10.sp6:130928.Seq
M00003853A:D04    5619         RTA00000189AF.f.17.1
M00003853A:F12    10515        90.G10.sp6:130940.Seq
M00003853A:F12    10515        RTA00000189AF.f.18.1
M00003856B:C02    4622         90.H10.sp6:130952.Seq
M00003856B:C02    4622         RTA00000189AF.g.1.1
M00003857A:H03    4718         90.B11.sp6:130881.Seq
M00003857A:H03    4718         RTA00000189AF.g.5.1.Seq_THC196102
M00003857A:H03    4718         RTA00000189AF.g.5.1
Clone Name 
M00003867A:D10 
M00003867A:D10 
M00003871C:E02 
M00003875C:G07 
M00003875C:G07 
M00003875D:D11 
M00003875D:D11 
M00003876D:E12 
M00003876D:E12 
M00003906C:E10 
M00003906C:E10 
M00003907D:A09 
M00003907D:A09 
M00003907D:A09 
M00003907D:H04 
M00003907D:H04 
M00003909D:C03 
M00003909D:C03 
M00003968B:F06 
M00003968B:F06 
M00003970C:B09 
M00003970C:B09 
M00003970C:B09 
M00003974D:E07 
M00003974D:E07 
M00003974D:E07 
M00003974D:H02 
M00003974D:H02 
M00003974D:H02 
M00003981A:E10 
M00003981A:E10 
M00003982C:C02 
M00003982C:C02 
M00003982C:C02 
M00004028D:C05 
M00004028D:C05 
M00004035C:A07 
M00004035C:A07 
M00004035D:B06 
M00004035D:B06 
M00004072A:C03 
M00004072A:C03 
M00004081C:D10 
M00004081C:D10 
M00004086D:G06 
M00004086D:G06 
M00004105C:A04 
M00004105C:A04 
M00004171D:B03 
M00004171D:B03 
M00004185C:C03 
M00004185C:C03 



Cluster ID 



4573 
8479 
8479 



7798 

7798 

9285 

9285 

39809 

39809 

39809 

16317 

16317 

8672 

8672 

24488 

24488 

40122 

40122 

40122 

23210 

23210 

23210 

23358 

23358 

23358 

3430 

3430 

2433 

2433 

2433 

40073 

40073 

37285 

37285 

17036 

17036 



15069 

15069 

9285 

9285 

7221 

7221 

4908 

4908 

11443 

11443 



Sequence Name 

90.C11.sp6:130893.Seq 

RTA00000189AF.h.17.1 

RTA00000189AF.j.12.1 

90.G11.sp6:130941.Seq 

RTA00000189AF.j.22.1 

90.H11.sp6:130953.Seq 

RTA00000189AF.j.23.1 

90.A12.sp6:130870.Seq 

RTA00000189AF.k.12.1 

90.H12.sp6:130954.Seq 

RTA00000190AF.d.7.1 

99.A1.sp6:131230.Seq 

RTA00000190AF.e.3.1.Seq_THC150217 

RTA00000190AF.e.3.1 

99.B1.sp6:131242.Seq 

RTA00000190AF.e.6.1 

RTA00000190AF.f.11.1 

99.C1.sp6:131254.Seq 

RTA00000190AF.n.16.1 

99.C2.sp6:131255.Seq 

RTA00000190AF.n.23.1 

RTA00000190AF.n.23.1.Seq_THC109227 

99.D2.sp6:131267.Seq 

RTA00000190AF.o.20.1 

RTA00000190AF.o.20.1.Seq_THC207240 

99.E2.sp6:131279.Seq 

RTA00000190AF.o.21.1.Seq_THC207240 

RTA00000190AF.o.21.1 

99.F2.sp6:131291.Seq 

99.A3.sp6:131232.Seq 

RTA00000191AF.a.9.1 

RTA00000191AF.a.15.2 

99.B3.sp6:131244.Seq 

RTA00000191AF.a.15.2.Seq_THC79498 

RTA00000191AF.e.3.1 

99.E3.sp6:131280.Seq 

99.H3.sp6:131316.Seq 

RTA00000191AF.f.11.1 

RTA00000191AF.f.13.1 

99.A4.sp6:131233.Seq 

RTA00000191AF.j.9.1 

99.D4.sp6:131269.Seq 

99.F4.sp6:131293.Seq 

RTA00000191AF.1.6.1 

99.H4.sp6:131317.Seq 

RTA00000191AF.m.18.1 

99.D5.sp6:131270.Seq 

RTA00000191AF.p.9.1 

RTA00000192AF.j.2.1 

99.F6.sp6:131295.Seq 

RTA00000192AF.1.13.2 

123.A8.sp6:132272.Seq 
Clone Name 

M00004185C:C03 

M00004191D:B11 

M00004191D:B11 

M00004191D:B11 

M00004197D:H01 

M00004197D:H01 

M00004197D:H01 

M00004203B:C12 

M00004203B:C12 

M00004214C:H05 

M00004214C:H05 

M00004223D:E04 

M00004223D:E04 

M00004269D:D06 

M00004269D:D06 

M00004295D:F12 

M00004295D:F12 

M00004296C:H07 

M00004296C:H07 

M00004307C:A06 

M00004307C:A06 

M00004307C:A06 

M00004312A:G03 

M00004312A:G03 

M00004312A:G03 

M00004318C:D10 

M00004318C:D10 

M00004359B:G02 

M00004359B:G02 

M00004505D:F08 

M00004505D:F08 

M00004692A:H08 

M00004692A:H08 

M00004692A:H08 

M00005180C:G03 



Cluster ID 
11443 



8210 

8210 

8210 

14311 

14311 

11451 

11451 

12971 

12971 

4905 

4905 

16921 

16921 

13046 

13046 

9457 

9457 

9457 

26295 

26295 

26295 

21847 

21847 



Sequence Name 

99.A7.sp6:131236.Seq 

RTA00000192AF.m.12.1 

99.B7.sp6:131248.Seq 

123.C8.sp6:132296.Seq 

99.C7.sp6:131260.Seq 

123.E8.sp6:132320.Seq 

RTA00000192AF.n.13.1 

99.D7.sp6:131272.Seq 

RTA00000192AF.o.2.1 

177.D8.sp6:134747.Seq 

RTA00000192AF.p.17.1 

RTA00000193AF.a.20.1 

99.B8.sp6:131249.Seq 

99.H8.sp6:131321.Seq 

RTA00000193AF.e.14.1 

99.D9.sp6:131274.Seq 

RTA00000193AF.h.15.1 

99.E9.sp6:131286.Seq 

RTA00000193AF.h.19.1 

RTA00000193AF.i.14.2 

99.F9.sp6:131298.Seq 

123.D11.sp6:132311.Seq 

RTA00000193AF.i.24.2 

99.G9.sp6:131310.Seq 

RTA00000193AF.i.24.2.Seq_THC197345 

RTA00000193AF.j.9.1 

99.H9.sp6:131322.Seq 

RTA00000193AF.m.5.1.Seq_THC173318 

RTA00000193AF.m.5.1 

RTA00000194AF.b.19.1 

99.H10.sp6:131323.Seq 

99.B11.sp6:131252.Seq 

RTA00000194AF.c.24.1 

377.F4.sp6:141957.Seq 

RTA00000194AF.f.4.1 
Clone Name 
M00001346D:E03 
M00001350A:B08 
M00001350A:B08 
M00001357D:D11 
M00001357D:D11 
M00001409C:D12 
M00001409C:D12 
M00001418B:F03 
M00001418B:F03 
M00001418B:F03 
M00001418D:B06 
M00001421C:F01 
M00001421C:F01 
M00001429B:A11 
M00001432C:F06 
M00001439C:F08 
M00001442C:D07 
M00001442C:D07 
M00001443B:F01 
M00001443B:F01 
M00001445A:F05 
M00001445A:F05 
M00001446A:F05 
M00001455A:E09 
M00001455A:E09 
M00001460A:F12 
M00001481D:A05 
M00001490B:C04 
M00001490B:C04 
M00001500C:E04 
M00001500C:E04 
M00001532B:A06 
M00001532B:A06 
M00001534A:F09 
M00001534A:F09 
M00001535A:B01 
M00001536A:C08 
M00001536A:C08 
M00001541A:F07 
M00001542B:B01 
M00001542B:B01 
M00001544A:E03 
M00001545A:C03 
M00001545A:C03 
M00001545A:C03 
M00001548A:H09 
M00001548A:H09 
M00001548A:H09 
M00001549A:B02 
M00001549A:B02 
M00001549A:D08 
M00001552B:D04 



Cluster ID 
6806 



4059 
4059 
9577 
9577 
9952 
9952 
9952 
8526 
9577 
9577 
4635 

40054 
16731 
16731 



13532 

13532 

7801 

13238 

13238 

39498 

7985 

18699 

18699 

9443 

9443 

3990 

3990 

5321 

5321 

7665 

39392 

39392 

22085 



12170 

19255 

19255 

19255 

1058 

1058 

1058 

4015 

4015 

10944 

5708 



Sequence Name 

RTA00000177AF.g.13.3 

80.H2.sp6:130293.Seq 

RTA00000177AF.i.6.2 

RTA00000177AF.n.18.3.Seq_THC123051 

RTA00000177AF.n.18.3 

RTA00000179AF.o.17.1 

80.E7.sp6:130262.Seq 

RTA00000180AF.c.20.1 

RTA00000180AF.c.20.1.Seq_THC162284 

80.E8.sp6:130263.Seq 

RTA00000180AF.d.1.1 

RTA00000180AF.d.23.1 

80.G8.sp6:130287.Seq 

RTA00000180AF.i.20.1 

RTA00000180AF.k.24.1 

RTA00000180AF.p.10.1 

RTA00000181AF.a.20.1 

80.C10.sp6:130241.Seq 

80.D10.sp6:130253.Seq 

RTA00000181AF.b.7.1 

80.E10.sp6:130265.Seq 

RTA00000181AF.c.4.1 

RTA00000181AF.c.21.1 

RTA00000181AF.m.4.1 

RTA00000181AF.m.4.1.Seq_THC140691 

RTA00000119A.j.20.1 

RTA00000182AR.j.2.1 

RTA00000182AF.m.16.1 

89.D3.sp6:130705.Seq 

89.B4.sp6:130682.Seq 

RTA00000183AF.c.1.1 

89.G6.sp6:130744.Seq 

RTA00000183AF.j.11.1 

89.B7.sp6:130685.Seq 

RTA00000183AF.k.8.1 

RTA00000134A.1.19.1 

89.G7.sp6:130745.Seq 

RTA00000134A.m.16.1 

RTA00000135A.e.5.2 

RTA00000183AF.p.4.1 

89.F8.sp6:130734.Seq 

RTA00000125A.h.18.4 

RTA00000135A.m.18.1 

184.B10.sp6:135547.Seq 

89.C9.sp6:130699.Seq 

RTA00000126A.e.20.3.Seq_THC217534 

RTA00000126A.e.20.3 

79.F6.sp6:130081.Seq 

RTA00000136A.e.12.1 

79.G6.sp6:130093.Seq 

RTA00000126A.h.17.2 

RTA00000184AF.g.12.1 
Clone Name        Cluster ID  Sequence Name 

M00001552B:D04    5708        89.E10.sp6:130724.Seq 
M00001552D:A01                89.F10.sp6:130736.Seq 
M00001552D:A01                RTA00000184AF.g.22.1 
M00001553D:D10    22814       RTA00000184AF.h.14.1 
M00001553D:D10    22814       89.A11.sp6:130677.Seq 
M00001558A:H05                RTA00000128A.c.20.1 
M00001558A:H05                89.F12.sp6:130738.Seq 
M00001561A:C05    39486       RTA00000128A.m.22.2 
M00001561A:C05    39486       79.B8.sp6:130035.Seq 
M00001564A:B12    5053        RTA00000184AF.o.12.1 
M00001578B:E04    23001       RTA00000185AF.c.24.1 
M00001579D:C03    6539        90.G1.sp6:130931.Seq 
M00001579D:C03    6539        173.A12.sp6:134080.Seq 
M00001579D:C03    6539        RTA00000185AF.d.11.1 
M00001582D:F05                RTA00000185AF.d.24.1 
M00001587A:B11    39380       RTA00000129A.e.24.1 
M00001587A:B11    39380       79.E8.sp6:130071.Seq 
M00001604A:F05    39391       RTA00000138A.c.3.1 
M00001604A:F05    39391       79.A9.sp6:130024.Seq 
M00001624A:B06    3277        RTA00000138A.1.5.1 
M00001624A:B06    3277        217.E1.sp6:139406.Seq 
M00001624A:B06    3277        90.B4.sp6:130874.Seq 
M00001630B:H09    5214        90.D4.sp6:130898.Seq 
M00001630B:H09    5214        122.C2.sp6:132098.Seq 
M00001630B:H09    5214        RTA00000186AF.g.11.1 
M00001651A:H01                RTA00000186AF.n.7.1 
M00001651A:H01                90.A5.sp6:130863.Seq 
M00001677C:E10    14627       RTA00000187AF.g.23.1 
M00001679C:F01    78091       90.C7.sp6:130889.Seq 
M00001679C:F01    78091       RTA00000187AF.j.6.1 
M00001679C:F01    78091       176.G5.sp6:134588.Seq 
M00001686A:E06    4622        RTA00000187AF.m.15.2 
M00003796C:D05    5619        RTA00000188AF.1.9.1.Seq_THC167845 
M00003796C:D05    5619        RTA00000188AF.1.9.1 
M00003826B:A06    11350       RTA00000189AF.a.24.2 
M00003826B:A06    11350       90.F9.sp6:130927.Seq 
M00003833A:E05    21877       RTA00000189AF.b.21.1 
M00003837D:A01    7899        90.H9.sp6:130951.Seq 
M00003837D:A01    7899        RTA00000189AF.c.10.1 
M00003846B:D06    6874        RTA00000189AF.e.9.1 
M00003846B:D06    6874        90.C10.sp6:130892.Seq 
M00003879B:D10    31587       RTA00000189AF.1.20.1 
M00003879B:D10    31587       90.C12.sp6:130894.Seq 
M00003879D:A02    14507       90.D12.sp6:130906.Seq 
M00003879D:A02    14507       RTA00000189AR.1.23.2 
M00003891C:H09                90.G12.sp6:130942.Seq 
M00003891C:H09                RTA00000189AF.p.8.1 
M00003912B:D01    12532       99.D1.sp6:131266.Seq 
M00003912B:D01    12532       RTA00000190AF.g.2.1 
M00004072B:B05    17036       RTA00000191AF.j.10.1 
M00004081C:D12    14391       RTA00000191AF.1.7.1 
M00004111D:A08    6874        RTA00000192AF.a.14.1 
Clone Name        Cluster ID  Sequence Name 

M00004111D:A08    6874        99.F5.sp6:131294.Seq 
M00004121B:G01                177.H4.sp6:134791.Seq 
M00004121B:G01                99.H5.sp6:131318.Seq 
M00004121B:G01                RTA00000192AF.c.2.1 
M00004138B:H02    13272       99.A6.sp6:131235.Seq 
M00004138B:H02    13272       RTA00000192AF.e.3.1 
M00004151D:B08    16977       RTA00000192AF.g.3.1 
M00004169C:C12    5319        99.E6.sp6:131283.Seq 
M00004169C:C12    5319        RTA00000192AF.i.12.1 
M00004169C:C12    5319        123.F7.sp6:132331.Seq 
M00004183C:D07    16392       RTA00000192AF.1.1.1 
M00004183C:D07    16392       RTA00000192AF.1.1.1.Seq_THC202071 
M00004230B:C07    7212        RTA00000193AF.b.14.1 
M00004230B:C07    7212        99.D8.sp6:131273.Seq 
M00004249D:F10                RTA00000193AF.c.21.1.Seq_THC222602 
M00004249D:F10                RTA00000193AF.c.21.1 
M00004275C:C11    16914       99.A9.sp6:131238.Seq 
M00004275C:C11    16914       RTA00000193AF.f.5.1 
M00004283B:A04    14286       RTA00000193AF.f.22.1 
M00004285B:E08    56020       RTA00000193AF.g.2.1 
M00004327B:H04                RTA00000193AF.j.20.1 
M00004377C:F05    2102        RTA00000193AF.n.7.1 
M00004384C:D02                RTA00000193AF.n.15.1 
M00004384C:D02                RTA00000193AF.n.15.1.Seq_THC215687 
M00004461A:B08                RTA00000194AR.a.10.2 
M00004461A:B09                RTA00000194AF.a.11.1 
M00004691D:A05                RTA00000194AF.c.23.1 
M00004896A:C07                RTA00000194AF.d.13.1 



The above material has been deposited with the American Type Culture Collection, 
Rockville, Maryland, under the accession number indicated. This deposit will be maintained 
under the terms of the Budapest Treaty on the International Recognition of the Deposit of 
Microorganisms for purposes of Patent Procedure. The deposit will be maintained for a 
period of 30 years following issuance of this patent, or for the enforceable life of the patent, 
whichever is greater. Upon issuance of the patent, the deposit will be available to the public 
from the ATCC without restriction. 

This deposit is provided merely as a convenience to those of skill in the art, and is not 
an admission that a deposit is required under 35 U.S.C. §112. The sequence of the 
polynucleotides contained within the deposited material, as well as the amino acid sequence 
of the polypeptides encoded thereby, are incorporated herein by reference and are controlling 
in the event of any conflict with the written description of sequences herein. A license may 
be required to make, use, or sell the deposited material, and no such license is granted 
hereby. 



Retrieval of Individual Clones from Deposit of Pooled Clones 
Where the ATCC deposit is composed of a pool of cDNA clones, the deposit was 
prepared by first transfecting each of the clones into separate bacterial cells. The clones 
were then deposited as a pool of equal mixtures in the composite deposit. Particular clones 
can be obtained from the composite deposit using methods well known in the art. For 
example, a bacterial cell containing a particular clone can be identified by isolating single 
colonies, and identifying colonies containing the specific clone through standard colony 
hybridization techniques, using an oligonucleotide probe or probes designed to specifically 
hybridize to a sequence of the clone insert (e.g., a probe based upon unmasked sequence of 
the encoded polynucleotide having the indicated SEQ ID NO). The probe should be 
designed to have a Tm of approximately 80°C (assuming 2°C for each A or T and 4°C for 
each G or C). Positive colonies can then be picked, grown in culture, and the recombinant 
clone isolated. Alternatively, probes designed in this manner can be used in PCR to isolate a 
nucleic acid molecule from the pooled clones according to methods well known in the art, 
e.g., by purifying the cDNA from the deposited culture pool, and using the probes in PCR 
reactions to produce an amplified product having the corresponding desired polynucleotide 
sequence. 
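
The melting-temperature rule of thumb stated above (2°C for each A or T, 4°C for each G or C) can be illustrated with a short calculation. The following Python sketch is an illustration only and is not part of the deposit procedure: the function names wallace_tm and pick_probe, the made-up example sequence, and the simple strategy of extending a probe from a fixed start position until the estimate reaches the target are all assumptions introduced here. A real probe design would also need to account for specificity and masked regions.

```python
def wallace_tm(probe: str) -> int:
    """Estimate Tm in degrees C using the rule in the text:
    2 degrees for each A or T, 4 degrees for each G or C."""
    probe = probe.upper()
    at = probe.count("A") + probe.count("T")
    gc = probe.count("G") + probe.count("C")
    if at + gc != len(probe):
        raise ValueError("probe must contain only A, C, G and T")
    return 2 * at + 4 * gc


def pick_probe(insert: str, target_tm: int = 80, start: int = 0) -> str:
    """Hypothetical helper: extend a probe from `start` one base at a time
    until its estimated Tm reaches `target_tm`."""
    for end in range(start + 1, len(insert) + 1):
        candidate = insert[start:end]
        if wallace_tm(candidate) >= target_tm:
            return candidate
    raise ValueError("insert is too short to reach the target Tm")


# Made-up insert sequence, used only to exercise the functions.
example_insert = "ATGGCCTTAGCGATCGGATCCATGCAAGTTCGA"
probe = pick_probe(example_insert)
print(probe, len(probe), wallace_tm(probe))
```

Because each added base raises the estimate by 2°C or 4°C, the returned probe has an estimated Tm of either exactly the target or 2°C above it.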
Table 1. Sequence identification numbers, cluster ID, sequence name, and clone name 

SEQ ID NO:  Cluster ID  Sequence Name           Clone Name 

1           4635        RTA00000180AF.i.20.1    M00001429B:A11 
2                       RTA00000185AF.n.12.1    M00001608D:A11 
3           4622        RTA00000187AF.m.15.2    M00001686A:E06 
4           3706        RTA00000191AF.i.17.2    M00004068B:A01 
5           36535       RTA00000181AF.f.5.1     M00001449A:G10 
6           3990        RTA00000183AF.j.11.1    M00001532B:A06 
7           5319        RTA00000192AF.i.12.1    M00004169C:C12 
8           36393       RTA00000180AF.c.2.1     M00001417A:E02 
9           2623        RTA00000183AF.a.6.1     M00001497A:G02 
10          7587        RTA00000178AF.n.24.1    M00001387B:G03 
11          7065        RTA00000137A.g.6.1      M00001557A:D02 
12          10539       RTA00000187AF.I.7.1     M00001680D:F08 
13          27250       RTA00000181AF.g.10.1    M00001450A:D08 
14          5556        RTA00000179AF.n.10.1    M00001407B:D11 
15                      RTA00000192AF.m.12.1    M00004191D:B11 
16          8761        RTA00000184AF.k.12.1    M00001557D:D09 
17          4622        RTA00000189AF.g.1.1     M00003856B:C02 
18          11460       RTA00000187AF.g.12.1    M00001676B:F05 
19          16283       RTA00000120A.o.20.1     M00001467A:D08 
20          3430        RTA00000191AF.a.9.1     M00003981A:E10 
21          7065        RTA00000184AF.j.21.1    M00001557A:D02 
22                      RTA00000182AF.1.20.1    M00001488B:F12 
23                      RTA00000123A.g.19.1     M00001531A:H11 
24          16918       RTA00000193AF.a.16.1    M00004223A:G10 
25          16914       RTA00000193AF.f.5.1     M00004275C:C11 
26          40108       RTA00000187AF.o.24.1    M00003741D:C09 
27          14286       RTA00000193AF.f.22.1    M00004283B:A04 
28          17004       RTA00000186AF.b.21.1    M00001617C:E02 
29                      RTA00000180AF.g.22.1    M00001426B:D12 
30          13272       RTA00000192AF.e.3.1     M00004138B:H02 
31                      RTA00000194AF.f.4.1     M00005180C:G03 
32          32663       RTA00000118A.1.8.1      M00001450A:A11 
33                      RTA00000180AF.a.9.1     M00001414A:B01 
34          5832        RTA00000178AF.o.23.1    M00001388D:G05 
35          7801        RTA00000181AF.c.21.1    M00001446A:F05 
36          76760       RTA00000187AF.a.15.1    M00001657D:F08 
37          40132       RTA00000178AF.c.7.1     M00001365C:C10 
38                      RTA00000183AF.e.1.1     M00001505C:C05 
39          4016        RTA00000118A.c.4.1      M00001395A:C03 
40          5382        RTA00000187AF.m.23.2    M00001688C:F09 

SEQ ID NO:  Cluster ID  Sequence Name           Clone Name 

41          5693        RTA00000190AF.p.17.2    M00003978B:G05 
42          307         RTA00000136A.o.4.2      M00001552A:B12 
43          39833       RTA00000178AF.i.23.1    M00001378B:B02 
44                      RTA00000193AF.m.5.1     M00004359B:G02 
45          5325        RTA00000191AF.o.6.1     M00004093D:B12 
46          5325        RTA00000191AF.o.6.2     M00004093D:B12 
47          18957       RTA00000190AR.m.9.1     M00003958A:H02 
48          39508       RTA00000120A.o.2.1      M00001467A:D04 
49          22390       RTA00000136A.j.13.1     M00001551A:G06 
50          12170       RTA00000125A.h.18.4     M00001544A:E03 
51          4393        RTA00000187AF.n.17.1    M00001693C:G01 
52          19          RTA00000182AF.b.7.1     M00001463C:B11 
53                      RTA00000193AF.c.21.1    M00004249D:F10 
54          7899        RTA00000189AF.c.10.1    M00003837D:A01 
55          40073       RTA00000191AF.e.3.1     M00004028D:C05 
56          7005        RTA00000179AF.o.22.1    M00001410A:D07 
57                      RTA00000187AF.h.22.1    M00001679A:F06 
58          18957       RTA00000190AF.m.9.2     M00003958A:H02 
59          18957       RTA00000183AF.H.23.1    M00001528A:F09 
60          16283       RTA00000182AF.c.22.1    M00001467A:D08 
61          6974        RTA00000183AF.d.9.1     M00001504C:H06 
62          2623        RTA00000183AF.b.14.1    M00001500A:E11 
63          9105        RTA00000191AF.a.21.2    M00003983A:A05 
64          13238       RTA00000181AF.m.4.1     M00001455A:E09 
65          5749        RTA00000185AF.3.19.1    M00001571C:H06 
66          6455        RTA00000193AF.b.9.1     M00004229B:F08 
67          23001       RTA00000185AF.c.24.1    M00001578B:E04 
68          6455        RTA00000192AF.g.23.1    M00004157C:A09 
69          13595       RTA00000189AF.f.8.1     M00003851B:D10 
70          39442       RTA00000120A.o.21.1     M00001467A:E10 
71          17036       RTA00000191AF.f.13.1    M00004035D:B06 
72                      RTA00000183AF.g.9.1     M00001513B:G03 
73          7005        RTA00000181AF.k.24.1    M00001454B:C12 
74          6268        RTA00000126A.o.23.1     M00001551A:B10 
75          16130       RTA00000119A.c.13.1     M00001453A:E11 
76          23201       RTA00000187AF.a.14.1    M00001657D:C03 
77          5321        RTA00000183AF.k.8.1     M00001534A:F09 
78          13157       RTA00000186AF.a.6.1     M00001614C:F10 
79          2102        RTA00000193AF.n.7.1     M00004377C:F05 
80          1058        RTA00000126A.e.20.3     M00001548A:H09 
81          40392       RTA00000180AF.j.8.1     M00001429D:D07 
82                      RTA00000183AF.e.23.1    M00001506D:A09 
83          11476       RTA00000187AF.p.19.1    M00003747D:C05 



SEQ ID NO:  Cluster ID  Sequence Name           Clone Name 

84          3584        RTA00000177AF.h.20.1    M00001349B:B08 
85          10470       RTA00000180AF.f.18.1    M00001424B:G09 
86          39425       RTA00000133A.f.1.1      M00001470A:C04 
87          5175        RTA00000184AF.f.3.1     M00001550A:G01 
88          13576       RTA00000189AF.o.13.1    M00003885C:A02 
89          7665        RTA00000134A.1.19.1     M00001535A:B01 
90          16927       RTA00000177AF.h.9.3     M00001348B:B04 
91          6660        RTA00000187AF.H.15.1    M00001679A:A06 
92          2433        RTA00000191AF.a.15.2    M00003982C:C02 
93          5097        RTA00000134A.k.1.1      M00001534A:D09 
94          21847       RTA00000193AF.j.9.1     M00004318C:D10 
95          3277        RTA00000138A.1.5.1      M00001624A:B06 
96          5708        RTA00000184AF.g.12.1    M00001552B:D04 
97          945         RTA00000178AR.a.20.1    M00001362C:H11 
98          16269       RTA00000178AF.p.1.1     M00001389A:C08 
99                      RTA00000183AF.c.24.1    M00001504A:E01 
100         16731       RTA00000181AF.a.20.1    M00001442C:D07 
101         12439       RTA00000190AF.o.24.1    M00003975A:G11 
102         3162        RTA00000177AF.j.12.3    M00001351B:A08 
103                     RTA00000194AF.b.19.1    M00004505D:F08 
104                     RTA00000193AF.n.15.1    M00004384C:D02 
105                     RTA00000186AF.n.7.1     M00001651A:H01 
106         10717       RTA00000181AF.d.10.1    M00001447A:G03 
107         4573        RTA00000189AF.j.12.1    M00003871C:E02 
108                     RTA00000186AF.h.14.1    M00001632D:H07 
109         11443       RTA00000192AF.1.13.2    M00004185C:C03 
110         5892        RTA00000184AF.d.11.1    M00001548A:E10 
111         3162        RTA00000177AF.j.12.1    M00001351B:A08 
112         10470       RTA00000185AF.k.6.1     M00001597D:C05 
113         17055       RTA00000187AF.m.3.1     M00001682C:B12 
114         2030        RTA00000193AF.m.20.1    M00004372A:A03 
115         6558        RTA00000184AF.m.21.1    M00001560D:F10 
116         23255       RTA00000190AF.j.4.1     M00003922A:E06 
117         9577        RTA00000179AF.o.17.1    M00001409C:D12 
118                     RTA00000180AF.a.11.1    M00001414C:A07 
119         8           RTA00000181AF.e.17.1    M00001448D:C09 
120         67907       RTA00000188AF.g.11.1    M00003774C:A03 
121         12081       RTA00000133A.d.14.2     M00001469A:C10 
122         2448        RTA00000119A.j.21.1     M00001460A:F06 
123         3389        RTA00000189AF.g.3.1     M00003857A:G10 
124         39174       RTA00000124A.n.13.1     M00001541A:H03 
125         24488       RTA00000190AF.n.16.1    M00003968B:F06 
126         8210        RTA00000192AF.n.13.1    M00004197D:H01 
SEQ ID NO:  Cluster ID  Sequence Name           Clone Name 

127                     RTA00000135A.1.2.2      M00001545A:B02 
128         40455       RTA00000190AF.m.10.2    M00003958C:G10 
129         9577        RTA00000180AF.d.23.1    M00001421C:F01 
130         13183       RTA00000192AF.a.24.1    M00004114C:F11 
131         5214        RTA00000186AF.g.11.1    M00001630B:H09 
132         67252       RTA00000187AF.o.6.1     M00001716D:H05 
133         3108        RTA00000188AF.d.24.1    M00003763A:F06 
134         2464        RTA00000178AF.n.18.1    M00001387A:C05 
135         36313       RTA00000181AF.e.23.1    M00001448D:H01 
136         23255       RTA00000177AF.e.14.3    M00001343D:H07 
137         7985        RTA00000182AR.j.2.1     M00001481D:A05 
138         8286        RTA00000183AF.o.1.1     M00001540A:D06 
139         22195       RTA00000180AF.g.7.1     M00001425B:H08 
140         4573        RTA00000184AF.h.9.1     M00001553B:F12 
141         26875       RTA00000187AF.i.1.1     M00001679A:F10 
142         7187        RTA00000177AF.L8.2      M00001350A:H01 
143         86859       RTA00000118A.p.8.1      M00001452A:B12 
144         4623        RTA00000185AF.f.4.1     M00001586C:C05 
145                     RTA00000121A.c.10.1     M00001469A:A01 
146         10185       RTA00000183AF.d.5.1     M00001504C:A07 
147                     RTA00000183AF.p.4.1     M00001542B:B01 
148         15069       RTA00000191AF.1.6.1     M00004081C:D10 
149         39304       RTA00000118A.j.21.1     M00001450A:A02 
150         8672        RTA00000190AF.f.11.1    M00003909D:C03 
151         13576       RTA00000177AF.g.16.1    M00001347A:B10 
152         6293        RTA00000185AF.e.11.1    M00001583D:A10 
153         16977       RTA00000192AF.g.3.1     M00004151D:B08 
154         5345        RTA00000189AF.I.19.1    M00003879B:C11 
155         4905        RTA00000193AF.e.14.1    M00004269D:D06 
156         17036       RTA00000191AF.j.10.1    M00004072B:B05 
157         5417        RTA00000191AF.h.19.1    M00004059A:D06 
158         7172        RTA00000178AF.f.9.1     M00001371C:E09 
159         40044       RTA00000186AF.d.1.1     M00001621C:C08 
160         4386        RTA00000184AF.j.4.1     M00001556B:C08 
161         40044       RTA00000183AF.g.22.1    M00001514C:D11 
162         9685        RTA00000183AF.c.11.1    M00001501D:C02 
163         22155       RTA00000185AF.n.9.1     M00001608B:E03 
164         10515       RTA00000189AF.f.18.1    M00003853A:F12 
165         6539        RTA00000185AF.d.11.1    M00001579D:C03 
166         15066       RTA00000180AF.e.24.1    M00001423B:E07 
167         4261        RTA00000180AF.h.5.1     M00001426D:C08 
168         13864       RTA00000125A.m.9.1      M00001545A:D08 
169         6539        RTA00000189AF.d.22.1    M00003844C:B11 
SEQ ID NO:  Cluster ID  Sequence Name           Clone Name 

170         11465       RTA00000185AF.m.19.1    M00001607A:E11 
171         3266        RTA00000184AR.g.1.1     M00001551C:G09 
172         102         RTA00000184AF.o.5.1     M00001563B:F06 
173         16970       RTA00000181AR.U8.2      M00001452C:B06 
174         12971       RTA00000193AF.a.20.1    M00004223D:E04 
175         5007        RTA00000177AF.g.2.1     M00001346A:F09 
176         3765        RTA00000135A.d.1.1      M00001541A:D02 
177         11294       RTA00000184AF.j.6.1     M00001556B:G02 
178         3681        RTA00000131A.g.15.2     M00001449A:D12 
179         9283        RTA00000181AR.m.21.2    M00001455D:F09 
180         18699       RTA00000182AF.m.16.1    M00001490B:C04 
181         86110       RTA00000181AF.f.12.1    M00001449C:D06 
182         39648       RTA00000178AR.I.8.2     M00001383A:C03 
183         7337        RTA00000123A.b.17.1     M00001528A:C04 
184         1334        RTA00000178AF.j.7.1     M00001379A:A05 
185         17076       RTA00000188AF.d.21.1    M00003762C:B08 
186         22794       RTA00000138A.b.5.1      M00001601A:D08 
187         39171       RTA00000186AF.1.7.1     M00001644C:B07 
188         8551        RTA00000179AF.p.21.1    M00001412B:B10 
189         5857        RTA00000118A.g.14.1     M00001449A:A12 
190         9443        RTA00000183AF.c.1.1     M00001500C:E04 
191         9457        RTA00000193AF.i.14.2    M00004307C:A06 
192         7206        RTA00000182AF.o.15.1    M00001494D:F06 
193         22979       RTA00000178AF.k.22.1    M00001382C:A02 
194         40455       RTA00000190AR.m.10.1    M00003958C:G10 
195         7221        RTA00000191AF.p.9.1     M00004105C:A04 
196                     RTA00000191AF.j.9.1     M00004072A:C03 
197         7239        RTA00000126A.m.4.2      M00001550A:A03 
198         31587       RTA00000189AF.1.20.1    M00003879B:D10 
199         16317       RTA00000190AF.e.6.1     M00003907D:H04 
200         13576       RTA00000189AR.o.13.1    M00003885C:A02 
201         5779        RTA00000177AF.g.14.3    M00001346D:G06 
202         6124        RTA00000191AR.e.2.3     M00004028D:A06 
203         9952        RTA00000180AF.c.20.1    M00001418B:F03 
204                     RTA00000188AF.i.8.1     M00003784D:D12 
205         5779        RTA00000177AF.g.14.1    M00001346D:G06 
206         39490       RTA00000128A.b.4.1      M00001557A:F03 
207         4416        RTA00000187AF.h.13.1    M00001678D:F12 
208         4009        RTA00000179AF.e.20.1    M00001396A:C03 
209         5336        RTA00000183AF.b.13.1    M00001500A:C05 
210         39186       RTA00000121A.p.15.1     M00001512A:A09 
211         40122       RTA00000190AF.n.23.1    M00003970C:B09 
212         12532       RTA00000190AF.g.2.1     M00003912B:D01 
SEQ ID NO:  Cluster ID  Sequence Name           Clone Name 

213         8078        RTA00000177AR.1.13.1    M00001353A:G12 
214         3900        RTA00000190AF.g.13.1    M00003914C:F05 
215         7589        RTA00000120A.p.23.1     M00001468A:F05 
216         8298        RTA00000127A.d.19.1     M00001553A:H06 
217         4443        RTA00000177AF.b.20.4    M00001341A:E12 
218         26295       RTA00000193AF.i.24.2    M00004312A:G03 
219         3389        RTA00000183AF.m.19.1    M00001537B:G07 
220         7015        RTA00000187AF.f.18.1    M00001673C:H02 
221         8526        RTA00000180AF.d.1.1     M00001418D:B06 
222         4665        RTA00000186AF.m.3.1     M00001648C:A01 
223         1399        RTA00000129A.o.10.1     M00001604A:B10 
224         9244        RTA00000127A.1.3.1      M00001556A:C09 
225                     RTA00000179AF.j.13.1    M00001400B:H06 
226         82498       RTA00000118A.m.10.1     M00001450A:B12 
227         35702       RTA00000187AR.c.15.2    M00001663A:E04 
228         38759       RTA00000120A.m.12.3     M00001467A:B07 
229         39648       RTA00000178AF.1.8.1     M00001383A:C03 
230         19105       RTA00000133A.e.15.1     M00001469A:H12 
231         85064       RTA00000131A.m.23.1     M00001452A:F05 
232         9285        RTA00000191AF.m.18.1    M00004086D:G06 
233         9285        RTA00000190AF.d.7.1     M00003906C:E10 
234         39391       RTA00000138A.c.3.1      M00001604A:F05 
235                     RTA00000178AF.d.20.1    M00001368D:E03 
236         39498       RTA00000119A.j.20.1     M00001460A:F12 
237         7798        RTA00000189AF.k.12.1    M00003876D:E12 
238         7798        RTA00000189AF.c.18.1    M00003839A:D08 
239         19829       RTA00000125A.h.24.4     M00001544A:G02 
240                     RTA00000188AF.d.11.1    M00003761D:A09 
241         4275        RTA00000120A.j.14.1     M00001466A:E07 
242         22113       RTA00000125A.c.7.1      M00001542A:A09 
243         40314       RTA00000186AF.c.15.1    M00001619C:F12 
244         10944       RTA00000126A.h.17.2     M00001549A:D08 
245         39809       RTA00000190AF.e.3.1     M00003907D:A09 
246         22085       RTA00000135A.e.5.2      M00001541A:F07 
247         19255       RTA00000135A.m.18.1     M00001545A:C03 
248         14311       RTA00000192AF.o.2.1     M00004203B:C12 
249         8479        RTA00000189AF.j.22.1    M00003875C:G07 
250                     RTA00000189AF.j.23.1    M00003875D:D11 
251         4193        RTA00000184AF.e.13.1    M00001549B:F06 
252         22814       RTA00000184AF.h.14.1    M00001553D:D10 
253         39563       RTA00000179AF.k.20.1    M00001402A:E08 
254         39420       RTA00000134A.o.23.1     M00001537A:F12 
255         11589       RTA00000177AF.b.17.4    M00001340D:F10 



SEQ ID NO:  Cluster ID  Sequence Name           Clone Name 

256         4937        RTA00000191AF.p.21.1    M00004108A:E06 
257         39412       RTA00000133A.k.17.1     M00001511A:H06 
258         4837        RTA00000185AR.k.3.2     M00001597C:H02 
259         13046       RTA00000193AF.h.19.1    M00004296C:H07 
260         4141        RTA00000177AF.p.20.3    M00001361A:A05 
261         38085       RTA00000123A.e.15.1     M00001531A:D01 
262                     RTA00000189AF.p.8.1     M00003891C:H09 
263         11451       RTA00000192AF.p.17.1    M00004214C:H05 
264         14507       RTA00000189AR.1.23.2    M00003879D:A02 
265         40054       RTA00000180AF.p.10.1    M00001439C:F08 
266         39423       RTA00000134A.k.22.1     M00001535A:F10 
267         39453       RTA00000135A.g.11.1     M00001542A:E06 
268         10751       RTA00000187AF.k.7.1     M00001679D:D03 
269         10751       RTA00000187AF.k.6.1     M00001679D:D03 
270         78091       RTA00000187AF.j.6.1     M00001679C:F01 
271         39539       RTA00000127A.i.21.1     M00001555A:B02 
272                     RTA00000182AF.1.15.1    M00001487B:H06 
273                     RTA00000194AF.d.13.1    M00004896A:C07 
274                     RTA00000128A.c.20.1     M00001558A:H05 
275         9283        RTA00000181AR.m.22.2    M00001455D:F09 
276         39168       RTA00000121A.1.10.1     M00001507A:H05 
277         39458       RTA00000126A.p.15.2     M00001552A:D11 
278         14391       RTA00000177AF.m.17.3    M00001355B:G10 
279         39195       RTA00000137A.c.16.1     M00001555A:C01 
280         7212        RTA00000193AF.b.14.1    M00004230B:C07 
281         4015        RTA00000136A.e.12.1     M00001549A:B02 
282         12977       RTA00000189AF.j.19.1    M00003875B:F04 
283                     RTA00000178AF.m.13.1    M00001384B:A11 
284         14391       RTA00000191AF.1.7.1     M00004081C:D12 
285                     RTA00000194AF.c.23.1    M00004691D:A05 
286                     RTA00000181AF.b.7.1     M00001443B:F01 
287         8358        RTA00000183AF.i.5.1     M00001528B:H04 
288         1267        RTA00000125A.o.5.1      M00001546A:G11 
289                     RTA00000189AF.f.7.1     M00003851B:D08 
290         16347       RTA00000184AF.e.15.1    M00001549C:E06 
291         7899        RTA00000193AF.a.17.1    M00004223B:D09 
292         2379        RTA00000178AF.a.6.1     M00001361D:F08 
293         39478       RTA00000133A.i.5.1      M00001471A:B01 
294         39392       RTA00000134A.m.16.1     M00001536A:C08 
295         5053        RTA00000184AF.o.12.1    M00001564A:B12 
296         16999       RTA00000185AF.k.9.1     M00001598A:G03 
297         39180       RTA00000126A.n.8.2      M00001551A:F05 
298         1037        RTA00000121A.f.8.1      M00001470A:B10 
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SEQ ID NO: Cluster ID 


Sequence Name 


Clone Name 


299 


6867 


RTA00000178AF.e.l2.1 


M00001370A:C09 


300 


10539 


RTA00000183AF.a.24.1 


M00001499B:A11 


301 


41633 


RTA00000118A.g.l6.1 


M00001449A:B12 


302 


23218 


RTA00000187AR.C.5.2 


M00001662C:A09 


303 


39380 


RTA00000129A.e.24.1 


M00001587A:B11 


304 




RTA00000185AF.d.24.1 


M00001582D:F05 


305 




RTA00000177AF.O.4.3 


M00001358C:C06 


306 


6974 


RTA00000184AF.a.l5.1 


M00001544B:B07 


307 




RTA00000185AF.g.ll.l 


M00001590B:F03 


308 


15855 


RTA00000184AF.j.l.l 


M00001556A:H01 


309 


84328 


RTA00000118A.p.l0.1 


M00001452A:B04 


310 


10145 


RTA00000120A.g.l2.1 


M00001465A:B11 


311 


39805 


RTA00000177AF.C.21.3 


M00001342B:E06 


312 




RTA00000187AF.h.23.1 


M00001679A:F06 


313 


6298 


RTA00000187AR.U0.2 


M00001679B:F01 


314 


14367 


RTA00000187AF.e.8.1 


M00001670C:H02 


315 




RTA00000193AF.C.22.1 


M00004249D:G12 


316 


16921 


RTA00000183AF.k.6.1 


M00001534A:C04 


317 


1577 


RTA00000184AF.L23.1 


M00001556A:F11 


318 


8773 


RTA00000187AF.f.24.1 


M00001675A:C09 


319 




RTA00000194AF.a.11.1


M00004461A:B09 


320 


39886 


RTA00000178AF.j.24.1 


M00001380D:B09 


321 


13532 


RTA00000181AF.c.4.1


M00001445A:F05


322 




RTA00000193AF.d.2.1 


M00004251C:G07 


323 


5257 


RTA00000192AF.O.1 


M00004146C:C11 


324 


9061 


RTA00000191AR.e.11.2


M00004031A:A12 


325 


19267 


RTA00000186AF.l.12.1


M00001645A:C12 


326 


20212 


RTA00000134A.l.22.1


M00001535A:C06 


327 


16653 


RTA00000181AF.k.5.3 


M00001453C:F06 


328 


16985 


RTA00000177AF.h.10.1


M00001348B:G06 


329 


12977 


RTA00000189AR.j.19.1


M00003875B:F04 


330 


9061 


RTA00000191AR.e.11.3


M00004031A:A12 


331 




RTA00000194AR.a.10.2


M00004461A:B08 


332 


6468 


RTA00000187AF.d.15.1


M00001669B:F02 


333 


16392 


RTA00000192AF.l.1.1


M00004183C:D07 


334 


14627 


RTA00000187AF.g.23.1 


M00001677C:E10 


335 


6583 


RTA00000179AF.d.13.1


M00001394A:F01 


336 


6806 


RTA00000177AF.g.13.3


M00001346D:E03 


337 


9635 


RTA00000137A.e.23.4 


M00001557A:F01


338 


689 


RTA00000181AR.l.22.1


M00001454D:G03 


339 


4119 


RTA00000183AF.k.16.1


M00001534C:A01 


340 


8952 


RTA00000183AF.h.15.1


M00001518C:B11


341 


2379 


RTA00000192AF.p.8.1 


M00004212B:C07 
SEQ ID NO: Cluster ID


Sequence Name 


342 


39486 


RTA00000128A.m.22.2 


343 


21877 


RTA00000189AF.b.21.1 


344 


6874 


RTA00000192AF.a.14.1


345 


6874 


RTA00000189AF.e.9.1 


346 


37285 


RTA00000191AF.f.11.1


347 




RTA00000193AF.j.20.1 


348 


7674 


RTA00000118A.g.9.1 


349 


2797 


RTA00000180AF.i.19.1


350 




RTA00000184AF.g.22.1 


351 


7802 


RTA00000185AF.n.5.1 


352 


16921 


RTA00000193AF.h.15.1


353 


11494 


RTA00000192AF.j.6.1 


354 


17062 


RTA00000177AF.b.8.4


355 


16245 


RTA00000177AF.k.9.3 


356 


83103 


RTA00000119A.e.24.2 


357 


4309 


RTA00000186AF.e.22.1 


358 


13072 


RTA00000181AR.m.5.2 


359 


4059 


RTA00000177AF.n.18.3


360 


5178 


RTA00000178AF.n.10.1


361 


1120 


RTA00000118A.p.15.3


362 


6420 


RTA00000183AF.d.11.1


363 


13913 


RTA00000186AF.e.6.1 


364 




RTA00000192AF.c.2.1


365 


3956 


RTA00000183AF.g.3.1 


366 


14364 


RTA00000183AF.g.12.1


367 


6880 


RTA00000191AF.m.20.1 


368 


84182 


RTA00000180AF.h.19.1


369 


2790 


RTA00000177AF.e.2.1 


370 


4561 


RTA00000184AF.i.21.1 


371 


8847 


RTA00000180AF.b.16.1


372 


56020 


RTA00000193AF.g.2.1 


373 


1531 


RTA00000119A.o.3.1


374 


6420 


RTA00000177AF.f.10.3


375 




RTA00000188AF.b.12.1


376 




RTA00000180AF.k.24.1 


377 




RTA00000184AF.a.8.1 


378 


2696 


RTA00000134A.m.13.1


379 


260 


RTA00000185AR.L12.2 


380 


11350 


RTA00000189AF.a.24.2 


381 


2428 


RTA00000123A.l.21.1


382 


4313 


RTA00000122A.n.3.1 


383 




RTA00000184AF.p.3.1 


384 


697 


RTA00000188AF.d.6.1 






PCT/US98/27610 
Clone Name 

M00001561A:C05 

M00003833A:E05 

M00004111D:A08 

M00003846B:D06 

M00004035C:A07 

M00004327B:H04 

M00001416A:H01

M00001429A:H04 

M00001552D:A01 

M00001608A:B03 

M00004295D:F12 

M00004172C:D08 

M00001340B:A06 

M00001352A:E02 

M00001454A:A09 

M00001624C:F01 

M00001455B:E12 

M00001357D:D11 

M00001386C:B12 

M00001452A:D08 

M00001504D:G06 

M00001623D:F10 

M00004121B:G01 

M00001512D:G09 

M00001513C:E08

M00004087D:A01 

M00001428A:H10 

M00001343C:F10 

M00001555D:G10 

M00001416B:H11 

M00004285B:E08 

M00001461A:D06 

M00001345A:E01

M00003754C:E09 

M00001432C:F06 

M00001544A:E06 

M00001536A:B07 

M00001594B:H04 

M00003826B:A06 

M00001533A:C11 

M00001517A:B07 

M00001566B:D11 

M00003759B:B09 



WO 99/33982 
SEQ ID NO: Cluster ID Sequence Name



385 


5619 


RTA00000188AF.l.9.1


386 


4568 


RTA00000122A.d.15.3


387 




RTA00000177AF.i.6.2 


388 


5622 


RTA00000178AF.a.11.1


389 


7514 


RTA00000184AF.k.21.1 


390 


5619 


RTA00000189AF.f.17.1


391 


7570 


RTA00000187AF.g.24.1 


392 


23358 


RTA00000190AF.o.21.1


393 


23210 


RTA00000190AF.o.20.1


394 


5192 


RTA00000184AF.k.2.1 


395 


13538 


RTA00000180AF.a.24.1 


396 




RTA00000189AF.h.17.1


397 




RTA00000192AF.o.11.1


398 




RTA00000184AF.l.11.1


399 


4718 


RTA00000189AF.g.5.1 


400 


14929 


RTA00000177AF.m.1.2


401 


4908 


RTA00000192AF.j.2.1 


402 




RTA00000178AF.k.16.1


403 




RTA00000194AF.c.24.1


404 


17732 


RTA00000178AR.i.2.2 


405 


17062 


80.A1.sp6:130208.Seq


406 


11589 


80.B1.sp6:130220.Seq


407 


4443 


80.C1.sp6:130232.Seq


408 


39805 


80.D1.sp6:130244.Seq


409 


2790 


80.E1.sp6:130256.Seq


410 


23255 


80.F1.sp6:130268.Seq


411 


6420 


80.G1.sp6:130280.Seq


412 


5007 


80.H1.sp6:130292.Seq


413 


13576 


80.D2.sp6:130245.Seq 


414 


16927 


80.E2.sp6:130257.Seq 


415 


16985 


80.F2.sp6:130269.Seq 


416 


3584 


80.G2.sp6:130281.Seq


417 




80.H2.sp6:130293.Seq 


418 


7187 


80.A3.sp6:130210.Seq 


419 


16245 


80.D3.sp6:130246.Seq 


420 


8078 


80.E3.sp6:130258.Seq 


421 


14929 


80.F3.sp6:130270.Seq 


422 


14391 


80.G3.sp6:130282.Seq 


423 


4141 


80.B4.sp6:130223.Seq 


424 


2379 


80.C4.sp6:130235.Seq 


425 


5622 


80.D4.sp6:130247.Seq 


426 


945 


80.E4.sp6:130259.Seq 


427 


40132 


80.F4.sp6:130271.Seq
Clone Name

M00003796C:D05

M00001513A:B06 

M00001350A:B08 

M00001362B:D10 

M00001558B:H11 

M00003853A:D04 

M00001677D:A07 

M00003974D:H02 

M00003974D:E07 

M00001557B:H10 

M00001415A:H06 

M00003867A:D10 

M00004205D:F06 

M00001559B:F01 

M00003857A:H03 

M00001353D:D10 

M00004171D:B03 

M00001381D:E06 

M00004692A:H08 

M00001376B:G06 

M00001340B:A06 

M00001340D:F10 

M00001341A:E12 

M00001342B:E06 

M00001343C:F10 

M00001343D:H07 

M00001345A:E01 

M00001346A:F09 

M00001347A:B10 

M00001348B:B04 

M00001348B:G06 

M00001349B:B08 

M00001350A:B08 

M00001350A:H01 

M00001352A:E02 

M00001353A:G12 

M00001353D:D10 

M00001355B:G10 

M00001361A:A05 

M00001361D:F08 

M00001362B:D10 

M00001362C:H11 

M00001365C:C10 
SEQ ID NO: Cluster ID


Sequence Name 


Clone Name 


428 




80.G4.sp6:130283.Seq 


M00001368D:E03


429 


6867 


80.H4.sp6:130295.Seq 


M00001370A:C09 


430 


7172 


80.A5.sp6:130212.Seq 


M00001371C:E09 


431 


17732 


80.B5.sp6:130224.Seq 


M00001376B:G06 


432 


39833 


80.C5.sp6:130236.Seq 


M00001378B:B02 


433 


1334 


80.D5.sp6:130248.Seq 


M00001379A:A05 


434 


39886 


80.E5.sp6:130260.Seq 


M00001380D:B09 


435 




80.F5.sp6:130272.Seq 


M00001381D:E06 


436 


22979 


80.G5.sp6:130284.Seq 


M00001382C:A02 


437 


39648 


80.H5.sp6:130296.Seq 


M00001383A:C03 


438 




80.B6.sp6:130225.Seq 


M00001384B:A11 


439 


5178 


80.C6.sp6:130237.Seq 


M00001386C:B12 


440 


2464 


80.D6.sp6:130249.Seq 


M00001387A:C05 


441 


7587 


80.E6.sp6:130261.Seq 


M00001387B:G03 


442 


5832 


80.F6.sp6:130273.Seq 


M00001388D:G05 


443 


16269 


80.G6.sp6:130285.Seq 


M00001389A:C08 


444 


6583 


80.H6.sp6:130297.Seq 


M00001394A:F01 


445 


4009 


80.A7.sp6:130214.Seq 


M00001396A:C03


446 




80.B7.sp6:130226.Seq 


M00001400B:H06 


447 


39563 


80.C7.sp6:130238.Seq 


M00001402A:E08 


448 


5556 


80.D7.sp6:130250.Seq 


M00001407B:D11


449 


9577 


80.E7.sp6:130262.Seq 


M00001409C:D12 


450 


7005 


80.F7.sp6:130274.Seq 


M00001410A:D07 


451 


8551 


80.G7.sp6:130286.Seq 


M00001412B:B10 


452 




80.H7.sp6:130298.Seq 


M00001414A:B01 


453 




80.A8.sp6:130215.Seq 


M00001414C:A07 


454 


13538 


80.B8.sp6:130227.Seq 


M00001415A:H06 


455 


8847 


80.C8.sp6:130239.Seq 


M00001416B:H11 


456 


36393 


80.D8.sp6:130251.Seq 


M00001417A:E02 


457 


9952 


80.E8.sp6:130263.Seq 


M00001418B:F03 


458 


9577 


80.G8.sp6:130287.Seq 


M00001421C:F01 


459 


15066 


80.H8.sp6:130299.Seq 


M00001423B:E07 


460 


10470 


80.A9.sp6:130216.Seq 


M00001424B:G09 


461 


22195 


80.B9.sp6:130228.Seq 


M00001425B:H08 


462 




80.C9.sp6:130240.Seq 


M00001426B:D12 


463 


4261 


80.D9.sp6:130252.Seq 


M00001426D:C08 


464 


84182 


80.E9.sp6:130264.Seq 


M00001428A:H10 


465 


40392 


80.H9.sp6:130300.Seq 


M00001429D:D07 


466 


16731 


80.C10.sp6:130241.Seq 


M00001442C:D07 


467 




80.D10.sp6:130253.Seq 


M00001443B:F01 


468 


13532 


80.E10.sp6:130265.Seq 


M00001445A:F05 


469 


8 


80.H10.sp6:130301.Seq 


M00001448D:C09 


470 


36313 


80.A11.sp6:130218.Seq


M00001448D:H01 
SEQ ID NO: Cluster ID


Sequence Name 


Clone Name 


471 


5857 


80.B11.sp6:130230.Seq


M00001449A:A12 


472 


41633 


80.C11.sp6:130242.Seq


M00001449A:B12 


473 


36535 


80.D11.sp6:130254.Seq


M00001449A:G10 


474 


86110 


80.E11.sp6:130266.Seq


M00001449C:D06 


475 


32663 


80.F11.sp6:130278.Seq


M00001450A:A11 


476 


27250 


80.G11.sp6:130290.Seq


M00001450A:D08 


477 


16970 


80.H11.sp6:130302.Seq


M00001452C:B06 


478 


16130 


80.A12.sp6:130219.Seq 


M00001453A:E11


479 


16653 


80.B12.sp6:130231.Seq 


M00001453C:F06 


480 


7005 


80.C12.sp6:130243.Seq 


M00001454B:C12 


481 


13072 


80.F12.sp6:130279.Seq 


M00001455B:E12 


482 


9283 


80.G12.sp6:130291.Seq 


M00001455D:F09 


483 


23255 


100.C1.sp6:131446.Seq


M00001343D:H07 


484 


13576 


100.E1.sp6:131470.Seq


M00001347A:B10 


485 


7187 


100.C2.sp6:131447.Seq 


M00001350A:H01 


486 


14391 


100.E3.sp6:131472.Seq 


M00001355B:G10 


487 


945 


100.E4.sp6:131473.Seq 


M00001362C:H11


488 


7172 


100.A5.sp6:131426.Seq 


M00001371C:E09 


489 


39648 


100.A6.sp6:131427.Seq 


M00001383A:C03 


490 


84182 


100.G9.sp6:131502.Seq 


M00001428A:H10 


491 


8 


100.B11.sp6:131444.Seq


M00001448D:C09 


492 


36535 


100.D11.sp6:131468.Seq


M00001449A:G10 


493 


82498 


100.F11.sp6:131492.Seq


M00001450A:B12 


494 


16970 


100.C12.sp6:131457.Seq 


M00001452C:B06 


495 


16130 


100.D12.sp6:131469.Seq 


M00001453A:E11


496 


7005 


121.D1.sp6:131917.Seq


M00001454B:C12 


497 




121.G6.sp6:131958.Seq 


M00001506D:A09 


498 


18957 


121.F7.sp6:131947.Seq 


M00001528A:F09 


499 


40044 


122.E1.sp6:132121.Seq


M00001621C:C08 


500 


5214 


122.C2.sp6:132098.Seq 


M00001630B:H09 


501 


6660 


122.B5.sp6:132089.Seq 


M00001679A:A06 


502 


13183 


123.D5.sp6:132305.Seq 


M00004114C:F11


503 


6455 


123.E7.sp6:132319.Seq 


M00004157C:A09 


504 


5319 


123.F7.sp6:132331.Seq


M00004169C:C12 


505 


11443 


123.A8.sp6:132272.Seq 


M00004185C:C03 


506 




123.C8.sp6:132296.Seq 


M00004191D:B11 


507 


8210 


123.E8.sp6:132320.Seq 


M00004197D:H01 


508 


9457 


123.D11.sp6:132311.Seq


M00004307C:A06 


509 


6420 


172.E1.sp6:133925.Seq


M00001345A:E01 


510 


16245 


172.D2.sp6:133914.Seq 


M00001352A:E02 


511 


8078 


172.C3.sp6:133903.Seq 


M00001353A:G12 


512 


14929 


172.D3.sp6:133915.Seq 


M00001353D:D10 


513 


14391 


172.H3.sp6:133963.Seq 


M00001355B:G10 
SEQ ID NO: Cluster ID


Sequence Name 


514 


6583 


172.B8.sp6:133896.Seq 


515 


4009 


172.D8.sp6:133920.Seq 


516 




172.B9.sp6:133897.Seq 


517 




176.A3.sp6:134514.Seq 


518 


19267 


176.G3.sp6:134586.Seq 


519 


78091 


176.G5.sp6:134588.Seq 


520 


17055 


176.D6.sp6:134553.Seq 


521 


6539 


176.D9.sp6:134556.Seq 


522 




177.H4.sp6:134791.Seq


523 


5257 


177.F5.sp6:134768.Seq 


524 


11494 


177.E6.sp6:134757.Seq 


525 




177.G7.sp6:134782.Seq 


526 


11451 


177.D8.sp6:134747.Seq 


527 


9283 


173.D2.SP6:134106.Seq 


528 


16283 


173.F3.SP6:134131.Seq 


529 


10539 


173.B5.SP6:134085.Seq 


530 


6420 


173.F5.SP6:134133.Seq 


531 


3956 


173.H5.SP6:134157.Seq 


532 




173.G7.SP6:134147.Seq 


533 


1577 


173.C9.SP6:134101.Seq 


534 


9635 


173.D9.SP6:134113.Seq 


535 


5192 


173.E9.SP6:134125.Seq 


536 


6539 


173.A12.SP6:134080.Seq 


537 


945 


180.C2.sp6:135940.Seq 


538 


7005 


180.H5.sp6:136003.Seq 


539 


39304 


180.G9.sp6:135995.Seq 


540 


27250 


180.B10.sp6:135936.Seq 


541 


35555 


184.A5.sp6:135530.Seq 


542 


19255 


184.B10.sp6:135547.Seq 


543 


6268 


184.C12.sp6:135561.Seq 


544 


3277 


217.E1.sp6:139406.Seq


545 


39171 


217.A12.sp6:139369.Seq 


546 


11460 


219.F2.sp6:139035.Seq 


547 


10539 


219.F6.sp6:139039.Seq 


548 


11476 


219.H8.sp6:139065.Seq 


549 


4016 


79.A1.sp6:130016.Seq


550 


7674 


79.C1.sp6:130040.Seq


551 


3681 


79.E1.sp6:130064.Seq


552 


39304 


79.F1.sp6:130076.Seq


553 


82498 


79.G1.sp6:130088.Seq


554 


84328 


79.A2.sp6:130017.Seq 


555 


86859 


79.B2.sp6:130029.Seq 


556 


1120 


79.C2.sp6:130041.Seq 






Clone Name 

M00001394A:F01 

M00001396A:C03 

M00001400B:H06 

M00001632D:H07 

M00001645A:C12 

M00001679C:F01 

M00001682C:B12 

M00003844C:B11 

M00004121B:G01 

M00004146C:C11 

M00004172C:D08 

M00004205D:F06 

M00004214C:H05 

M00001455D:F09 

M00001467A:D08 

M00001499B:A11 

M00001504D:G06 

M00001512D:G09 

M00001544A:E06 

M00001556A:F11 

M00001557A:F01 

M00001557B:H10 

M00001579D:C03 

M00001362C:H11 

M00001410A:D07 

M00001450A:A02 

M00001450A:D08 

M00001528A:C04 

M00001545A:C03 

M00001551A:B10 

M00001624A:B06 

M00001644C:B07 

M00001676B:F05 

M00001680D:F08 

M00003747D:C05 

M00001395A:C03 

M00001416A:H01 

M00001449A:D12 

M00001450A:A02 

M00001450A:B12 

M00001452A:B04 

M00001452A:B12 

M00001452A:D08 



SEQ ID NO: Cluster ID Sequence Name



557 


85064 


79.D2.sp6:130053.Seq 


558 


83103 


79.G2.sp6:130089.Seq 


559 


10145 


79.F3.sp6:130078.Seq 


560 


16283 


79.H3.sp6:130102.Seq 


561 


4568 


79.D4.sp6:130055.Seq 


562 


4313 


79.F4.sp6:130079.Seq 


563 


2428 


79.A5.sp6:130020.Seq 


564 


39423 


79.C5.sp6:130044.Seq 


565 


39174 


79.E5.sp6:130068.Seq 


566 


22113 


79.F5.sp6:130080.Seq 


567 


19829 


79.H5.sp6:130104.Seq 


568 


13864 


79.B6.sp6:130033.Seq 


569 


1058 


79.F6.sp6:130081.Seq 


570 


4015 


79.G6.sp6:130093.Seq 


571 


39180 


79.A7.sp6:130022.Seq 


572 


307 


79.C7.sp6:130046.Seq 


573 


39458 


79.D7.sp6:130058.Seq 


574 


39490 


79.G7.sp6:130094.Seq 


575 


39486 


79.B8.sp6:130035.Seq


576 


39380 


79.E8.sp6:130071.Seq 


577 


1399 


79.G8.sp6:130095.Seq 


578 


39391 


79.A9.sp6:130024.Seq 


579 


6268 


79.G9.sp6:130096.Seq 


580 




377.F4.sp6:141957.Seq


581 


2448 


89.A1.sp6:130667.Seq


582 


1531 


89.C1.sp6:130691.Seq


583 


19 


89.D1.sp6:130703.Seq


584 


38759 


89.F1.sp6:130727.Seq


585 


39508 


89.G1.sp6:130739.Seq


586 


16283 


89.H1.sp6:130751.Seq


587 


39442 


89.A2.sp6:130668.Seq 


588 


7589 


89.B2.sp6:130680.Seq 


589 




89.C2.sp6:130692.Seq 


590 


12081 


89.D2.sp6:130704.Seq 


591 


19105 


89.E2.sp6:130716.Seq 


592 


1037 


89.F2.sp6:130728.Seq 


593 


39425 


89.G2.sp6:130740.Seq 


594 


39478 


89.H2.sp6:130752.Seq 


595 




89.B3.sp6:130681.Seq


596 




89.C3.sp6:130693.Seq


597 


18699 


89.D3.sp6:130705.Seq 


598 


7206 


89.E3.sp6:130717.Seq


599 


2623 


89.F3.sp6:130729.Seq 






Clone Name 
M00001452A:F05 
M00001454A:A09 
M00001465A:B11 
M00001467A:D08 
M00001513A:B06 
M00001517A:B07 
M00001533A:C11 
M00001535A:F10 
M00001541A:H03 
M00001542A:A09 
M00001544A:G02 
M00001545A:D08 
M00001548A:H09 
M00001549A:B02 
M00001551A:F05 
M00001552A:B12 
M00001552A:D11 
M00001557A:F03 
M00001561A:C05 
M00001587A:B11 
M00001604A:B10 
M00001604A:F05 
M00001551A:B10 
M00004692A:H08 
M00001460A:F06 
M00001461A:D06 
M00001463C:B11 
M00001467A:B07 
M00001467A:D04 
M00001467A:D08 
M00001467A:E10 
M00001468A:F05 
M00001469A:A01 
M00001469A:C10 
M00001469A:H12 
M00001470A:B10 
M00001470A:C04 
M00001471A:B01 
M00001487B:H06 
M00001488B:F12 
M00001490B:C04 
M00001494D:F06 
M00001497A:G02 






SEQ ID NO: Cluster ID 


Sequence Name 


600 


10539 


89.G3.sp6:130741.Seq


601 


5336 


89.H3.sp6:130753.Seq 


602 


2623 


89.A4.sp6:130670.Seq 


603 


9443 


89.B4.sp6:130682.Seq 


604 


9685 


89.C4.sp6:130694.Seq 


605 




89.D4.sp6:130706.Seq 


606 


10185 


89.E4.sp6:130718.Seq 


607 


6974 


89.F4.sp6:130730.Seq 


608 


6420 


89.G4.sp6:130742.Seq 


609 




89.H4.sp6:130754.Seq 


610 




89.A5.sp6:130671.Seq


611 


39168 


89.B5.sp6:130683.Seq 


612 


39412 


89.C5.sp6:130695.Seq 


613 


39186 


89.D5.sp6:130707.Seq 


614 


3956 


89.E5.sp6:130719.Seq 


615 




89.F5.sp6:130731.Seq


616 


14364 


89.G5.sp6:130743.Seq 


617 


40044 


89.H5.sp6:130755.Seq 


618 


8952 


89.A6.sp6:130672.Seq 


619 


35555 


89.B6.sp6:130684.Seq 


620 


18957 


89.C6.sp6:130696.Seq 


621 


8358 


89.D6.sp6:130708.Seq 


622 


38085 


89.E6.sp6:130720.Seq 


623 




89.F6.sp6:130732.Seq 


624 


3990 


89.G6.sp6:130744.Seq 


625 


16921 


89.H6.sp6:130756.Seq 


626 


5321 


89.B7.sp6:130685.Seq 


627 


4119 


89.C7.sp6:130697.Seq 


628 


20212 


89.E7.sp6:130721.Seq


629 


2696 


89.F7.sp6:130733.Seq 


630 


39392 


89.G7.sp6:130745.Seq 


631 


39420 


89.H7.sp6:130757.Seq 


632 


3389 


89.A8.sp6:130674.Seq 


633 


8286 


89.B8.sp6:130686.Seq 


634 


3765 


89.C8.sp6:130698.Seq 


635 


39453 


89.E8.sp6:130722.Seq 


636 




89.F8.sp6:130734.Seq 


637 




89.H8.sp6:130758.Seq 


638 


6974 


89.A9.sp6:130675.Seq 


639 




89.B9.sp6:130687.Seq 


640 


19255 


89.C9.sp6:130699.Seq 


641 


1267 


89.D9.sp6:130711.Seq


642 


5892 


89.E9.sp6:130723.Seq 






Clone Name 

M00001499B:A11 

M00001500A:C05 

M00001500A:E11 

M00001500C:E04 

M00001501D:C02 

M00001504A:E01

M00001504C:A07 

M00001504C:H06 

M00001504D:G06 

M00001505C:C05 

M00001506D:A09 

M00001507A:H05 

M00001511A:H06 

M00001512A:A09 

M00001512D:G09 

M00001513B:G03 

M00001513C:E08 

M00001514C:D11 

M00001518C:B11 

M00001528A:C04 

M00001528A:F09 

M00001528B:H04 

M00001531A:D01 

M00001531A:H11 

M00001532B:A06 

M00001534A:C04 

M00001534A:F09 

M00001534C:A01 

M00001535A:C06 

M00001536A:B07 

M00001536A:C08 

M00001537A:F12 

M00001537B:G07 

M00001540A:D06 

M00001541A:D02 

M00001542A:E06 

M00001542B:B01 

M00001544A:E06 

M00001544B:B07 

M00001545A:B02 

M00001545A:C03

M00001546A:G11 

M00001548A:E10 



SEQ ID NO: Cluster ID Sequence Name



643 


4193 


89.G9.sp6:130747.Seq 


644 


16347 


89.H9.sp6:130759.Seq 


645 


7239 


89.A10.sp6:130676.Seq 


646 


5175 


89.B10.sp6:130688.Seq 


647 


22390 


89.C10.sp6:130700.Seq 


648 


3266 


89.D10.sp6:130712.Seq 


649 


5708 


89.E10.sp6:130724.Seq 


650 




89.F10.sp6:130736.Seq


651 


8298 


89.G10.sp6:130748.Seq 


652 


4573 


89.H10.sp6:130760.Seq 


653 


22814 


89.A11.sp6:130677.Seq


654 


39539 


89.B11.sp6:130689.Seq


655 


39195 


89.C11.sp6:130701.Seq


656 


4561 


89.D11.sp6:130713.Seq


657 


9244 


89.E11.sp6:130725.Seq


658 


1577 


89.F11.sp6:130737.Seq


659 


4386 


89.H11.sp6:130761.Seq


660 


11294 


89.A12.sp6:130678.Seq 


661 


5192 


89.D12.sp6:130714.Seq 


662 


8761 


89.E12.sp6:130726.Seq 


663 




89.F12.sp6:130738.Seq 


664 


7514 


89.G12.sp6:130750.Seq 


665 




89.H12.sp6:130762.Seq 


666 


6558 


90.A1.sp6:130859.Seq


667 


102 


90.B1.sp6:130871.Seq


668 




90.D1.sp6:130895.Seq


669 


5749 


90.E1.sp6:130907.Seq


670 


6539 


90.G1.sp6:130931.Seq


671 


6293 


90.A2.sp6:130860.Seq 


672 




90.C2.sp6:130884.Seq 


673 


260 


90.D2.sp6:130896.Seq 


674 


4837 


90.E2.sp6:130908.Seq 


675 


10470 


90.F2.sp6:130920.Seq


676 


16999 


90.G2.sp6:130932.Seq 


677 


22794 


90.H2.sp6:130944.Seq 


678 


11465 


90.A3.sp6:130861.Seq 


679 


7802 


90.B3.sp6:130873.Seq 


680 


22155 


90.C3.sp6:130885.Seq 


681 




90.D3.sp6:130897.Seq 


682 


13157 


90.E3.sp6:130909.Seq 


683 


17004 


90.F3.sp6:130921.Seq


684 


40314 


90.G3.sp6:130933.Seq 


685 


40044 


90.H3.sp6:130945.Seq 



Clone Name 

M00001549B:F06 

M00001549C:E06 

M00001550A:A03 

M00001550A:G01 

M00001551A:G06 

M00001551C:G09 

M00001552B:D04

M00001552D:A01 

M00001553A:H06

M00001553B:F12 

M00001553D:D10

M00001555A:B02

M00001555A:C01 

M00001555D:G10

M00001556A:C09 

M00001556A:F11

M00001556B:C08 

M00001556B:G02 

M00001557B:H10

M00001557D:D09 

M00001558A:H05

M00001558B:H11 

M00001559B:F01 

M00001560D:F10 

M00001563B:F06 

M00001566B:D11 

M00001571C:H06

M00001579D:C03 

M00001583D:A10

M00001590B:F03 

M00001594B:H04 

M00001597C:H02 

M00001597D:C05 

M00001598A:G03 

M00001601A:D08 

M00001607A:E11

M00001608A:B03 

M00001608B:E03 

M00001608D:A11 

M00001614C:F10 

M00001617C:E02 

M00001619C:F12 

M00001621C:C08 






SEQ ID NO: Cluster ID


Sequence Name 


Clone Name 


686 


13913 


90.A4.sp6:130862.Seq 


M00001623D:F10 


687 


3277 


90.B4.sp6:130874.Seq 


M00001624A:B06 


688 


4309 


90.C4.sp6:130886.Seq 


M00001624C:F01 


689 


5214 


90.D4.sp6:130898.Seq 


M00001630B:H09 


690 




90.E4.sp6:130910.Seq 


M00001632D:H07 


691 


39171 


90.F4.sp6:130922.Seq 


M00001644C:B07 


692 


19267 


90.G4.sp6:130934.Seq 


M00001645A:C12 


693 


4665 


90.H4.sp6:130946.Seq 


M00001648C:A01 


694 




90.A5.sp6:130863.Seq 


M00001651A:H01 


695 


23201 


90.B5.sp6:130875.Seq 


M00001657D:C03 


696 


76760 


90.C5.sp6:130887.Seq


M00001657D:F08 


697 


23218 


90.D5.sp6:130899.Seq 


M00001662C:A09 


698 


35702 


90.E5.sp6:130911.Seq 


M00001663A:E04 


699 


6468 


90.F5.sp6:130923.Seq


M00001669B:F02 


700 


14367 


90.G5.sp6:130935.Seq 


M00001670C:H02 


701 


7015 


90.H5.sp6:130947.Seq 


M00001673C:H02 


702 


8773 


90.A6.sp6:130864.Seq 


M00001675A:C09 


703 


11460 


90.B6.sp6:130876.Seq 


M00001676B:F05 


704 


7570 


90.D6.sp6:130900.Seq 


M00001677D:A07 


705 


4416 


90.E6.sp6:130912.Seq 


M00001678D:F12 


706 


6660 


90.F6.sp6:130924.Seq 


M00001679A:A06 


707 




90.H6.sp6:130948.Seq 


M00001679A:F06 


708 


26875 


90.A7.sp6:130865.Seq 


M00001679A:F10 


709 


6298 


90.B7.sp6:130877.Seq 


M00001679B:F01 


710 


78091 


90.C7.sp6:130889.Seq 


M00001679C:F01 


711 


10751 


90.D7.sp6:130901.Seq 


M00001679D:D03 


712 


10539 


90.F7.sp6:130925.Seq 


M00001680D:F08 


713 


17055 


90.G7.sp6:130937.Seq 


M00001682C:B12 


714 


5382 


90.A8.sp6:130866.Seq 


M00001688C:F09 


715 


4393 


90.B8.sp6:130878.Seq 


M00001693C:G01 


716 


67252 


90.C8.sp6:130890.Seq 


M00001716D:H05 


717 


40108 


90.D8.sp6:130902.Seq 


M00003741D:C09 


718 


11476 


90.E8.sp6:130914.Seq 


M00003747D:C05 


719 




90.F8.sp6:130926.Seq 


M00003754C:E09 


720 


697 


90.G8.sp6:130938.Seq 


M00003759B:B09 


721 




90.H8.sp6:130950.Seq 


M00003761D:A09 


722 


17076 


90.A9.sp6:130867.Seq 


M00003762C:B08 


723 


3108 


90.B9.sp6:130879.Seq 


M00003763A:F06 


724 


67907 


90.C9.sp6:130891.Seq


M00003774C:A03 


725 




90.D9.sp6:130903.Seq 


M00003784D:D12 


726 


11350 


90.F9.sp6:130927.Seq 


M00003826B:A06 


727 


7899 


90.H9.sp6:130951.Seq


M00003837D:A01 


728 


7798 


90.A10.sp6:130868.Seq 


M00003839A:D08 









SEQ ID NO: Cluster ID 


Sequence Name 


729 


6539 


90.B10.sp6:130880.Seq 


730 


6874 


90.C10.sp6:130892.Seq 


731 




90.D10.sp6:130904.Seq 


732 


13595 


90.E10.sp6:130916.Seq 


733 


5619 


90.F10.sp6:130928.Seq 


734 


10515 


90.G10.sp6:130940.Seq 


735 


4622 


90.H10.sp6:130952.Seq 


736 


3389 


90.A11.sp6:130869.Seq


737 


4718 


90.B11.sp6:130881.Seq


738 




90.C11.sp6:130893.Seq


739 


12977 


90.F11.sp6:130929.Seq


740 


8479 


90.G11.sp6:130941.Seq


741 




90.H11.sp6:130953.Seq


742 


7798 


90.A12.sp6:130870.Seq 


743 


5345 


90.B12.sp6:130882.Seq 


744 


31587 


90.C12.sp6:130894.Seq 


745 


14507 


90.D12.sp6:130906.Seq 


746 


13576 


90.F12.sp6:130930.Seq 


747 




90.G12.sp6:130942.Seq 


748 


9285 


90.H12.sp6:130954.Seq 


749 


39809 


99.A1.sp6:131230.Seq


750 


16317 


99.B1.sp6:131242.Seq


751 


8672 


99.C1.sp6:131254.Seq


752 


12532 


99.D1.sp6:131266.Seq


753 


3900 


99.E1.sp6:131278.Seq


754 


23255 


99.F1.sp6:131290.Seq


755 


24488 


99.C2.sp6:131255.Seq 


756 


40122 


99.D2.sp6:131267.Seq 


757 


23210 


99.E2.sp6:131279.Seq 


758 


23358 


99.F2.sp6:131291.Seq 


759 


3430 


99.A3.sp6:131232.Seq 


760 


2433 


99.B3.sp6:131244.Seq 


761 


9105 


99.C3.sp6:131256.Seq 


762 


6124 


99.D3.sp6:131268.Seq 


763 


40073 


99.E3.sp6:131280.Seq 


764 


37285 


99.H3.sp6:131316.Seq 


765 


17036 


99.A4.sp6:131233.Seq 


766 


3706 


99.C4.sp6:131257.Seq 


767 




99.D4.sp6:131269.Seq 


768 


15069 


99.F4.sp6:131293.Seq 


769 


9285 


99.H4.sp6:131317.Seq 


770 


6880 


99.A5.sp6:131234.Seq 


771 


5325 


99.C5.sp6:131258.Seq 






Clone Name 

M00003844C:B11

M00003846B:D06 

M00003851B:D08 

M00003851B:D10 

M00003853A:D04 

M00003853A:F12 

M00003856B:C02 

M00003857A:G10 

M00003857A:H03 

M00003867A:D10 

M00003875B:F04 

M00003875C:G07 

M00003875D:D11

M00003876D:E12 

M00003879B:C11 

M00003879B:D10 

M00003879D:A02 

M00003885C:A02 

M00003891C:H09 

M00003906C:E10 

M00003907D:A09 

M00003907D:H04 

M00003909D:C03 

M00003912B:D01 

M00003914C:F05 

M00003922A:E06 

M00003968B:F06 

M00003970C:B09 

M00003974D:E07 

M00003974D:H02 

M00003981A:E10 

M00003982C:C02 

M00003983A:A05 

M00004028D:A06 

M00004028D:C05 

M00004035C:A07 

M00004035D:B06 

M00004068B:A01 

M00004072A:C03 

M00004081C:D10 

M00004086D:G06 

M00004087D:A01 

M00004093D:B12 






SEQ ID NO: Cluster ID 


Sequence Name 


Clone Name 


772


7221 


99.D5.sp6:131270.Seq 


M00004105C:A04 


773 


4937 


99.E5.sp6:131282.Seq 


M00004108A:E06 


774 


6874 


99.F5.sp6:131294.Seq 


M00004111D:A08 


775 


13183 


99.G5.sp6:131306.Seq 


M00004114C:F11 


776 




99.H5.sp6:131318.Seq 


M00004121B:G01


777 


13272 


99.A6.sp6:131235.Seq 


M00004138B:H02 


778 


5257 


99.B6.sp6:131247.Seq 


M00004146C:C11 


779 


6455 


99.D6.sp6:131271.Seq


M00004157C:A09 


780 


5319 


99.E6.sp6:131283.Seq 


M00004169C:C12 


781 


4908 


99.F6.sp6:131295.Seq 


M00004171D:B03 


782 


11494 


99.G6.sp6:131307.Seq 


M00004172C:D08 


783 


11443 


99.A7.sp6:131236.Seq 


M00004185C:C03 


784 




99.B7.sp6:131248.Seq 


M00004191D:B11 


785 


8210 


99.C7.sp6:131260.Seq 


M00004197D:H01 


786 


14311 


99.D7.sp6:131272.Seq 


M00004203B:C12 


787 




99.E7.sp6:131284.Seq 


M00004205D:F06 


788 


12971 


99.B8.sp6:131249.Seq 


M00004223D:E04 


789 


6455 


99.C8.sp6:131261.Seq 


M00004229B:F08 


790 


7212 


99.D8.sp6:131273.Seq 


M00004230B:C07 


791 


4905 


99.H8.sp6:131321.Seq


M00004269D:D06 


792 


16914 


99.A9.sp6:131238.Seq 


M00004275C:C11 


793 


16921 


99.D9.sp6:131274.Seq 


M00004295D:F12 


794 


13046 


99.E9.sp6:131286.Seq 


M00004296C:H07 


795 


9457 


99.F9.sp6:131298.Seq 


M00004307C:A06 


796 


26295 


99.G9.sp6:131310.Seq 


M00004312A:G03 


797 


21847 


99.H9.sp6:131322.Seq 


M00004318C:D10 


798 




99.H10.sp6:131323.Seq 


M00004505D:F08 


799 




99.B11.sp6:131252.Seq


M00004692A:H08 


800 




99.D11.sp6:131276.Seq


M00005180C:G03 


801 


39304 


RTA00000118A.j.21.1.Seq_THC151859




802 


2428 


RTA00000123A.l.21.1.Seq_THC205063




803 


1058 


RTA00000126A.e.20.3.Seq_THC217534




804 


5097 


RTA00000134A.k.1.1.Seq_THC215869




805 


20212 


RTA00000134A.l.22.1.Seq_THC128232




806 


23255 


RTA00000177AF.e.14.3.Seq_THC228776




807 


2790 


RTA00000177AF.e.2.1.Seq_THC229461




808 


6420 


RTA00000177AF.f.10.3.Seq_THC226443




809 


4059 


RTA00000177AF.n.18.3.Seq_THC123051




810 




RTA00000179AF.j.13.1.Seq_THC105720




811 


9952 


RTA00000180AF.c.20.1.Seq_THC162284




812 


13238 


RTA00000181AF.m.4.1.Seq_THC140691




813 


9685 


RTA00000183AF.c.11.1.Seq_THC109544




814 




RTA00000183AF.c.24.1.Seq_THC125912
815 6420 RTA00000183AF.d.11.1.Seq_THC226443

816 6974 RTA00000183AF.d.9.1.Seq_THC223129

817 40044 RTA00000183AF.g.22.1.Seq_THC232899

818 RTA00000183AF.g.9.1.Seq_THC198280

819 5892 RTA00000184AF.d.11.1.Seq_THC161896

820 40044 RTA00000186AF.d.1.1.Seq_THC232899

821 RTA00000186AF.h.14.1.Seq_THC112525

822 19267 RTA00000186AF.l.12.1.Seq_THC178183

823 8773 RTA00000187AF.f.24.1.Seq_THC220002

824 7570 RTA00000187AF.g.24.1.Seq_THC168636

825 11476 RTA00000187AF.p.19.1.Seq_THC108482

826 RTA00000188AF.d.11.1.Seq_THC212094

827 17076 RTA00000188AF.d.21.1.Seq_THC208760

828 697 RTA00000188AF.d.6.1.Seq_THC178884

829 67907 RTA00000188AF.g.11.1.Seq_THC123222

830 5619 RTA00000188AF.l.9.1.Seq_THC167845

831 4718 RTA00000189AF.g.5.1.Seq_THC196102

832 39809 RTA00000190AF.e.3.1.Seq_THC150217

833 23255 RTA00000190AF.j.4.1.Seq_THC228776

834 40122 RTA00000190AF.n.23.1.Seq_THC109227

835 23210 RTA00000190AF.o.20.1.Seq_THC207240

836 23358 RTA00000190AF.o.21.1.Seq_THC207240

837 5693 RTA00000190AF.p.17.2.Seq_THC173318

838 2433 RTA00000191AF.a.15.2.Seq_THC79498

839 5257 RTA00000192AF.f.3.1.Seq_THC213833

840 16392 RTA00000192AF.l.1.1.Seq_THC202071

841 RTA00000193AF.c.21.1.Seq_THC222602

842 26295 RTA00000193AF.i.24.2.Seq_THC197345

843 RTA00000193AF.m.5.1.Seq_THC173318

844 RTA00000193AF.n.15.1.Seq_THC215687


Nearest 
Neighbor 
(BlastN vs. 
Genbank) 


Nearest 
Neighbor 
(BlastX vs. 
Non-
Redundant
Proteins)


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 


1 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


2 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


3 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


4 


<NONE> 


<NONE> 


<NONE> 


BAR3_CHITE 


BALBIANI RING
PROTEIN 3
PRECURSOR>PIR2:S08167
Balbiani ring 3
protein - midge
(Chironomus
tentans)>GP:CTBR3_1
C.tentans Balbiani ring
3 (BR3) gene


1 


5 


<NONE> 


<NONE> 


<NONE> 


CYAA_PODAN


ADENYLATE
CYCLASE (EC
4.6.1.1) (ATP
PYROPHOSPHATE-LYASE)
(ADENYLYL
CYCLASE)>PIR2:JC4747
adenylate cyclase
(EC 4.6.1.1) -
Podospora
anserina>GP:PANADCY_1
Podospora
anserina adenyl cyclase
gene, exons 1-4


1 


6 


<NONE> 


<NONE> 


<NONE> 


VP03_HSVSA


PROBABLE
MEMBRANE
ANTIGEN 3
(TEGUMENT
PROTEIN)>PIR2:C36806
hypothetical
protein ORF3 -
saimiriine herpesvirus 1
(strain
11)>GP:HSGEND_3
Herpesvirus saimiri
complete genome
DNA; ORF 03;
similarity to ORF 75
and EBV BNRF1


0.97 
Nearest
Neighbor 
(BlastN vs. 
Genbank) 


Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 


7 


<NONE> 


<NONE> 


<NONE> 


ATFCA2_18


Arabidopsis thaliana 
DNA chromosome 4, 
ESSA I contig 
fragment No. 2;
Hydroxyproline-rich 
glycoprotein homolog; 
Similarity to 
hydroxyproline-rich 
glycoprotein precursor- 
common tobacco 


0.93 


8 


<NONE> 


<NONE> 


<NONE> 


DHAL_ASPN
G 


ALDEHYDE 
DEHYDROGENASE 
(EC 1.2.1.3) 
(ALDDH)>GP:ASNA 
LDAA_1 Aspergillus 
niger aldehyde 
dehydrogenase (aldA) 
gene, complete cds 


0.9 


9 


<NONE> 


<NONE> 


<NONE> 


NCU50264_1


Neurospora crassa two- 
component histidine 
kinase (nik-1) gene, 5' 
region and partial cds 


0.86 


10 


<NONE> 


<NONE> 


<NONE> 


NEUG_BOVI
N 


NEUROGRANIN
(P17) (B-50
IMMUNOREACTIVE
C-KINASE
SUBSTRATE)
(BICKS)
(FRAGMENT)>PIR2:
A39034 neurogranin - 
bovine (fragment) 


0.82 


11 


<NONE> 


<NONE> 


<NONE> 


HUMBYSTIN 
_1


Homo sapiens bystin 
mRNA, complete cds


0.81 


12 


<NONE> 


<NONE> 


<NONE> 


BTBMP1_1 


Bos taurus BMP1 gene, 
partial sequence; Bone 
morphogenetic protein 
1 


0.69 


13 


<NONE> 


<NONE> 


<NONE> 


TCCYSPROT 
_1 


T;congolense mRNA
for (prepro) cysteine 
proteinase 


0.56 


14 


<NONE> 


<NONE> 


<NONE> 


P60_LISIV 


PROTEIN P60 
PRECURSOR 
(INVASION- 
ASSOCIATED 
PROTEIN)>GP:LISIA 
PRELB_1 Listeria
ivanovii extracellular
protein homologue
(iap) gene, complete
cds


0.15


15 


<NONE> 


<NONE> 


<NONE> 


HEX_ADE31


HEXON PROTEIN
(LATE PROTEIN 2)
(FRAGMENT)>PIR2:
S37217 hexon protein -
human adenovirus 31 
(fragment)>GP:HSAT3 
1H_1 H;sapiens 
adenovirus type 31 
hexon gene; Hexon 
protein; Internal 
fragment containing 
hypervariable regions 


0.15 


16 


<NONE> 


<NONE> 


<NONE> 


HSU77493_1


Human Notch2 mRNA,
partial cds;
Transmembrane
protein; hN 


0.13 


17 


<NONE> 


<NONE> 


<NONE> 


CYB_PARTE 


CYTOCHROME B 
(EC 

1.10.2.2)>PIR2:S07743 
cytochrome b - 
Paramecium tetraurelia 
mitochondrion 
(SGC6)>GP:MIPAGE 
N_19 Paramecium
aurelia mitochondrial 
complete genome; 
Apocytochrome b (AA 
1-391) 


0.078 


18 


<NONE> 


<NONE> 


<NONE> 


HUMERB27_ 
1 


Human c-erbB-2 gene, 
exon 7; C-erb-2 protein 


0.054 


19 


<NONE> 


<NONE> 


<NONE> 


DMTRXIII_2 


D;melanogaster DNA
for trxI and trxII genes;
Trithorax protein trxI;
Trithorax;
putative>GP:DMTTHO
RAX_2
D;melanogaster DNA
for (putative) trithorax 
protein; Predicted 
trithorax protein 


0.047 


20 


<NONE> 


<NONE> 


<NONE> 


CELB0281_5 


Caenorhabditis elegans 
cosmid B0281; Similar 
to reverse transcriptases 


0.043 


21 


<NONE> 


<NONE> 


<NONE> 


MOTY_VIBP
A 


SODIUM-TYPE 
FLAGELLAR 
PROTEIN MOTY 
PRECURSOR>GP:VP 
U06949_4 Vibrio 
parahaemolyticus 
BB22 RNase T (rnt) 
gene and flagellar 
motor component 
(motY) gene, complete 
cds 


0.041 


22 


<NONE> 


<NONE> 


<NONE> 


A56263 


beta-galactosidase (EC 
3.2.1.23) isozyme 12- 
Arthrobacter sp. (strain 
B7)>GP:ASU17417_1 
Arthrobacter sp; beta- 
galactosidase gene, 
complete cds 


0.04 


23 


<NONE> 


<NONE> 


<NONE> 


GSA_PSEAE 


GLUTAMATE-1-
SEMIALDEHYDE
2,1-AMINOMUTASE
(EC 5.4.3.8) (GSA)
(GLUTAMATE-1-
SEMIALDEHYDE
AMINOTRANSFERA
SE) (GSA-
AT)>PIR2:S57898
glutamate-1-
semialdehyde 2,1-
aminomutase - 
Pseudomonas 
aeruginosa>GP:PAHE 
ML_1 P;aeruginosa 
hemL gene; Glutamate 
1-sem 


0.038 


24 


<NONE> 


<NONE> 


<NONE> 


S16323


hypothetical protein - 
Arabidopsis 
thaliana>GP:ATHB1_1
A;thaliana homeobox 
gene Athb-1 mRNA; 
Open reading frame 


0.035 


25 






<NONE>


IRS1_RAT


INSULIN RECEPTOR
SUBSTRATE- 
1>PIR2:S 16948 
hypothetical protein 
IRS-1 - 

rat>GP:RNIRSHRM_1
R;Norvegicus IRS-1 
mRNA for insulin- 
receptor; During insulin 
stimulation, undergoes 
tyrosine 

phosphorylation and 
binds 

phosphatidyl inositol 3- 
kinase 


0.027 


26 


<NONE> 


<NONE> 


<NONE> 


CEM02G9_2 


Caenorhabditis elegans 
cosmid M02G9; 
M02G9;1; Similar to
keratin like protein;
cDNA EST
yk308g11;5 comes
from this gene; cDNA
EST yk208e11;5 comes
from this gene; cDNA
EST yk208e11;3 comes


0.0088 


27 


<NONE> 


<NONE> 


<NONE> 


S75490_3 


competence region: 
iga=IgA protease, 
comA=transformation 
competence [Neisseria 
gonorrhoeae, MS11, 
Genomic, 3 genes, 
2664 nt] 


0.0041 


28 


<NONE> 


<NONE> 


<NONE> 


EXTN_TOBA
C 


EXTENSIN
PRECURSOR (CELL
WALL
HYDROXYPROLINE-
RICH
GLYCOPROTEIN)>PI
R2:S06733
hydroxyproline-rich
glycoprotein precursor
- common
tobacco>GP:NTEXT_1
Tobacco HRGPnt3
gene for extensin;
Extensin (AA 1-620)


0.0025 


29 


<NONE> 


<NONE> 


<NONE> 


HPCEGS_1


Hepatitis C virus 
complete genome 
sequence; Polyprotein 


0.0014 


30 


<NONE> 


<NONE> 


<NONE> 


HHVBC_4 


Human hepatitis virus 
(genotype C, HMA) 
preS1, preS2, S, C, X,
antigens, core antigen, 
X protein and 
polymerase 


0.00093 


31 








ljc t Tr,rnD/i 
1 


Homo sapiens mRNA 
for latent transforming 
growth factor-beta 
binding protein-4; 
Latent TGF-beta 
binding protein-4 


(\ (\C\V\fi\ 
U.UUUOI 


32 










transposase - 
Synechocystis sp. (PCC 
6803)>GP:D90909_10
8 Synechocystis sp; 
PCC6803 complete 
genome, 11/27, 
1311235-1430418;
Transposase; 
ORF_ID:slr2062 


U.UUUO 1 


33 


<NONE> 




<NONE>


GRN_MOUS
E


GRANULINS 
PRECURSOR 
(ACROGRANIN)>GP: 
MUSAP_1 Mouse gene 
for acrogranin 
precursor, complete cds 


0.00022


34 


<NONE> 


<NONE> 


<NONE> 


CA21_MOUS
E 


PROCOLLAGEN
ALPHA 2(I) CHAIN
PRECURSOR>PIR2:A
43291 collagen alpha
2(I) chain precursor -
mouse>GP:MMCOL1
A2_1 Mouse COL1A2
mRNA for pro-alpha-
2(I) collagen


0.00016 


35 


<NONE> 


<NONE> 


<NONE> 


MMMHC29N 
12 


Mus musculus major 
histocompatibility 
locus class III 
region:butyrophilin-like 
protein gene, partial 
cds; Notch4, PBX2, 
RAGE, lysophatidic 
acid acyl transferase-
alpha, palmitoyl-


8.00E-05


36 


<NONE> 


<NONE> 


<NONE> 


NFH_RAT 


NEUROFILAMENT 
TRIPLET H PROTEIN 
(200 KD 

NEUROFILAMENT 
PROTEIN) (NF-H) 
(FRAGMENT) 


2.40E-05 


37 


<NONE> 


<NONE> 


<NONE> 


HUMVWFM 
1 


Human von Willebrand 
factor mRNA, 3' end; 
Von Willebrand factor 
prepropeptide 


1.70E-05 


38 


<NONE> 


<NONE> 


<NONE> 


CGHU2E 


collagen alpha 2(XI) 
chain - human 
(fragment) 


2.00E-06 


39 


<NONE> 


<NONE> 


<NONE> 


A61183 


hypothetical protein 
(sdsB region) - 
Pseudomonas sp. 


4.90E-08 


40 


<NONE> 


<NONE> 


<NONE> 


YM8I_YEAS
T 


HYPOTHETICAL 71.1
KD PROTEIN IN 
DSK2-CAT8 
INTERGENIC 
REGION>PIR2:S5458 
5 hypothetical protein 
YMR278w - yeast 
(Saccharomyces 
cerevisiae)>GP:SC802 
1X_4 S;cerevisiae 
chromosome XIII 
cosmid 8021; 
Unknown; YM8021;04, 
unknown, len: 622, 
CAI:0;16, 




41 


<NONE> 


<NONE> 


<NONE> 


MTCY210_31 


Mycobacterium 
tuberculosis cosmid 
Y210; Unknown; 
MTCY210;31, 
unknown, len: 299 aa, 
slight similarity to 
carboxykinases 


3.10E-10 


42 


<NONE> 


<NONE> 


<NONE> 


CEC01G10_5 


Caenorhabditis elegans 
cosmid C01G10, 
complete sequence; 
C01G10;8;CDNA EST 
v^dvisixoix. comes 
from this 

gene>GP:CEC01G10_ 
5 Caenorhabditis 
elegans cosmid 
C01G10; C01G10;8; 
CDNA EST 
^.cfwoC'f Dt\ comes 
from this gene 


2.30E-12 


43 


<NONE> 


<NONE> 


<NONE> 


HSU15779_1


Human p70 (ST5) 
mRNA, alternatively
spliced, complete cds;
Differentially 
expressed; alternatively 
spliced 


9.50E-14 


44 


<NONE> 


<NONE> 


<NONE> 


MTCY210_31


Mycobacterium 
tuberculosis cosmid 
Y210; Unknown; 
MTCY210;31,
unknown, len: 299 aa, 
slight similarity to 
carboxykinases 


1.70E-17 


45 


U61403 


Dictyostelium 
discoideum PrlA 
(prlA) mRNA, 
partial cds. 


1 


U93472_1


Danio rerio PPARB 
gene, partial cds; 
Nuclear receptor C 
domain 


0.95 


46 


Z92832 


Caenorhabditis 
elegans DNA *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
F31D4; HTGS 
phase 1.


1 


U93472_1


Danio rerio PPARB 
gene, partial cds; 
Nuclear receptor C 
domain 


0.94 


47 


L36557 


Oryza sativa 
(clone pRG3) 
repetitive 
element. 


1 


HSU61262_1 


Human neogenin 
mRNA, complete cds 


0.89 


48 


AF005898 


Homo sapiens 
Na,K-ATPase
beta-3 subunit 
pseudogene, 
complete 
sequence. 


1 


LRP1_CHICK 


LOW-DENSITY
LIPOPROTEIN
RECEPTOR-
RELATED PROTEIN 
1 PRECURSOR (LRP) 
(ALPHA-2- 
MACROGLOBULIN 
RECEPTOR) 
(A2MR)>PIR2:A53102 
LDL receptor-related 
protein / alpha-2- 
macroglobulin receptor 
precursor - 

chicken>GP:GGLRPA
2MR_1 G;gallus
mRNA for LRP/alp


0.85 




49

U 187*0 


Saccharomyces 
cerevisiae 
chromosome V 
cosmids 9669, 
8334,8199, and 
lambda clone 
1160. 


1 


NKC1_SQUA
C 


BUMETANIDE-
SENSITIVE SODIUM-
(POTASSIUM)-
CHLORIDE
COTRANSPORTER 2
(NA-K-CL
SYMPORTER)>PIR2:
A53491 bumetanide-
sensitive Na-K-Cl
cotransporter - spiny
dogfish>GP:SANKCC 
1_1 Squalus acanthias 
bumetanide-sensitive 
Na-K-Cl cotransport 
protein (NKCC 


0.73 


50 


AC002523 


Homo sapiens; 
HTGS phase 1, 
54 unordered 
pieces. 


1 


BXEN_CLOB
O


BOTULINUM 
NEUROTOXIN TYPE 
E, NONTOXIC 
COMPONENT>GP:C 
LOENT120_1
C;botulinum gene for 
nontoxic component of 
progenitor toxin, 
complete cds 


0.71 


51 


AC002345 


*** 

SEQUENCING 
IN PROGRESS 
*** Genomic 
Sequence from
Human 17;
HTGS phase 1,
10 unordered
pieces. 


1 


P3K2_DICDI


PHOSPHATIDYLINO 
SITOL 3-KINASE 2 
(EC 2.7.1.137) (PI3- 
KINASE) (PTDINS-3- 

(PI3K)>GP:DDU23477 
_1 Dictyostelium 
discoideum 
phosphatidylinositol- 
4,5-diphosphate 3- 
kinase (PIK2) mRNA, 
complete cds 


0.58 


52




Human mRNA 
for cripto protein. 


1


i:>5oj 1 


noradrenaline 
transporter - 
bovine>GP:BTU09198 
_1 Bos taurus
noradrenaline 
transporter mRNA, 
complete cds 


0.55 


53 


U23516 


Caenorhabditis 
elegans cosmid 

Dut 1 O. 


1 


169024 


MHC sex-limited 
protein - mouse 
(fragment)>GP:MUSM
HC4AD_1 Mouse class
III H2-Slp sex-limited 
protein gene, exons 1, 2 
and 3; MHC sex- 
limited protein 


0.47 


54 


AB006698 


Arabidopsis 
thaliana genomic 
DNA, 

chromosome 5, 
P1 clone:
MCL19. 


1 


S81293_1


L1 {insertion sequence,
provirus} [human
papillomavirus type 6b
HPV6b, KP4, Genomic 
Mutant, 121 nt]; 
Authors note this 
reading frame results 
from a 454 bp deletion 
and resulting 


0.25 


55 


K03458 


Human 

immunodeficienc 
y virus type 1,
isolate Zaire 6, 
vif, tat, rev, env, 
nef genes and 3' 
LTR. 


1 


S13383 


hydroxyproline-rich 
glycoprotein - sorghum 


0.24 


56 


B26794 


T1016TR TAMU 
Arabidopsis 
thaliana genomic 
clone T1016.


1 


RK34_PORP
U 


CHLOROPLAST 50S
RIBOSOMAL
PROTEIN
L34>PIR2:S73111
ribosomal protein L34 -
red alga (Porphyra
purpurea)
chloroplast>GP:PPU38
804_4 Porphyra 
purpurea chloroplast 
genome, complete 
sequence; 50S 
ribosomal protein L34 


0.021 


57 


Z98950 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone
507115; HTGS 
phase 1. 


1 


D41132 


collagen-related protein 
4 - Hydra 
magnipapillata 
(fragment)>PIR2:S219 
32 mini-collagen - 
Hydra 

sp.>GP:HSNCOL4_1
Hydra N-COL4
mRNA for mini- 
collagen; No start 
codon 


0.02 


58 


U57057 


Human WD 
protein IR10 
mRNA, complete 
cds. 


1 


DMU15602_1 


Drosophila 
melanogaster (zeste- 
white 4) mRNA, 
complete cds; Similar 
to C; elegans B0464;4 
gene product, Swiss- 
Prot Accession Number 
Q03562 


0.019 


59 


U57057 


Human WD 
protein IR10 
mRNA, complete 
cds. 


1


CR2_MOUSE


COMPLEMENT
RECEPTOR TYPE 2 
PRECURSOR (CR2) 
(COMPLEMENT C3D 
RECEPTOR)>PIR2:A4 
3526 complement 
C3d/Epstein-Barr virus 
receptor 2 precursor - 
mouse>GP:MUSCR2A 
A_1 Murine
complement receptor 
type 2 (CR2) mRNA, 
complete cds; 
Complement receptor 
type 


0.0074


60 


B65337 


CIT-HSP- 
2021H21.TF 
CIT-HSP Homo
sapiens genomic 
clone 2021H21. 


1 


A38096 


perlecan precursor - 
human>GP:HUMHSP 
G2B_1 Human heparan
sulfate proteoglycan 
(HSPG2) mRNA, 
complete cds 


0.0051 


61 


U84722 


Human vascular 
endothelial 
cadherin mRNA, 
complete cds. 


1 


HSTAFII13J 


H;sapiens mRNA for 
TAFII135; Subunit of
RNA polymerase II
transcription factor 
TFIID 


0.0012 


62 


L41493 


Avian rotavirus 
(strain turkey 1) 
genomic segment 
4 outer capsid 
protein (VP8*) 
gene. 


1 


Y328_MYCP
N 


HYPOTHETICAL 
PROTEIN MG328 
HOMOLOG>PIR2:S73 
693 MG328 homolog 
P01_orf1033 -
Mycoplasma
pneumoniae (ATCC
29342) 

(SGC3)>GP:MPAE000 
035_2 Mycoplasma 
pneumoniae from bases 
442306 to 452472 
(section 35 of 63) of 
the complete genome; 
MG328 homolog, 


0.00015 


63 


D63139 


Aeromonas sp. 
gene for 
chitinase, 
complete and 
partial cds. 


1 


MTCY16B7_3 


Mycobacterium 
tuberculosis cosmid 
SCY16B7; Unknown; 
MTCY16B7;03, 
initiation factor, len: 
900, similar at C- 
terminal half to eg 
IF2_BACSU P17889
initiation factor if-2
(716 aa), fasta


6.30E-05 


64 


J04974 


Human alpha-2 
type XI collagen 
mRNA 
(COL11A2). 


1 


GDF6_BOVI
N 


GROWTH/DIFFEREN 
TIATION FACTOR 
GDF-6 PRECURSOR 
(CARTILAGE- 
DERIVED 
MORPHOGENETIC 
PROTEIN 2) (CDMP- 
2) 

(FRAGMENT)>PIR2: 
B55452 cartilage- 
derived morphogenetic 
protein 2 precursor - 
bovine 

(fragment)>GP:BTU13 
661_1 Bos taurus
cartilage-derived morp 


1.00E-05 


65 


AC002394 


Homo sapiens
Chromosome 16 
BAC clone 
CIT987-SKA- 
211C6 -complete
genomic 
sequence, 
complete 
sequence. 


1


6 


Caenorhabditis elegans
cosmid C14F11;
Similar to aspartate 
aminotransferase; 
coded for by C; elegans 
cDNA CEMSF95FB; 
coded for by C; elegans 
cDNA yk41e4;3; coded 
for by C; elegans 




66 


AB002312 


Human mRNA 
for KIAA0314
gene, partial cds. 


1 


NAT1_YEAS
T 


N-TERMINAL
ACETYLTRANSFER
ASE 1 (EC 2.3.1.88)
(AMINO-TERMINAL,
ALPHA-AMINO,
ACETYLTRANSFER
ASE 1)


1.00E-09 


67 


AC003085 


Human BAC 
clone RG094H21 
from 7q21-q22, 
complete 
sequence. 


1 


DP19_CAEEL


DPY-19 

PROTEIN>PIR2:S446 
29 f22b7.10 protein -
Caenorhabditis 
elegans>GP:CELF22B 
7_9 Caenorhabditis
elegans (Bristol N2) 
cosmid F22B7; 
Putative 


4.20E-11 


68 


X55026 


P.anserina 
complete 
mitochondrial 
genome. 


1 


NAT1_YEAS
T 


N-TERMINAL 
ACETYLTRANSFER 
ASE 1 (EC 2.3.1.88)
(AMINO-TERMINAL,
ALPHA-AMINO,
ACETYLTRANSFER
ASE 1)


8.40E-12


69 


Z95399 


Caenorhabditis 
elegans DNA *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
Y39B6; HTGS 
phase 1. 


1 


CER06B9_1


Caenorhabditis elegans 
cosmid R06B9, 
complete sequence; 
R06B9;b; Protein 
predicted using 
Genefinder; 
preliminary prediction 


1.50E-24 


70 


AC002339 


Arabidopsis 
thaliana
chromosome II
BAC T11A07
genomic 
sequence, 
complete 
sequence. 


0.99 


POLG_BVDV
S 


GENOME
POLYPROTEIN>PIR1
:A44217 genome
polyprotein - bovine
viral diarrhea virus
(strain SD-
1)>GP:BVDPOLYPR
O_1 Bovine viral
diarrhea virus 
polyprotein RNA, 
complete cds; Putative 


1 


71 


Y08559 


B.subtilis urease 
operon and
downstream 
DNA. 


0.99 


LRP_CAEEL


LOW-DENSITY
LIPOPROTEIN
RECEPTOR-
RELATED PROTEIN 
PRECURSOR 
(LRP)>PIR2:A47437 
LDL-receptor-related 
protein - 
Caenorhabditis 
elegans>GP:CEF29D1
1_2 Caenorhabditis 
elegans cosmid 
F29D11, complete 
sequence; F29D11;1; 
Protein predicted using 
Genefi


1


72


U67548 


Methanococcus 
jannaschii from 
bases 986219 to 
996377 (section 
90 of 150) of the 
complete 
genome. 


0.99 


YB60_YEAS
T 


HYPOTHETICAL 16.3 
KD PROTEIN IN 
DUR1,2-NGR1 
INTERGENIC 
REGION>PIR2:S4608 
4 probable membrane 
protein YBR210w -
yeast (Saccharomyces 
cerevisiae)>GP:SCYB 
R210W_1 S;cerevisiae 
chromosome II reading 
frame ORF YBR210w 


1 


73 


U51645 


Plasmodium 
falciparum 
cytidine 
triphosphate 
synthetase gene, 
complete cds. 


0.99 


HPSVRPL_1


Sin Nombre virus (NM 
H10) RNA L segment
encoding RNA 
polymerase (L protein), 
complete cds; Viral 
RNA polymerase (L 
protein); 

Putative>GP:HPSVRP
LA_1 Sin Nombre
virus (NM R11) RNA
L segment encoding
RNA polymerase (L
protein), complete cds;
Vir


0.99 


74 


Z49889 


Caenorhabditis 
elegans cosmid 
T06H11, 
complete 
sequence. 


0.99 


MUSHDPRO 
B_1


Mouse alternatively 
spliced HD protein 
mRNA, complete cds 


0.021 


75 


Z69374 


Human DNA 
sequence from 
cosmid L174G8,
Huntington's 
Disease Region, 
chromosome 
4p16.3 contains a
pair of ESTs. 


0.99 


NCPR_YEAS
T 


NADPH- 

CYTOCHROME P450 
REDUCTASE (EC 
1.6.2.4) (CPR) 


0.017


76




S.cerevisiae
chromosome II 
reading frame 
ORF YBL086c. 


0.99


CYPA_CAEE
L


PEPTIDYL-PROLYL
CIS-TRANS
ISOMERASE 10 (EC
5.2.1.8) (PPIASE) 
(ROTAMASE) 
(CYCLOPHILIN- 
10)>GP:CELB0252_4 
Caenorhabditis elegans 
cosmid B0252; Similar 
to peptidyl-prolyl cis- 
trans isomerase 
(PPIASE) 

(CYCLOPHILIN)>GP: 
Caenorhabditis e! 




77 


L35330 


Rattus norvegicus 
glutathione S- 

transferase YhT 

11 Clow I \JJ 

subunit gene, 
complete cds. 


0.99 


CELR148_1


Caenorhabditis elegans 
cosmid R148; Contains
similarity to Drosophila
DNA-binding protein
K10 (NID:g8148);
coded for by C; elegans
cDNA yk118e11;5;
coded for by C; elegans 
cDNA 


0.0032 


78 


Y00324 


Chicken 

vitellogenin gene 
3' flanking 
region. 


0.99 


A56922 


transcription factor shn 
- fruit fly (Drosophila 
melanogaster) 


0.0023 


79 


M32659 


D.melanogaster 
Shab11 protein
mRNA, complete 
cds. 


0.99 


OMU25146_1


Oncorhynchus mykiss 
recombination 
activating protein 2 
gene, partial cds 


0.0017 


80 


Z69880 


H. sapiens 
SERCA3 gene 
(partial). 


0.99 


M84D_DRO
ME 


MALE SPECIFIC 
SPERM PROTEIN 
MST84DD>PIR2:S257 
75 testis-specific 
protein Mst84Dd - fruit 
fly (Drosophila 
melanogaster)>GP:DM 
MST84D4 
D;melanogaster 
Mst84Da, Mst84Db, 
Mst84Dc and Mst84Dd 
genes for put; sperm 
protein 


0.0011


81


M99166


Escherichia coli 
Trp repressor 
binding protein 
(wrbA) gene, 
complete cds. 


0.99 


MTU88962_1


Mycobacterium 
tuberculosis unknown 
protein gene, partial cds 


6.50E-07 


82 


X99257 


R.norvegicus 
mRNA for lamin 
C2. 


0.99 


MIU68729_1


Meloidogyne incognita 
cuticle preprocollagen 
(col-2) mRNA, 
complete cds; Putative 


1.60E-09 


83 


AC002432 


Human BAC 
clone RG317G18 
from 7q31, 
complete 
sequence. 


0.98 


1FMDC 


Foot and mouth disease 
virus type c-s8c1, chain
C - foot and mouth
disease virus type c-
s8c1 expressed in
hamster kidney cells 


0.14 


84 


Z34799 


Caenorhabditis 
elegans cosmid 
F34D10, 
complete 
sequence. 


0.98 


MMU57368_1


Mus musculus EGF 
repeat transmembrane 
protein mRNA, 
complete cds; Notch 
like repeats; notch 2 


0.0028 


85 


B15207


344E15.TV 
CIT978SKA1 
Homo sapiens 
genomic clone A- 
344E15. 


0.98 


POLG_HCVJ
6 


GENOME 
POLYPROTEIN 
(CONTAINS: CAPSID 
PROTEIN C (CORE 
PROTEIN); MATRIX 
PROTEIN 
(ENVELOPE 
PROTEIN M); MAJOR 
ENVELOPE 
PROTEIN E; 
NONSTRUCTURAL 
PROTEINS NS1, NS2,
NS4A AND NS4B;
HELICASE (NS3);
RNA-DIRECTED 
RNA POLYMERASE 
(EC 2.7.7.48) 
(NS5))>PI 


0.00083


86


AC002412 


*** 

SEQUENCING
IN PROGRESS
*** Human
Chromosome X;
HTGS phase 1, 2
unordered pieces.


0.98 


KDG1_ARAT 
H 


DIACYLGLYCEROL 
KINASE 1 (EC 
2.7.1.107) 
(DIGLYCERIDE 
KINASE) (DGK1) 
(DAG KINASE
1)>PIR2:S71467
diacylglycerol kinase 
(EC 2.7.1.107) 
ATDGK1 - 
Arabidopsis 
thaliana>GP:ATHATD 
GK1_1 Arabidopsis 
thaliana mRNA for 
diacylglycerol kinase, 
complete c


0.00024 


87 


X57010 


Human COL2A1 
gene for collagen 
II alpha 1 chain,
exons E2-E15. 


0.98 


D80005_1


Human mRNA for 
KIAA0183 gene,
partial cds 


5.90E-10 


88 


M83093 


Neurospora
crassa cAMP- 
dependent protein 
kinase (cot-1) 
gene, complete 
cds. 


0.98 


YA53_SCHP
O 


HYPOTHETICAL 24.2
KD PROTEIN
C13A11.03 IN
CHROMOSOME
I>GP:SPAC13A11_3
S;pombe chromosome I
cosmid c13A11;
Unknown; 
SPAC13A11;03, 
unknown, len: 210 


3.00E-22


89 


U96271 


Helicobacter 
pylori heat shock 
protein 70 
(hsp70) gene, 
complete cds. 


0.97 


SLMEN6_1


S;latifolia mRNA for 
Men-6 

protein>GP:SLMEN6_ 
1 S;latifolia mRNA for 
Men-6 protein 


0.43 


90 


U49944 


Caenorhabditis 
elegans cosmid 
C39E6. 


0.97 


RON_HUMA
N 


MACROPHAGE 
STIMULATING 
PROTEIN RECEPTOR 
PRECURSOR (EC 
2.7.1.112)>PIR2:I3818
5 protein-tyrosine 
kinase (EC 2.7.1.112), 
receptor type ron - 
human>GP:HSRON_1
H;sapiens RON mRNA 
for tyrosine kinase; 
Putative 


0.034


91


Y09255 


B.cereus dnaI
gene, partial.


0.97 


CELT05C1_5


Caenorhabditis elegans
cosmid T05C1; Coded
for by C; elegans
cDNA yk30f6;3; coded
for by C; elegans
cDNA yk34f10;3


0.00043 


92 


AC002413 


*** 

SEQUENCING 
IN PROGRESS 
*** Human 
Chromosome X; 
HTGS phase 1, 2
unordered pieces. 


0.96 


CELC44E4_5 


Caenorhabditis elegans 
cosmid C44E4; Weak 
similarity to the 
drosophila hyperplastic 
disc protein 
(GB:L14644); coded
for by C; elegans 
cDNA yk49h6;5; coded 
for by C; elegans 
cDNA 


1 


93 


U41625 


Caenorhabditis 
elegans cosmid 
K03A1. 


0.96 


HMGC_HUM
AN


HIGH MOBILITY
GROUP PROTEIN
HMGI-
C>PIR2:JC2232 high
mobility group I-C 
phosphoprotein - 
human>GP:HSHMGIC 
G5_1 Human high-
mobility group 
phosphoprotein isoform 
I-C (HMGIC) gene, 
exon 

5>GP:HSHMGICP_1
H;sapiens mRNA for 
HMGI-C 

protein>GP:HSHMGIC 


1 


94 


Z82202 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
34P24; HTGS 
phase 1.


0.96 


YTH3_CAEE
L 


HYPOTHETICAL 75.5 
KD PROTEIN 
C14A4.3 IN 
CHROMOSOME 
II>GP:CEC14A4_3 
Caenorhabditis elegans 
cosmid C14A4, 
complete sequence; 
C14A4;3; Weak 
similarity with a B; 
Flavum translocation 
protein (Swiss Prot 
accession number 
P38376) 


0.73


95


AL008734 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
324M8; HTGS 
phase 1.


0.96 


S25299 


extensin precursor 
(clone Tom L-4) -
tomato>GP:TOMEXT 
ENB_1 L;esculentum
extensin (class II) gene, 
complete cds 


0.0004 


96 


L15388 


Human G 
protein-coupled 
receptor kinase 
(GRK5) mRNA, 
complete cds. 


0.96 


HUMCOL7A1 
X_1


Homo sapiens (clones: 
CW52-2, CW27-6, 
CW15-2, CW26-5, 11- 
67) collagen type VII 
intergenic region and
(COL7A1) gene,
complete cds 


4.60E-06 


97 


X97384 


A.thaliana atran3 
gene. 


0.95 


<NONE> 


<NONE> 


<NONE> 


98 


M62505 


Human C5a 
anaphylatoxin 
receptor mRNA, 
complete cds. 


0.95 


RIPB_BRYDI 


RIBOSOME- 
INACTIVATING 
PROTEIN BRYODIN 
(RRNA N- 

GLYCOSIDASE) (EC
3.2.2.22)
(FRAGMENT)>PIR2:
S16491 rRNA N-
glycosidase (EC
3.2.2.22) bryodin - red
bryony (fragment) 


0.83 


99 


D28778 


Cucumber mosaic 
virus RNA 1 for
1a, complete
sequence. 


0.95 


POLS_RUBV 
M 


STRUCTURAL
POLYPROTEIN
(CONTAINS:
NUCLEOCAPSID
PROTEIN C;
MEMBRANE
GLYCOPROTEINS E1
AND
E2)>PIR1:GNWVR3
structural polyprotein - 
rubella virus (strain 
M33)>GP:TORUB24S 
_1 Rubella virus 24 S 
subgenomic mRNA for 
structural proteins E1,
E2 and C; 


0.00037 


100 


AF016202


Homo sapiens
immunoglobulin
heavy chain
CDR3 gene,
partial cds.


0.93


HSU79716_1


Human reelin (RELN)
mRNA, complete cds


1


101 


Z68303 


Caenorhabditis
elegans cosmid
ZK809, complete
sequence. 


0.93 


HS5HT4SAR_ 
1 


H;sapiens mRNA for
serotonin 4SA receptor
(5-HT4SA-R)


0.87 


102 


X03049 


E. coli DNA 
sequence 5' to
origin of 
replication oriC. 


0.93 


S37594 


mucin - human 
(fragment) 


0.0019 


103 


M32659 


D.melanogaster 
Shab11 protein
mRNA, complete 
cds. 


0.93 


S38480 


nonstructural protein - 
rubella 

virus>GP:RVM33NP_ 
1 Rubella virus M33 
RNA for a
nonstructural protein; 
Nonstructural protein 
genes 


2.30E-06 


104 


D88687 


Human mRNA
for KM-102-
derived
reductase-like 
factor, complete 
cds. 


0.93 


BAT3_HUMA
N


LARGE PROLINE-
RICH PROTEIN BAT3

(HLA-B- 
ASSOCIATED 
TRANSCRIPT 
3)>PIR2:A35098 MHC 
class III

histocompatibility 
antigen HLA-B- 
associated transcript 3 - 
human>GP:HUMBAT 
3A_1 Human HLA-B-
associated transcript 3 
(BAT3) mRNA, 
complete 

cds>GP:HUMBAT3 


8.70E-07 


105 


D16847 


Mouse mRNA for 
stromal cell 
derived protein-1,
complete cds. 


0.93 


S52796 


prpL2 protein - human 
(fragment)>GP:HSPRP 
L2_1 H;sapiens mRNA
for PRPL-2 protein 


3.20E-08


106


D90915


Synechocystis sp.
PCC6803
complete
genome, 17/27,
2137259-
2267259.


0.92 


YEK9_YEAS
T 


HYPOTHETICAL 53.9 
KD PROTEIN IN 
AFG3-SEB2 
INTERGENIC 
REGION>PIR2:S5047 
7 hypothetical protein 
YER019w - yeast
(Saccharomyces
cerevisiae)>GP:SCE95
37_20 Saccharomyces
cerevisiae chromosome
V cosmids 9537, 9581,
9495, 9867, and 
lambda clone 5898 


5.90E-05 


107 


AJ001101 


Mus musculus 
mRNA for 
gC1qBP gene.


0.92 


DMU58282_1


Drosophila 
melanogaster Bowel 
(bowl) mRNA, 
complete cds; 
Transcription factor; 
C2H2 zinc finger 
protein; zinc fingers 
have extensive 
sequence similarity to 
Drosophila odd- 
skipped 


3.50E-05 


108 


X57108 


Human gene for 
cerebroside 
sulfate activator 
protein, exons 10-
14. 


0.92 


S69032 


hypothetical protein 
YPR144c - yeast
(Saccharomyces 
cerevisiae)>GP:YSCP9 
659_17 Saccharomyces
cerevisiae chromosome 
XVI cosmid 9659; 
Ypr144cp; Weak
similarity near C- 
terminus to RNA 
Polymerase beta 
subunit (Swiss Prot; 
accession number 
P11213) 


4.30E-21 


109 


D14635 


Caenorhabditis
elegans DNA for
EMB-5. 


0.91 


YM13_YEAS
T


PUTATIVE ATP-
DEPENDENT RNA
HELICASE
YMR128W>PIR2:S53
058 probable
membrane protein
YMR128w - yeast
(Saccharomyces 
cerevisiae)>GP:SC955 
3_4 S;cerevisiae 
chromosome XIII 
cosmid 9553; 
Unknown; YM9553;04, 
probable ATP- 
dependent RNA 
helicase, len: 


0.69 


110 


B5S5O0 


CIT-HSP- 

187 T9 TFR PIT- 

HSP Homo 
sapiens genomic 
clone 387J2. 


0.91 


U97553_79


Murine herpesvirus 68
strain WUMS,
complete genome;
Unknown 


0.00016 


111 


X03049 


E. coli DNA 
sequence 5' to
origin of
replication oriC.


0.9 


POL_MLVAV 


POL POLYPROTEIN 
(PROTEASE (EC 

TRANSCRIPTASE 
(EC 2.7.7.49); 
RIBONUCLEASE H
(EC 

3.1.26.4))>PIR1:GNM 
VGV pol polyprotein - 
AKV murine leukemia 
virus 


0.0019 


112 


U91327 


Human
chromosome
12p15 BAC clone
CIT987SK-99D8
complete
sequence.


0.89 


JC5568 


serine protease (EC 
3.4.-.-) hi -Serratia 
marcescens 


1 


113 


X13295


Rat mRNA for 
alpha-2u 
globulin-related 
protein. 


0.89 


MNGPOLYl 


Mengo virus 
polyprotein genome, 
complete cds withe 
repeats 


1 


114 


Z78415 


Caenorhabditis 
elegans cosmid 
C17G1, complete 
sequence. 


0.89 


AB000121_1


Mouse mRNA for 
TBPIP, complete cds; 
TBP1 interacting 
protein 


0.39 


115 


AC002308 


*** 

SEQUENCING 
IN PROGRESS 
*** Human 
Chromosome 
22q11 BAC
Clone 1000e4; 
HTGS phase 1, 
26 unordered 
pieces. 


0.88 


YLK2_CAEE 
L 


HYPOTHETICAL 
122.7 KD PROTEIN 
D1044.2 IN
CHROMOSOME 
III>GP:CELD1 044 j* 
Caenorhabditis elegans 
cosmid D1044 


0.0037 


116 


AC002073 


Human PAC 
clone DJ515N1 
from 22q11.2-
q22, complete 
sequence. 


0.88 


S28499 


probable finger protein 
- rat>GP:RNZFP_1
R;norvegicus mRNA 
for putative zinc finger 
protein 


1.10E-31 


117 


Z83848 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
57A13;HTGS 
phase 1 . 


0.87 


NDL_DROM
E 


SERINE PROTEASE 
NUDEL PRECURSOR 
(EC 3.4.21.- 
)>PIR2:A57096 nudel 
protein precursor - fruit 
fly (Drosophila 
melanogaster)>GP:DM
U29153_1 Drosophila
melanogaster nudel 
(ndl) mRNA, complete 
cds; Serine protease; 
Soma dependent gene 
required matern 


1 


118 


U23449 


Caenorhabditis 
elegans cosmid 
K06A1. 


0.87 


AF023268J 


Homo sapiens clk2 
kinase (CLK2), 
propin1, cote1,
glucocerebrosidase 
(GBA), and metaxin 
genes, complete cds; 
metaxin pseudogene 
and glucocerebrosidase 
pseudogene; and 
thrombospondin3 
(THBS3) 


0.21 


119 


Z68181 


H. vulgaris 
mRNA for 
elongation factor 
EF1-alpha.


0.87 


RABCY450C 
_1 


Rabbit cytochrome P- 
450 gene, clone pP- 
450PBc3, 3' end 


0.14


120 


AC000033 


Homo sapiens 
chromosome Q 

complete 
sequence. 


0.87 


VWF_CANF 


VON WILLEBRAND 
FACTOR 

PRECURSOR>GP:DO 
GVWG_1 Canis 
familiaris von
Willebrand factor 


0.036 


121 


U23449 


Caenorhabditis 
elegans cosmid 
K06A1. 


0.86 


S48988J 


CRP-1=cystatin-related
protein [rats, Wistar
albino, mRNA Partial,
213 nt]; Cystatin-
related protein; 
Method: conceptual 
translation supplied by 
author; This sequence 
comes from Fig; 


0.64 


122 


Z89651 


F.rubripes GSS 
sequence, clone 
090I24cD5. 


0.86 


CPU65981J 


Cryptosporidium 
parvum P-ATPase gene 
(CppA-El)gene, 
complete cds; Putative 
calcium-ATPase 


0.6 


123 


7040 


Human DNA
sequence from
PAC 24M15 on
chromosome 1. 
Contains 
tenascin-R 
(restrictin), EST. 


U.OD 


VJJU ID J UN I 

3 


DEPENDENT 
GLUTAMATE 
SYNTHASE 1 (EC 
1.4.7.1) (FD- 
GOGAT)>PIR2:S6022 
8 glutamate synthase 
(ferredoxin) (EC 
1.4.7.1) gltB- 
Synechocystis sp. (PCC 
6803)>GP:D90902_66 
Synechocystis sp; 
PCC6803 complete 
genome, 4/27, 402290- 
524345; Gluta 




124 


Z49250 


Human DNA 
sequence from 
cosmid HW2, 
Huntington's 
Disease Region, 
chromosome 
4p16.3.


0.86 


TRSCAPSID 
1 


Tobacco ringspot virus 
capsid protein gene, 
complete cds 


3.00E-06


125


Z92855 


Caenorhabditis 
elegans DNA *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
Y48C3- HTGS 
phase 1. 


0.84 


AE000809J 


Methanobacterium 
thermoautotrophicum 
from bases 161632 to 
172569 (section 15 of 
148) of the complete 
genuine, rvbpanyi". 
tRNA synthetase; 
Function Code:10;07- 
Metabolism of 


1 


126 


AC002340 


*** 

SEQUENCING 
IN PROGRESS
*** Arabidopsis
thaliana 'TAMU'
BAC 'T11J7'
genomic
sequence near
marker 'm283';
HTGS phase 1, 2
unordered pieces. 


0.83 


CET01E8_3 


Caenorhabditis elegans 
cosmid T01E8, 
complete sequence; 

Similar tn 1 

l v l co,j, oimilar to 1** 

phosphatidylinositol- 
4,5-bisphosphate 
phosphodiesterase; 
cDNA EST CEESG02F 
comes from this gene; 


0.86 


127 


AL008716 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
206C7; HTGS 
phase 1 . 


0.83 


HIVU51189_5 


HIV-1 clone 93th253 
from Thailand, 
complete genome; Tat 
protein 


0.86 


128 


AC002340 


*** 

SEQUENCING 
IN PROGRESS 
*** Arabidopsis 
thaliana 'TAMU'
BAC 'T11J7'
genomic
sequence near
marker 'm283';
HTGS phase 1, 2
unordered pieces. 


0.83 


S60257 


meltrin alpha - 
mouse>GP:MUSMAB 
_1 Mouse mRNA for 
meltrin alpha, complete 
cds 


0.0013


129


Z83848 


Human DNA 
sequence *** 
SEQUENCING
IN PROGRESS 
*** from clone 
57A13;HTGS 
phase 1 . 


0.82 


ARO1_PNEC
A 


PENTAFUNCTIONAL 
AROM 

POLYPEPTIDE
(CONTAINS: 3- 
DEHYDROQUINATE 
SYNTHASE (EC 
4.6.1.3), 3- 

DEHYDROQUINATE 
DEHYDRATASE (EC 
4.2.1.10) (3- 
DEHYDROQUINASE) 
, SHIKIMATE 5- 
DEHYDROGENASE 
(EC 1.1.1.25),

SHIKIMATE KINASE 
(EC 2.7.1.71), AND 
EPSP SYNTHASE (E 


0.0098 


130 


AF029308 


Homo sapiens 
chromosome 9 
duplication of the 
T cell receptor 
beta locus and 
trypsinogen gene 
families. 


0.8 


CELZK84_5 


Caenorhabditis elegans 
cosmid ZK84; Final 
exon in repeat region; 
similar to long tandem 
repeat region of 
sialidase 

(SP:TCNA_TRYCR,
P23253) and 
neurofilament H 
protein; coded for by C; 
elegans 


2.00E-08 


131 


AC002458 


Human BAC 
clone RG098M04 
from 7q21-q22, 
complete 
sequence. 


0.78 


IGF2_PIG 


INSULIN-LIKE 
GROWTH FACTOR II 
PRECURSOR (IGF- 
II)>GP:SSIGF2_1 
S;scrofa mRNA IGF2 
for insulin-like-growth 
factor 2; Insulin-like-
growth factor 2 
preproprotein 


0.44 


132 


Z83843 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
368A4; HTGS 
phase 1 . 


0.78 


PAR51AJ 


P;tetraurelia 51A
surface protein gene, 
complete cds 


0.0014


133


X03021 


Human gene for 
granulocyte- 
macrophage 
colony 

stimulating factor 
(GM-CSF). 


0.78 


CEF57B1_3 


Caenorhabditis elegans 
cosmid F57B1, 
complete sequence; 
F57B1;3; Protein 
predicted using 
Genefinder; similar to 
collagen 


2.20E-05 


134 


Z74825 


S.cerevisiae 
chromosome XV 
reading frame
ORF YOL083w. 


0.77 


SYLM_SCHP
O 


PUTATIVE LEUCYL- 
TRNA 

SYNTHETASE,

MITOCHONDRIAL 
PRECURSOR (EC 
6.1.1.4) (LEUCINE- 
TRNA 

LIGASE)>PIR2:S6248 
6 hypothetical protein 
SPAC4G8.09 - fission 
yeast 

(Schizosaccharomyces
pombe)>GP:SPAC4G8 
_9 S;pombe 
chromosome I cosmid 
c4G8; Unknown; 
SPAC 


0.96 


135 


Z74825 


S.cerevisiae 
chromosome XV 
reading frame 
ORF YOL083w. 


0.77 


RNU59809J 


Rattus norvegicus 
mannose 6- 
phosphate/insulin-like 
growth factor II 
receptor (M6P/IGF2r) 
mRNA, complete cds; 
Also termed IGF-
II/Man 6-P receptor,
MPR, CI-MPR 


0.01 


136 


U80445 


Caenorhabditis 
elegans cosmid 
C50F2. 


0.76 


S28499 


probable finger protein 
- rat>GP:RNZFPJ 
R;norvegicus mRNA 
for putative zinc finger 
protein 


1.10E-31


137 


Z78545 


Caenorhabditis 
elegans cosmid 
M03B6, complete 
sequence. 


0.75 


RRU73586J 


Rattus norvegicus 
Fanconi anemia group 
C mRNA, complete 
cds; Fanconi anemia 
group C protein; 
Similar to human FAC 
protein, GenBank 
Accession Numbers 
X66893 and X66894 


0.023


138


Z97630 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone
466N1; HTGS
phase 1. 


0.74 


HSMSHREC 
A_1


H;sapiens mRNA for 
MSH receptor; Author- 
given protein sequence 
is in conflict with the
conceptual translation


0.036 


139 


AF007269 


Arabidopsis 
thaliana BAC 
IG002N01 


0.71 


HSU95090J 


Homo sapiens 
chromosome 19 cosmid 

sequence; F19541_1;
Hypothetical (partial) 
protein similar to 
proline oxidase 


0.16 


140 


AC002393 


Mouse 
BAC284H12 
Chromosome 6, 
complete 
sequence. 


0.7 


RNLTBP2J 


Rattus norvegicus 
mRNA for LTBP-2 like 
protein; Latent TGF- 
beta binding protein-2 
like protein 


4.40E-05 


141 


B15232


344G8.TV 
CIT978SKA1 
Homo sapiens 
genomic clone A- 
344G08. 


0.67 


DMSEVL2_2 


Drosophila 

melanogaster sevenless 
mRNA; Put; sevenless 
protein (AA 1-2510)


0.41 


142 


D13748


Human mRNA 
for eukaryotic 
initiation factor 
4AI. 


0.66 


MMU53563J 


Mus musculus Brg1
mRNA, partial cds; N- 
terminal region of the 
protein 


0.00016 


143 


S45791 


band 3-related
protein=renal
anion exchanger
AE2 homolog
[rabbits, New 
Zealand White, 
ileal epithelial 
cells, mRNA, 
3964 nt]. 


0.66 


POLS_RUBV
R 


STRUCTURAL 
POLYPROTEIN 

NUCLEOCAPSID 
PROTEIN C; 
MEMBRANE 
GLYCOPROTEINS E1
AND 

E2)>PIR1:GNWVRA 
structural polyprotein - 
rubella virus (strain 
RA27/3 

vaccine)>GP:RUBCE2
1_1 Rubella virus
RA27/3 RNA for
capsid, E2 and E1
proteins; Poly 


5.60E-05 


144 


M22462 


Chicken protein
p54 (ets-1)
mRNA, complete
cds.


0.66


HSHP8PROT
J


H;sapiens mRNA for
HP8 protein; HP8
peptide


2.00E-06


145 


U27999 


Human clone 
pDEL52All 
HLA-C region 
cosmid 52 

fypnnmif* inrvpv 

sequence. 


0.65 


CA18_HUMA
N 


COLLAGEN ALPHA
1(VIII) CHAIN
PRECURSOR
(ENDOTHELIAL

15435 collagen alpha
1(VIII) chain precursor

human>GP:HSCOL8A
1_1 Human COL8A1
mRNA for alpha
1(VIII) collagen


5.70E-06 


146 


M54787 


N.crassa mating 
type a-1 protein 
(mt a-1) gene, 
exons 1-3. 


0.64 


150717 


vacuolar H+-ATPase A 
subunit - chicken 
(fragment)>GP:GGU22
078_1 Gallus gallus
vacuolar H+-ATPase A 
subunit gene, partial 
cds 


0.0046 


147 


AC002094 


Genomic 
sequence from 
Human 17, 
complete 
sequence. 


0.63 


PVPVA1J 


P;vivax pva1 gene


0.1 


148 


U32701 


Haemophilus
influenzae from 
bases 165345 to 
176101 (section 
16 of 163) of the 
complete 
genome. 


0.63 


FABG_HAEI
N 


3-OXOACYL-[ACYL-
CARRIER PROTEIN]
REDUCTASE (EC 
1.1.1.100) (3- 
KETOACYL- ACYL 
CARRIER PROTEIN 
REDUCTASE)>PIR2: 
D64051 3-oxoacyl- 
[acyl-carrier-protein] 
reductase (EC 
1.1.1.100)- 
Haemophilus 
influenzae (strain Rd 
KW20)>GP:HIU32701 
_7 Haemophilus 


2.00E-12


149 


Z37159 


T.brucei serum 
resistance 
associated (SRA) 
mRNA for VSG- 
like protein. 


0.61


<NONE>


<NONE>


<NONE>


150


AF027865 


Mus musculus
Major
Histocompatibility
Locus class II
region. 


0.61 


A56514 


chromokinesin -
chicken>GP:GGU1830
9_1 Gallus gallus
chromokinesin mRNA,
complete cds


w.UU 


151 


U40938 


Caenorhabditis 
elegans cosmid 
D1009. 


0.61 


YA53_SCHP
O 


HYPOTHETICAL 24.2 
KD PROTEIN 
C13A11.03 IN
CHROMOSOME
I>GP:SPAC13A11_3
S;pombe chromosome I
cosmid c13A11;
Unknown; 
SPAC13A11;03, 
unknown, len: 210 


1.90E-24 


152 


116670 


Sequence 1 from 
patent US 
5476781. 


0.59 


CELF21F8_7 


Caenorhabditis elegans 
cosmid F21F8; Similar 
to eukaryotic aspartyl 
proteases 


0.39 


153 




Human DNA

sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
299D3; HTGS 
phase 1 . 




PI m VPA<! 

T 


PROTEIN 

CLG1>PIR2:S37607 

cyclin-like protein
YGL215w - yeast
(Saccharomyces 
cerevisiae)>GP:SCYG 
L215W_1 S;cerevisiae 
chromosome VII 
reading frame ORF 
YGL215w>GP:YSCC 
LG1CPRJ 
Saccharomyces 
cerevisiae cyclin-like 
protein (CLG1) gene


U.UU 1 J 


154 


U00054 


Caenorhabditis 
elegans cosmid 
K07E12. 


0.57 


<NONE> 


<NONE> 


<NONE> 


155 


M21207 


Synthetic SV40 T 
antigen mutant 
pseudogene, 3' 
end. 


0.57 


1CJL2 


cathepsin L (EC 
3.4.22.15) mutant 
(F(78P)L, C25S, 
T110A,E176G, 
D178G), fragment 2- 
human 


0.43


156


AF020282 


Dictyostelium 
discoideum 
DG2033 gene, 
partial cds. 


0.56 


AC002125_4 


Homo sapiens DNA 
from chromosome 19- 
cosmid F25965, 
genomic sequence, 
complete sequence; 
F25965_1;

Hypothetical 35;3 kDa 
protein similar to 
GTPase-activating 
proteins and orf3 from 


0.6 


157 


M86352 


Stigmatella 
aurantiaca reverse 
transcriptase (163 
RT) gene, 
complete cds. 


0.56 


AC002398_4 


Human DNA from 
chromosome 19- 
specific cosmid 
F25965, genomic 
sequence, complete 
sequence; F25965_3; 
Hypothetical 96 kDa 
human protein similar 
to alpha chimaerin; 
Hypothetical 
protein>GP:AC002398
_4 Human DNA from 
chromosome 19- 
specific cosmi 


4.50E-06 


158 


AC003101 


*** 

SEQUENCING 
IN PROGRESS 
*** Homo 
sapiens 

chromosome 17, 
clone 

HRPC41C23; 
HTGS phase 1, 
33 unordered 
pieces. 


0.54 


<NONE> 


<NONE> 


<NONE> 


159 


B12117 


F5L15-T7 IGF 
Arabidopsis 
thaliana genomic
clone F5L15. 


0.54 


CEF32H2_5 


Caenorhabditis elegans 
cosmid F32H2, 
complete sequence; 
F32H2;5; Similarity to 
Chicken fatty acid 
synthase (SW:P12276); 
cDNA EST yk16c2;5
comes from this gene;
cDNA EST yk113h6;5
comes 


1


Nearest
Neighbor
(BlastN vs.
Genbank)


Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 


160 


AE000664 


Mus musculus 
TCR beta locus 
from bases 
250554 to 501917 
(section 2 of 3) of 
the complete 
sequence. 


0.54 


CET01G9_6 


Caenorhabditis elegans 
cosmid T01G9, 
complete sequence; 
T01G9;4; CDNA EST 
yk29b7;5 comes from 
this gene 


0.84 


161 


B12117 


F5L15-T7 IGF 

Arabidopsis

thaliana genomic 
clone F5L15. 


0.54 


A39718 


nicotinic acetylcholine 
receptor alpha chain -
marbled electric ray 
(fragments) 


0.27 


162 


Z71261 


Caenorhabditis 
elegans cosmid 
F21C3, complete 
sequence. 


0.5 


KDGE_DRO
ME 


EYE-SPECIFIC 
DIACYLGLYCEROL 
KINASE (EC 
2.7.1.107) (RETINAL 
DEGENERATION A 
PROTEIN) 
(DIGLYCERIDE 
KINASE) 

(DGK)>GP:DRODAG
K_1 Fruit fly mRNA

for diacylglycerol 
kinase, complete cds 


4.60E-05 


163 


M61831 


Human S- 
adenosylhomocys 
teine hydrolase 
(AHCY) mRNA, 
complete cds. 


0.49 


P2C2_ARAT
H 


PROTEIN 

PHOSPHATASE 2C 
(EC 3.1.3.16) 
(PP2C)>PIR2:S55457 
phosphoprotein 
phosphatase (EC 
3.1.3.16) 2C- 
Arabidopsis 
thaliana>GP:ATHPP2 
CA_1 Arabidopsis 
thaliana mRNA for 
protein phosphatase 2C 


5.60E-08 


164 


U42608 


Glycine max 
clathrin heavy 
chain mRNA, 
complete cds. 


0.48


<NONE>


<NONE>


<NONE>


165 


Z93042 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
6B17; HTGS 
phase 1 . 


0.47 


PYRD_BACS
U 


DIHYDROOROTATE 

DEHYDROGENASE 

(EC 1.3.3.1) 

(DIHYDROOROTATE 

OXIDASE) 

(DHODEHASE)>PIR1
:H39845

dihydroorotate oxidase 
(EC 1.3.3.1) -Bacillus 
subtilis>GPN:BSUB00 
09_25 Bacillus subtilis 
complete genome 
(section 9 of 21): from 
1598421 to 1807200; 


0.002 


166 


AC000044 


Human 
Chromosome 
22q13 Cosmid
Clone p/oeiu, 
complete 
sequence. 


0.47 


MATK_MAR
PO 


PROBABLE INTRON 
MATURASE>PIR2:A 
05034 hypothetical 
protein 370i - liverwort 
(Marchantia 
polymorpha) 
chloroplast>GP:CHMP 
XX_21 Liverwort 
Marchantia 

polymorpha chloroplast 
genome DNA,
ORF370i 


0.0011 


167 


X51508 


Rabbit mRNA for 
aminopeptidase N 
(partial). 


0.47 


S45361 


LRR47 protein - fruit 
fly (Drosophila 
melanogaster)>GP:DM
LRR47J 

D;melanogaster mRNA 
for LRR47 


5.30E-07 


168 


Z67035 


H.sapiens DNA 
segment 
containing (CA) 
repeat; clone 
AFM323yf1;
single read. 


0.45 


JQ2246 


22.5K cathepsin D 
inhibitor protein 
precursor - 

potato>GP:POTCATH 
D_1 Potato cathepsin D
inhibitor protein 
mRNA, complete cds 


0.79 


169 


Z93042 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
6B17; HTGS 
phase 1 . 


0.44 


SMU31768J 


Schistosoma mansoni 
elastase gene, 3045 bp 
clone, complete cds 


0.0022


170 


L11172


Plasmodium 
falciparum RNA 
polymerase I 
gene, complete 
cds. 


0.43 


HUMPKD1G0 
8J 


Homo sapiens 
polycystic kidney 
disease (PKD1) gene,
exons 43-46; Polycystic 
kidney disease 1 
protein 


1 


171 


Z95889 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
211A9; HTGS 
phase 1 . 


0.43 


A09811J 


R;norvegicus mRNA 
for BRL-3A binding
protein; Author-given 
protein sequence is in 
conflict with the 
conceptual translation 


0.00083 


172 


U32772 


Haemophilus 
influenzae from 
bases 954819 to 
966363 (section 
87 of 163) of the 
complete 
genome. 


0.43 


YPT2_CAEE
L 


HYPOTHETICAL 21.6 
KD PROTEIN 
F37A4.2 IN 
CHROMOSOME 
III>PIR2:S44639
F37A4.2 protein - 
Caenorhabditis 
elegans>GP:CELF37A 
4_8 Caenorhabditis 
elegans cosmid F37A4 


2.50E-28 


173 


Z99281 


Caenorhabditis 
elegans cosmid 
Y57G11C, 
complete 
sequence. 


0.42 


PTU19464_1 


Paramecium tetraurelia 
outer arm dynein beta 
heavy chain gene, 
complete cds 


1 


174 


X04571 


Human mRNA 
for kidney 
epidermal growth 
factor (EGF) 
precursor. 


0.42 


YEK9_YEAS
T 


HYPOTHETICAL 53.9 
KD PROTEIN IN 
AFG3-SEB2 
INTERGENIC 
REGION>PIR2:S5047 
7 hypothetical protein 
YER019w- yeast 
(Saccharomyces 
cerevisiae)>GP:SCE95 
37_20 Saccharomyces 
cerevisiae chromosome 
V cosmids 9537, 9581,
9495, 9867, and 
lambda clone 5898 


0.99


175 


U32772 


Haemophilus 
influenzae from 
bases 954819 to 
966363 (section 
87 of 163) of the 
complete 
genome. 


0.41 


YPT2_CAEE
L 


HYPOTHETICAL 21.6 
KD PROTEIN 
F37A4.2 IN 
CHROMOSOME 
III>PIR2:S44639 
F37A4.2 protein - 
Caenorhabditis 
elegans>GP:CELF37A 
4_8 Caenorhabditis 
elegans cosmid F37A4 


7.80E-21 


176 


AC002053 


Human
Chromosome
9p22 Cosmid
Clone 92f5, 
complete 
sequence. 


0.4 


HSU33837J 


Human glycoprotein 

precursor, mRNA, 
complete cds 


1 


177 


U88309 


Caenorhabditis 
elegans cosmid 
T23B3. 


0.4 


DROMTTGN 
CJ 


Drosophila 
melanogaster 
mitochondrial 
cytochrome c oxidase 
subunit I (COI) gene, 5' 
end, Trp-, Cys-, and 
Tyr-tRNA genes, 
NADH dehydrogenase 
subunit 2 (ND2) gene, 
3' end 


0.99 


178 


M34025 


Human fetal Ig

heavy chain 
variable region 
(clone M44) 
mRNA, partial 
cds. 


0.39 


DNA2_YEAS
T 


DNA REPLICATION
HELICASE 
DNA2>PIR2:S48904 
probable purine 
nucleotide-binding 
protein YHR164c- 
yeast (Saccharomyces 
cerevisiae)>GPN:YSC 
H9986J 
Saccharomyces 
cerevisiae chromosome 
VIII cosmid 9986; 
Dna2p: DNA 
replication helicase; 
YHR164c>GP:


1


179 


AC002395 


Homo sapiens; 
HTGS phase 1, 
127 unordered 
pieces. 


0.39 


VVJvlUMPE 


NONSTRUCTURAL 
PROTEIN V 
(NONSTRUCTURAL 
PROTEIN NS1) 


0.11


180 


AC003101 


***

SEQUENCING 
IN PROGRESS 
*** Homo 
sapiens 

chromosome 17,
clone 

HRPC41C23; 
HTGS phase 1, 
33 unordered 
pieces. 


0.39 


YLK2_CAEE
L 


HYPOTHETICAL 
122.7 KD PROTEIN 
D1044.2 IN
CHROMOSOME
III>GP:CELD1044_4
Caenorhabditis elegans
cosmid D1044 


0.0001 


181 


Z54335 


Human DNA 
sequence from 
cosmid L17A9, 
Huntington's 
Disease Region, 
chromosome 
4p16.3. Contains
VNTR and a CpG 
island. 


0.39 


HUMNFAT3 
A J 


Homo sapiens NF-AT3 
mRNA, complete cds 


1.60E-06 


182 


U95743 


Homo sapiens 
chromosome 16 
BAC clone 
CIT987-SK65D3, 
complete 
sequence. 


0.38 


CEZC434_6


Caenorhabditis elegans 
cosmid ZC434, 
complete sequence; 
ZC434;6; CDNA EST 
CEESO02F comes 
from this gene; cDNA 
EST CEESS60F comes 
from this gene 


0.18 


183 


AC001229 


Sequence of BAC
F5I14 from
Arabidopsis
thaliana
chromosome 1,
complete
sequence.


0.34 


HSOCAMJ 


H;sapiens mRNA for 
immunoglobulin-like 
domain-containing 1 
protein 


0.051 


184 


X01703 


Human gene for 
alpha-tubulin (b 
alpha 1). 


0.33 


NTC3_MOUS
E 


NEUROGENIC 
LOCUS NOTCH 3 
PROTEIN>PIR2:S453 
06 notch 3 protein - 
mouse>GP:MMNOTC 
_1 M;musculus mRNA 
for Notch 3 


0.012


185 


Z82189 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
170A21; HTGS
phase 1 . 


0.31 


LG106_3 


Lemna gibba 
negatively light- 
regulated mRNA 
(Lg106); Second
longest ORF (2) 


0.27 


186 


Z98051 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
501A4; HTGS 
phase 1 . 


0.3 


S34960 


NADH dehydrogenase 
(ubiquinone) (EC 
1.6.5.3) chain 5- 
Crithidia oncopelti 
mitochondrion 
(SGC6)>GP:MICOCN 
NR_3 Crithidia 
oncopelti mitochondrial 
ND4, ND5,COI, 12S 
ribosomal RNA genes
for NADH 

dehydrogenase subunit 
4/5, cytochrome 
oxidase subun 


0.25 


187 


Z98749 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
449017; HTGS 
phase 1. 


0.3 


SCKC_LEIQ
H 


CHARYBDOTOXIN 
(CHTX) (CHTX- 
LQ1)>PIR2:A60963 
charybdotoxin 1 - 
scorpion (Leiurus 
quinquestriatus)>3D:2 
CRD Charybdotoxin 
(nmr, 12 structures) - 
scorpion (Leiurus 
quinquestriatus) 


0.12 


188 


X96763 


C.albicans CDC4 
gene. 


0.29 


CECC4_1 


Caenorhabditis elegans 
cosmid CC4, complete 
sequence; CC4;a; 
Protein predicted using 
Genefinder; 
preliminary prediction 


1.30E-17


189 




Porphyra

purpurea 

chloroplast 

genome, 

complete 

sequence. 


n oft 


til VoCLJKJC 
J 


Human

immunodeficiency 
virus type 1 heavy- 
chain complementarity-
determining region 3 
mix. in a. ^cione 1 i ), 
partial cds; Heavy- 
chain complementarity- 
determining region 3 
(CDR3) from HIV 
gpl20- 

>GP:HIVHCDR3M 
Human 

immunodeficiency
virus type 1 he 


i 
i 


190 


U20657 


Human ubiquitin 
protease (Unph) 
proto-oncogene 
mRNA, complete 
cds. 


0.28 


HSU20657J 


Human ubiquitin 
protease (Unph) proto- 
oncogene mRNA, 
complete cds 


5.60E-12 


191 


AC002037 


Human 

Chromosome 11
Overlapping 
Cosmids 
cSRL72g7 and 

L/OIVU> 1 HUUO, 

complete 
sequence. 


0.27 


VRP1_YEAS
T 


VERPROLIN>GP:SC 
VERPRLJ 
S;cerevisiae (A364) 
gene for verprolin 


2.00E-11 


192 


U58748 


Caenorhabditis 
elegans cosmid 
ZK180. 


0.27 


EXLP_TOBA
C 


PISTIL-SPECIFIC
EXTENSIN-LIKE 
PROTEIN 
PRECURSOR 
(PELP)>PIR2:JQ1696 
pistil extensin-like 
protein precursor (clone 
pMG15)- common 
tobacco>GP:NTPMG1
5_1 N;tabacum mRNA 
for pistil extensin like 
protein 


4.10E-12 


193 


Z68013 


Caenorhabditis 
elegans cosmid 
W02H3, 
complete 
sequence. 


0.26


<NONE>


<NONE>


<NONE>


194 


AF017042 


Dictyostelium 
discoideum LTR- 
retrotransposon 
Skipper, partial 
genomic 
sequence, 5' end. 


0.26 


SPBC31F10_1
4 


S;pombe chromosome 
II cosmid c31F10; 
Hypothetical protein; 
SPBC31F10;14c, 
unknown, len:1586aa, 
some similarity eg; to 
YJR140C, 
YJ9H_YEAST,
P47171, involved in
cell cycle regulation


1 


195 


B03174 


cSRL-16e2-u 
cSRL flow sorted 
Chromosome 11
specific cosmid 
Homo sapiens 
genomic clone 
cSRL-16e2. 


0.26 


CELC30E1 J7 


Caenorhabditis elegans 
cosmid C30E1 


0.38 


196 


X70810 


E.gracilis 
chloroplast 
complete 
genome. 


0.25 


CEK10H10_8 


Caenorhabditis elegans 
cosmid K10H10,
complete sequence; 
K10H10;k; Protein 
predicted using 
Genefinder; 
preliminary prediction 


0.98 


197 


U80024 


Caenorhabditis 
elegans cosmid 
C18B10. 


0.25 


MMAF001794
_1


Mus musculus Treacher 
Collins Syndrome 
protein (Tcof1) mRNA,
complete cds; Putative 
nucleolar 

phosphoprotein; similar 
to Homo sapiens 
Treacher Collins 
syndrome TCOF1 
protein 

encoded>GP:MMAF00 
1794_1 Mus musculus 
Treacher Collins 
Syndrome p 


0.017


198 


AC000591 


Drosophila 
melanogaster 
(subclone 9_g3 
from P1 DS01486
(D32)) DNA 
sequence, 
complete 
sequence. 


0.25 


YHGE_ECOL
I


HYPOTHETICAL 64.6 
KD PROTEIN IN 
MRCA-PCKA 
INTERGENIC 
REGION 

(F574)>PIR2:E65135 
hypothetical 64.6 kD 
protein in mrcA-pckA 
intergenic region - 
Escherichia coli (strain 
K- 

12)>GP:ECAE000415_ 
7 Escherichia coli , 
mrcA, yrfE, yrfF, yrfG, 
yrfH, yrfI


0.00068 


199 


AC000591 


Drosophila
melanogaster
(subclone 9_g3
from P1 DS01486
(D32)) DNA
sequence, 
complete 
sequence. 


0.25 


YHGE_ECOL
I


HYPOTHETICAL 64.6
KD PROTEIN IN 
MRCA-PCKA 
INTERGENIC 
REGION 

(F574)>PIR2:E65135 
hypothetical 64.6 kD 
protein in mrcA-pckA 
intergenic region - 
Escherichia coli (strain 
K- 

12)>GP:ECAE000415_ 
7 Escherichia coli , 
mrcA, yrfE, yrfF, yrfG, 
yrfH, yrfI


0.00068


200 


Z99571 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
388N15; HTGS
phase 1 . 


0.24 


YA53_SCHP
O 


HYPOTHETICAL 24.2 
KD PROTEIN 
C13A11.03 IN
CHROMOSOME 
I>GP:SPAC13A11_3 
S;pombe chromosome I 
cosmid C13A11; 
Unknown; 
SPAC13A11;03, 
unknown, len: 210 


0.017 


201 


U00672 


Human 
interleukin-10 
receptor mRNA, 
complete cds. 


0.24 


TFDP00900 


- Polypeptides entry 
for factor Oct-2.5 


1.00E-05


202 


AC003061 


*** 

SEQUENCING 
IN PROGRESS 
*** Mouse 
Chromosome 6 
BAC clone 
b245cl2; HTGS 
phase 2, 8 
ordered pieces. 


0.23 


CGI HUMA 
N 


CGI 

PROTEIN>GP:HSU46
023_1 Human Xq28
mRNA, complete cds; 
Orf 


0.00078 


203 


AF009420 


Homo sapiens 
microsatellite 
sequence in the 
HNF3a gene. 


0.22 


PN0675 


collagen alpha
1(XVIII) chain - mouse
(fragment)>GP:MUSC
OLLAG_1 Mouse
mRNA for collagen,
partial cds


0.00072 


204 


B18861 


F20C18-Sp6 IGF 
Arabidopsis 
thaliana genomic 
clone F20C18. 


0.22 


TFDP00659 


- Polypeptides entry 
for factor PR 


0.0003 


205 


U00672 


Human 
interleukin-10 
receptor mRNA, 
complete cds. 


0.22 


TFDP00900 


- Polypeptides entry 
for factor Oct-2.5 


1.00E-05 


206 


X52105 


Dictyostelium 
discoideum SP60 
gene for spore 
coat protein. 


0.18 


<NONE> 


<NONE> 


<NONE> 


207 


L07628 


Saccharopolyspor 
a erythraea 
insertion 

sequence IS1136,
copy B, 3' end. 


0.17 


D88764J 


Rana catesbeiana 
mRNA for alpha 2 type 
I collagen, complete 
cds 


0.00021 


208 


Z49631 


S.cerevisiae
chromosome X 
reading frame 
ORF YJR131w. 


0.16 


YSCDAL1A 
1 


Saccharomyces 
cerevisiae allantoinase
(DAL1) gene, complete 
cds 


1 


209 


Z87893 


F.rubripes GSS 
sequence, clone 
043C17aB8. 


0.16 


CELC27A12 
8 


Caenorhabditis elegans 
cosmid C27A12; 
Partial CDS; this gene 
begins in the 
neighboring clone; 
coded for by C; elegans 
cDNA yk127f1;3;
coded for by C; elegans
cDNA yk127f1;5


1.30E-07


210 


U92852 


Rhoiptelea 
chiliantha 

maturase (matK)

gene, chloroplast 
gene encoding 
chloroplast 
protein, complete 
cds. 


0.15 


SEU40259_5 


Staphylococcus
epidermidis
trimethoprim resistance

plasmid pSK639; Orf53 


0.95 


211 


X62620 


B.mori Abd-A 
gene nurneutAJA. 


0.15 


ATAP22J36 


Arabidopsis thaliana 
DNA chromosome 4,
ESSA I AP2 contig 
fragment No; 2; 
Hypothetical protein; 
Similarity to NADH 
dehydrogenase, 
Chondrus crispus; 
MNOS:S59107 


0.75 


212 




CJJoLClIi Udl I VII Ud 

simple repeat 
array (ir3). 


ft i ^ 
yj. i j 




ultra-high-sulfur
keratin 1 -
mouse>GP:MUSSER1
_1 Mouse serine 1 ultra 
high sulfur protein 
gene, complete cds; 
Putative 




213 


M35027 


Vaccinia virus, 

complete 

genome. 


0.14 


MTF1_FUSN
U 


MODIFICATION 
METHYLASE FNUDI 
(EC 2.1.1.73) 
(CYTOSINE- 
SPECIFIC 

METHYLTRANSFER 
ASE FNUDI) 
(M.FNUDI) 


0.87 


214 


AC003058 


***

SEQUENCING 
IN PROGRESS 
*** Arabidopsis 
thaliana 'IGF'
BAC 'F27F23' 
genomic 
sequence near 
marker 
'CIC06E08'; 
HTGS phase 1,8 
unordered pieces. 


0.14 


HEXA_DICDI


BETA- 

HEXOSAMINIDASE 
ALPHA CHAIN 
PRECURSOR (EC 
3.2.1.52) (N-ACETYL- 
BETA- 

GLUCOSAMINIDASE 
) (BETA-N- 
ACETYLHEXOSAMI 
NIDASE)>PIR2:A307 
66 beta-N- 

acetylhexosaminidase 
(EC 3.2.1.52) A
precursor - slime mold
(Dictyostelium
discoideum)>GP:DDIN
AGAJ D;d


0.006


215 


AC001229 


Sequence of BAC
F5I14 from
Arabidopsis
thaliana
chromosome 1,
complete
sequence.


0.13 


A49281 


pol protein - simian T- 
cell lymphotropic virus 
type 1, STLV-1 (isolate
Bab34) 

(fragment)>GP:STVB 
ABPOLA_1 Simian T-
cell leukemia virus 
PCR derived (pol) 
gene, partial sequence 
BAB34POL; Bases 
4779-4918 EMBL 
ATK numbering 
system; BAB34POL 


0.77 


216 


U46067 


Capra hircus 
beta-mannosidase 
mRNA, complete 
cds. 


0.12 


S70663 


lectin heavy chain, N- 
acetylgalactosamine- 
specific - Entamoeba 
histolytica 

(fragment)>GP:EHU33 
443_1 Entamoeba
histolytica GalNAc 
lectin heavy subunit 
(hgl4) gene, partial cds; 
N-acetylgalactosamine 
adherence lectin heavy 
subunit 


0.8 


217 


AC000380 


*** 

SEQUENCING 
IN PROGRESS 
*** Human 
Chromosome 3 
pacpDJ70ill; 
HTGS phase 1,2 
unordered pieces. 


0.12 


ATFCA8J9 


Arabidopsis thaliana 
DNA chromosome 4, 
ESSA I contig 
fragment No; 8; 
Unnamed protein 
product 


0.64


218


X61207 


A.brasilense hisB, 

H, A, F and E 

genes for 

imidazole 

glycerolphosphat 

e dehydratase, 

glutamine 

amidotransferase, 

phosphorybosilfo 

rmimino-5- 

amino- 

phosphorybosil- 
4- 

imidazolecarboxa 
mide isomerase, 
cyclase and 
phosphorybosil- 
AMP- 

cyclohydrolase. 


0.12 


OCCL02J 


O;circumcincta colost-
2 gene; Cuticular 
collagen 


0.0074 


219 


AF014259 


HIV-1 Patient 
1088 from 
Edinburgh, MA- 
P 1 ' \Efl&) gene, 
partial cds. 


0.11 


DMU88570J 


Drosophila 
melanogaster CREB- 
binding protein 
homolog mRNA,
complete cds; CBP 


1 


220 


AC000636 


Drosophila 
melanogaster 
(subclone 2_c11
from P1 DS0766A
(D44)) DNA
sequence, 
complete 
sequence. 


0.11 


A64829 


hypothetical protein in 
dmsC 3' region - 
Escherichia coli (strain 
K-
12)>GP:ECAE000192_
1 Escherichia coli , 
ycaD, ycaK, pflA, pflB, 
focA genes from bases 
944908 to 955952 
(section 82 of 400) of 
the complete genome; 
Hypothetical protein in 
dmsC 


0.051 


221 


AC002428 


Human BAC
clone GS039E22 
from 5q31, 
complete 
sequence. 


0.11 


HSNMYC2J 


Human N-myc gene 
exon 2; Put; N-myc 
protein (aa 1-263) (953 
is 1st base in codon)


0.00014 


222 


L40949 


Homo sapiens 
(clone AT7-5eu) 
opioid-receptor-
like protein 
mRNA, 5' end. 


0.11 


CEUNC93_2 


C;elegans unc-93 gene; 
Protein 2 


1.20E-13 


223


AL008636 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
722E9; HTGS 
phase 1.


0.1 


XELCOL2A1A_1


Xenopus laevis alpha-1
collagen type II
mRNA, complete cds;
Alpha-1 type II
collagen 


2.60E-06 


224 


L/OU77J 


Human NamhHa^ 

DNA for 
immunoglobulin 
light chain. 


0.1


CELM02B7_2


cosmid M02B7 


1.80E-09


225 


AC002539 


Homo sapiens 
chromosome 17, 
clone 195o20, 
complete 
sequence. 


0.098 


MTCY7D11_17


Mycobacterium 
tuberculosis cosmid 
Y7D11; Unknown; 
MTCY07D11;17c;
unknown, len: 186 aa,
FASTAbest: Q10390
Y009_MYCTU
hypothetical 31;0 KD
protein MTCY190 09C
(299 aa) opt: 355 z- 
score: 316;8 


0.026 


226 


M88165 


Human inter- 
alpha-trypsin 
inhibitor light 
chain (ITI) gene, 
exon 1.


0.096 


A54161 


ryanodine-binding 
protein alpha form - 
bullfrog>GP:D21070_1
Rana catesbeiana 
mRNA for bullfrog 
skeletal muscle calcium 
release channel 
(ryanodine receptor) 
alpha isoform (RyR1),
complete cds; 
Ryanodine receptor 
alpha isoform 


1 


227 


Z92851 


Caenorhabditis 
elegans DNA ***
SEQUENCING 
IN PROGRESS 
*** from clone 
Y39G8; HTGS 
phase 1. 


0.082 


CYA7_BOVIN


ADENYLATE 
CYCLASE, TYPE VII 
(EC 4.6.1.1) (ATP 
PYROPHOSPHATE- 
LYASE) (ADENYLYL 
CYCLASE) 


0.3 


228


L00638 


Arabidopsis 
thaliana ubiquitin
conjugating 
enzyme exons 2- 
4. 


0.072 


NUCM_TRYBB


NADH-UBIQUINONE 
OXIDOREDUCTASE 
49 KD SUBUNIT 
HOMOLOG (EC 
1.6.5.3) (NADH 
DEHYDROGENASE 
SUBUNIT 7 
HOMOLOG)>PIR2:A3 
5693 NADH 
dehydrogenase (EC 
1.6.99.3) chain 7- 
Trypanosoma brucei 
mitochondrion (SGC6) 


0.24 


229 


U49169 


Dictyostelium 
discoideum V- 
ATPase A 
subunit (vatA) 
mRNA, complete 
cds. 


0.071 


MMU65594J 


Mus musculus Brca2 
mRNA, complete cds; 
Similar to human breast 
cancer susceptibility 
gene BRCA2; Allele: 
wild type; putative 
tumor suppressor 


1 


230 


AF001549 


Homo sapiens 
chromosome 16 
BAC clone 
CIT987SK- 
270G1 complete 
sequence. 


0.07 


PM22_HUMAN


PERIPHERAL 
MYELIN PROTEIN 22 
(PMP-22)>PIR2:JN0503
peripheral myelin 
protein 22 - 
human>GP:HUMGAS 
3X_1 Human 
peripheral myelin 
protein 22 (GAS3) 
mRNA, complete 
cds>GP:HUMPMP22_ 
1 Human peripheral 
myelin protein 22 
mRNA, complete 
cds>GP:HUMPMP22 


0.0078 


231 


L36829 


Mus musculus 
alphaA-crystallin- 
binding protein I 
(AlphaA-
CRYBP1) gene,
complete cds. 


0.066 


<NONE> 


<NONE> 


<NONE> 


232 


AC000159 


*** 

SEQUENCING 
IN PROGRESS 
*** Human BAC 
Clone 11q13;


0.058 


CEZK863J 


Caenorhabditis elegans 
cosmid ZK863, 
complete sequence; 
ZK863;2; Similar to 
collagen 


1 


HTGS phase 1, 
10 unordered
pieces. 










233 


AC000159 


*** 

SEQUENCING 
IN PROGRESS 

*** Human BAC
Clone 11q13;
HTGS phase 1, 
10 unordered 
pieces. 


0.058 


CAC2_HAECO


CUTICLE 
COLLAGEN 2C 
(FRAGMENT)>GP:H 

H;contortus collagen
2C mRNA, 3'end 


1.20E-08 


234 


Z23908 


H. sapiens 
(D5S630) DNA 
segment 
containing (CA) 
repeat; clone 
AFM268zd9; 
single read. 


0.057 


VEU34999_1


Venezuelan equine 
encephalitis virus 
nonstructural and 
structural polyprotein 
genes, complete cds; 
Nonstructural 
polyprotein; Internal 
stop codon, 

rpJiHthrf\ticrh nrrnrQ 

of the time 


0.0002 


235 


B21875 


T3E8-Sp6 TAMU 
Arabidopsis 
thaliana genomic
clone T3E8. 


0.055 


YRR2_CAEEL


HYPOTHETICAL 91.1 
KD PROTEIN R144.2
IN CHROMOSOME 
III>GP:CELR144J7 
Caenorhabditis elegans 
cosmid R144; Coded 
for by C; elegans 
cDNA CEESP84R; 
coded for by C; elegans 
cDNA yk23c4;5; coded 
for by C; elegans 
cDNA yk44f9;5; coded 
for by C; eleg 


0.68 


236 


Z98303 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
140H19; HTGS 
phase 1.


0.048 


AC002330_1


Arabidopsis thaliana 
BAC T10P11,
complete sequence; 
Putative zinc-finger 
protein; C2H2 Zn- 
finger signature from 
position 80 to 100 
[CEICNKGFQRDQNL 
QLHRRGH] 


0.99 
237


rviQQi 1 
utyy i i 


thermophilus 
UvrA gene, 
complete cds. 


0 C\AA 


APP1_MOUSE


AMYLOID-LIKE
PROTEIN 1
PRECURSOR
(APLP)>PIR2:A46362 
amyloid precursor-like 
protein - 

mouse>GP:MUSAPLP 
_1 Mouse amyloid 
precursor-like protein 
mRNA, complete cds 


R QfiF-fVi 

0.7UL-UO 


238 


LMyy I I 


Thermus
thermophilus 
UvrA gene, 
complete cds. 




1_2 


Mus musculus alpha-
1(XVIII) collagen
(COL18A1) gene,
exons 40-43, complete 
cds 




239 


X78119 


P.amygdalus, 
Batsch (Texas) 
pru1 mRNA.


0.042 


CA44_HUMAN


COLLAGEN ALPHA 
4(IV) CHAIN 
PRECURSOR>PIR1:C
GHU1B collagen alpha 
4(IV) chain precursor - 
human>GP:HSCOL4A 

H;sapiens mRNA
for collagen type IV 
alpha 4 chain; Type IV 
collagen alpha 4 chain 


2.00E-06 


240 


U72877 


Rana catesbeiana 
L-epinephrine 

mRNA, complete 
cds. 


0.041 


YRR6_MYCCA


HYPOTHETICAL 33.0 
KD PROTEIN IN 

R6)>PIR2:S42125 
hypothetical protein 3 - 
Mycoplasma 
capricolum 
(SGC3)>GP:MYCRP 
MH_6 M; capricolum 
rpmH, rnpA and licA 
gene; Orf R6 


0.0008 


241 


L39891 


Homo sapiens 
polycystic kidney 
disease- 
associated protein 
(PKD1) gene,
complete cds. 


0.04 


MUC2_HUMAN


MUCIN 2 
(INTESTINAL 
MUCIN 2) 
(FRAGMENTS) 


5.90E-05 


242 


L40390 


Candida glabrata 
ERG3 gene, 
complete cds. 


0.039 


G01763


atrophin-1 - 
human>GP:HSU23851 
_1 Human atrophin-1 
mRNA, complete cds 


9.00E-07 


243


B28113


T2L16TRB 
TAMU 
Arabidopsis 
thaliana genomic 

clone T2L16.


0.038 


CELZK1248_14


Caenorhabditis elegans 
cosmid ZK1248 


1.60E-18


244 


AC000030


00175, complete 
sequence. 


0.033 


ATFCA8_40 


Arabidopsis thaliana 
DNA chromosome 4, 
ESSA I contig 
fragment No; 8; 
Glycerol-3-phosphate 
permease homolog; 
Similarity to glycerol- 
3-phosphate permease - 
Haemophilus 
influenzae 


0.63 


245 


B10738


F13G15-Sp6 IGF
Arabidopsis 
thaliana genomic 
clone F13G15.


0.032 


D87521_1


Mus musculus DNA- 
PKcs mRNA, complete 
cds 


0.21


246 


AF024503 


Caenorhabditis 
elegans cosmid 
F31F4. 


0.03 


I38344


titin - human 


1


247 


Z49888 


Caenorhabditis 
elegans cosmid 
F47A4, complete 
sequence. 


0.027 


KSU52064_1


Kaposi's sarcoma- 
associated herpes-like 
virus ORF73 homolog 
gene, complete cds; 
Herpesvirus saimiri 
ORF73 

homolog>GP:KSU756 
98_78 Kaposi's
sarcoma-associated 
herpesvirus long unique 
region, 80 putative 
ORF's and kaposin 
gene, complete cds; OR 


3.40E-10 


248 


Z83822 


Human DNA 
sequence from 
PAC 306D1 on 
chromosome X 
contains ESTs. 


0.025 


GRSB_BACBR


GRAMICIDIN S 
SYNTHETASE II 
(GRAMICIDIN S 
BIOSYNTHESIS 
GRSB PROTEIN) (EC 
6.-.-.-) 


1


249 


Z94161 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
N102C10; HTGS


0.025 


S16323


hypothetical protein - 
Arabidopsis 
thaliana>GP:ATHB1_1
A;thaliana homeobox 
gene Athb-1 mRNA; 
Open reading frame 


0.0079 


phase 1.










250 


AC002094


Genomic 
sequence from 
Human 17, 
complete 
sequence. 


0.021 


S57447 


HPBRII-7 protein - 
human>GP:HSHPBRII 
4_1 H;sapiens HPBRII-
4
mRNA>GP:HSHPBRII
7_1 H;sapiens HPBRII- 
7 gene 


8.20E-08 


251 


D79994 


Human mRNA 
for KIAA0172 
gene, partial cds. 


0.021 


CER10H10J 


Caenorhabditis elegans 
cosmid R10H10, 
complete sequence; 
R11A8;7; Protein 
predicted using 
Genefinder; Similarity 
to Mouse ankyrin (PIR 
Acc; No; S37771); 
cDNA EST CEESX25F 
comes from this gene; 


7.00E-16 


252 


Z97635 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
438L4; HTGS
phase 1.


0.017 


CELW05H7_4 


Caenorhabditis elegans 
cosmid W05H7 


0.24 


253 


X84996 


X.laevis mRNA 
for selenocysteine 
tRNA acting 
factor (Staf).


0.017 


JN0786 


integrin beta-4 chain 
precursor - mouse 


0.088 


254 


AC002543 


Human BAC 
clone RG300C03 
from 7q31.2, 
complete 
sequence. 


0.013 


MZLMTCYT 
BTJ 


Mendozellus isis 
mitochondrial NADH 
dehydrogenase, and 
cytochrome b genes, 3' 
end, and transfer RNA- 
Ser gene; This codes 
for the last 43 amino 
acids of NADH 
dehydrogenase subunit 
1 followed 


0.044 
255


U10401 


Caenorhabditis 
elegans cosmid 
T20B12. 


0.012 


MMMHC29N7_2


Mus musculus major 
histocompatibility 
locus class III 
region:butyrophilin-like 
protein gene, partial 
cds; Notch4, PBX2, 
RAGE, lysophatidic 
acid acyl transferase- 
alpha, palmitoyl- 


0.069 


256 


L14593 


Saccharomyces 
cerevisiae protein 
phosphatase 
(PTC1) gene,
complete cds. 


0.011 


D86995_1


Human (gene 1) DNA
for phosphatase 2C 
motif, partial cds 


2.20E-14 


257 


U62317 


Chromosome
22q13 BAC
Clone
CIT987SK-
384D8 complete
sequence.


0.0093 


P2Y8_XENLA


P2Y PURINOCEPTOR 
8 

(P2Y8)>GP:XLP2Y8_
1 X;laevis mRNA for 
P2Y8 nucleotide 
receptor 


0.89 


258 


D29655 


Pig mRNA for 
UMP-CMP 
kinase, complete 
cds. 


0.0075 


AF004858_1


Mus musculus platelet 
activating factor 
receptor mRNA, partial 
cds; PAF-receptor 


1 


259 


AF002992 


Homo sapiens 
cosmid from 
Xq28, complete 
sequence. 


0.0054 


FBN1_BOVIN


FIBRILLIN 1 
PRECURSOR>PIR2:A 
55567 fibrillin 1 -
bovine>GP:BOVXAA 
AA_1 Bos taurus 
mRNA, complete cds; 
Putative 


0.0004 


260 


B20752 


T19M2-T7 
TAMU 
Arabidopsis 
thaliana genomic 
clone T19M2. 


0.0043 


HSVT1IEPJ 


Feline herpesvirus type 
1 gene for immediate 
early protein, complete 
cds; Feline herpesvirus 
type 1 immediate early 
protein 


3.90E-05 
261


AB006699 


Arabidopsis 
thaliana genomic 
DNA, 

chromosome 5, 
P1 clone: MDJ22.


0.0037 


YHV5_YEAST


HYPOTHETICAL 
143.6 KD PROTEIN 
IN SPO16-REC104 
INTERGENIC 
REGION>PIR2:S4675 
4 hypothetical protein 

YHR155w - yeast

(Saccharomyces 
cerevisiae)>GPN:YSC 
H9666_15 
Saccharomyces 
cerevisiae chromosome 
VIII cosmid 9666; 
Yhr155wp; Similar to
Sip3p (Snf 


0.077 


262 


Z99128 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
422H11; HTGS
phase 1. 


0.0032 


ALU1_HUMAN


!!!! ALU 
SUBFAMILY J 
WARNING ENTRY 
!!!! 


0.0087 


263 


B21848 


T2D2-Sp6 
TAMU 
Arabidopsis 
thaliana genomic 
clone T2D2. 


0.0031 


B31794 


mdm-1 protein (clone 
cl03)- mouse 


1.00E-05 


264 


L33853 


Human germline
immunoglobulin
kappa chain
variable region
(Vk-IV subgroup)
for anti-B-
amyloid
autoantibodies in
Alzheimer's
disease.


0.0027 


B45550 


cytochrome b homolog 
- Plasmodium yoelii


0.99 


265 


B36863 


HS-1042-A1- 
F01-MR.abi CIT 
Human Genomic 
Sperm Library C 
Homo sapiens 
genomic clone 
Plate=CT 824 
Col=1 Row=K.


0.0027 


YQK4_CAEEL


HYPOTHETICAL 64.3 
KD PROTEIN 
C56G2.4 IN 
CHROMOSOME 
III>GP:CELC56G2_2
Caenorhabditis elegans 
cosmid C56G2 


0.81 
266


AC003041 


***
SEQUENCING
IN PROGRESS
*** Homo
sapiens
chromosome 17,
clone
HCIT307A16;
HTGS phase 1,
10 unordered
pieces.


0.0024 


GLB4_LAMSP


GIANT
HEMOGLOBIN AIV
CHAIN
(FRAGMENT)>PIR2:
S01810 hemoglobin 
AIV - tube worm 
(Lamellibrachia sp.) 
(fragment) 


0.94 


267 


AC002315 


Mouse BAC-
146N21
Chromosome X
contains
iduronate-2-
sulfatase gene;
complete
sequence.


0.0022 


MG42_TARMA


SRY-RELATED 
PROTEIN MG42 
(FRAGMENT)>PIR3:I 
51369 Sry-related 
sequence - Tarentola 
mauritanica 
(fragment)>GP:TELM 
G42DNA_1 Gecko
MG42 gene, partial 
cds; Sry-related 
sequence 


0.99 


268 


AF016674


Caenorhabditis 
elegans cosmid 
C03H5.


0.0015 


SCYJL204C_1


S;cerevisiae 
chromosome X reading 
frame ORF YJL204c


1 


269 


AF016674


Caenorhabditis 
elegans cosmid 
C03H5. 


0.0015 


CEM199J 


Caenorhabditis elegans 
cosmid M199,
complete sequence; 
M199;e; Protein 
predicted using 
Genefinder; 
preliminary prediction 


0.97 


270 


AF016674


Caenorhabditis 
elegans cosmid 
C03H5. 


0.0015




Caenorhabditis elegans 
cosmid M199,
complete sequence; 
M199;e; Protein 
predicted using 
Genefinder; 
preliminary prediction 


0.97


271 


Z54199 


L.esculentum 
DNA Ailsa craig 
encoding 1- 
aminocyclopropa 
ne-l-carboxylic 
acid oxidase. 


0.0015 


CELF20A1_5 


Caenorhabditis elegans 
cosmid F20A1; Coded 
for by C; elegans 
cDNA yk9gl;3; coded 
for by C; elegans 
cDNA yk9gl;5; coded 
for by C; elegans 
cDNA CEESU55F; 


0.11 


weak similarity to 
putative 




272 


Z99943 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
313L4; HTGS 
phase 1. 


0.0014 


CEK08F8_5 


Caenorhabditis elegans 
cosmid K08F8, 
complete sequence; 
K08F8;5b 


0.93 


273 


S81083 


beta- 

ADD=adducin 
beta subunit 63 
kda 

isoform/membran
e skeleton
protein, beta - 
ADD=adducin 
beta subunit 63 
kda 

isoform/membran 
e skeleton protein 
{alternatively 
spliced, exon 10 
to 13 region} 
[human, 
Genomic, 1851 
nt, segment 3 of 
3]. 


0.0013 


MTCY277_7 


Mycobacterium 
tuberculosis cosmid 
Y277; Unknown; 
MTCY277;07c, 
unknown, len: 302 


0.0001 


274 


Z82174 


Human DNA 
sequence from 
cosmid B20F6 on 
chromosome 
22q11.2-qter.


0.001 


FBLA_HUMAN


FIBULIN-1, 
ISOFORM A
PRECURSOR>GP:HS
FIBUA_1 H;sapiens
mRNA for fibulin-1 A


0.00063 


275 


Z82215 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
6802; HTGS 
phase 1. 


0.00079 


BFR1_SCHPO


BREFELDIN A 
RESISTANCE 
PROTEIN>PIR2:S522 
39 hba2 protein - 
fission yeast 
(Schizosaccharomyces 
pombe)>GP:SPHBA2 
GEN_1 S;pombe hba2 
gene 


0.15 


276


U28153 


Caenorhabditis 
elegans UNC-76 
(unc-76) gene, 
complete cds. 


0.00071 


CX2_HEMHA 


CYTOTOXIN 2 
(TOXIN 12A) 


0.32 


277 


Z82204 


Human DNA 
sequence from 
clone J362G171. 


0.00054 


DMU34925_2 


Drosophila 
melanogaster DNA 
repair protein (mei-41) 
gene, complete cds, and 
TH1 gene, partial cds 


0.045 


278 


AC002530 


Human BAC 
clone RG341D10 
from 7p15-p21,
complete 
sequence. 


0.00053 


CELT28F2_2 


Caenorhabditis elegans 
cosmid T28F2; Weak 
similarity to HSP90 


0.037 


279 


U91322 


Human
chromosome
16p13 BAC clone
CIT987SK-276F8
complete
sequence.


0.00051 


CEW08D2_2 


Caenorhabditis elegans 
cosmid W08D2, 
complete sequence; 
W08D2;3; Protein 
predicted using 
Genefinder>GP:CEW0 
8D2_2 Caenorhabditis 
elegans cosmid 
W08D2; W08D2;3; 
Protein predicted using 
Genefinder 


0.26 


280 


D16986 


Human HepG2 
partial cDNA, 
clone 

hmd2b09m5. 


0.00037 


POLG_PPVNA


GENOME 
POLYPROTEIN 

(CONTAINS: N-
TERMINAL
PROTEIN; HELPER 
COMPONENT 
PROTEINASE (EC
3.4.22.-) (HC-PRO);
42-50 KD PROTEIN; 
CYTOPLASMIC 
INCLUSION 
PROTEIN (CI); 6 KD 
PROTEIN; NUCLEAR 
INCLUSION 
PROTEIN A (NI-A)
(EC 3.4.22.-) (49K 
PROTEINASE) (49 


0.48 


281 


U91318 


Human 
chromosome 
16p13 BAC clone
CIT987SK- 
962B4 complete 


0.00031 


<NONE> 


<NONE> 


<NONE> 


sequence. 










282 


M93406 


Human dispersed 
Alu repeats and 
dispersed L1
repeat. 


0.0003 


VG8_SPV4 


GENE 8 

PROTEIN>PIR1:G8BP
SV gene 8 protein - 
spiroplasma virus 4 
(SGC3) 


0.23 


283 


AC002398 


Human DNA 
from 

chromosome 19- 
specific cosmid 
F25965, genomic 
sequence, 
complete 
sequence. 


0.00021


HMCA_DROME


HOMEOTIC 
CAUDAL 

PROTEIN>PIR2:A263 
57 homeotic protein 
Cad - fruit fly 
(Drosophila
melanogaster)>GP:DR 
OCADA2_1
D;melanogaster caudal 
gene (cad) encoding a 
maternal and zygotic 
transcript, exon 2; 
Caudal 

protein>TFD:TFDP001
59 - Polypeptides en 


0.021 


284 




Human BAC
clone RG341D10
from 7p15-p21,
complete 
sequence. 


0.0002




complement
C3d/Epstein-Barr virus 
receptor precursor - 
human 


0.7


285 


X01871 


Yeast 

mitochondrial 
ori(o) repeat unit 

of petite mutant S

(petite strain s- 
10/7/2). 


0.00015


RVZMTCYT 
BTJ 


Reventazonia sp; 
mitochondrial NADH 
dehydrogenase, and 

cytochrome b genes, 3'

end, and transfer RNA- 
Ser gene; This codes 
for the last 43 amino 
acids of NADH 
dehydrogenase subunit 
1 followed 


0.73 


286 


U89984 


Acanthamoeba 
castellanii 
transformation- 
sensitive protein 
homolog mRNA, 
complete cds. 


0.00015 


ACU89984_1 


Acanthamoeba 
castellanii 
transformation- 
sensitive protein 
homolog mRNA, 
complete cds; Similar 
to human 
transformation-


4.20E-13 


sensitive protein: 
SwissProt Accession 
Number P31948




287


AC002365 


Homo sapiens 
chromosome X 
clone U177G4, 
U152H5, 
U168D5, 174A6, 
U172D6,and 
U186B3 from 
Xq22, complete
sequence. 


0.00011


S10340


DNA-directed RNA 
polymerase (EC 
2.7.7.6) - yeast 
(Kluyveromyces 
marxianus var. lactis) 


0.00062 


288 


AC002390 


Human DNA 
from overlapping 
chromosome 19-
specific cosmids

R30072 and 
R28588, genomic 
sequence, 
complete 
sequence. 


9.90E-05 


D86603_1


Mouse mRNA for Bach 
protein 1, complete cds;
Bach1


1 


289 


AC002980 


Homo sapiens; 
HTGS phase 1, 
34 unordered 
pieces. 


9.20E-05 


TRBKPCYB_1


Trypanosoma brucei 
kinetoplast 

apocytochrome b gene, 
complete cds 


0.52 


290 


M99412 


Human 
interleukin-8 
receptor (IL8RB) 
gene, complete 
cds. 


4.50E-05 


S28832 


microtubule-associated 
protein HI (clone 
KS3.1) - longfin squid 
(fragment) 


0.88 


291 


AC000120 


Human BAC 
clone RG161K23
from 7q21, 
complete 
sequence. 


4.00E-05 


SXSCRBAJ 


S;xylosus scrB and
scrR genes; Sucrose 
repressor 


0.99 


292




Homo sapiens;
HTGS phase 1, 
66 unordered 
pieces. 




o i j joy 


hypothetical protein 5 -
Lactococcus lactis 
subsp. lactis insertion 
sequence 

1076>GP:LLTLE_1 
Lactococcus lactis 
DNA for the 
transposon-like element 
on the lactose plasmid; 
ORF5 (AA 1 - 43) 


W.u 1 o 


293 


Z81512 


Caenorhabditis 
elegans cosmid 
F25C8, complete 
sequence. 


2.40E-05 


MUSDBPRC_1


Mus musculus DNA- 
binding protein Rc 
mRNA, complete cds; 
DNA binding protein 
Rc 


1 


294 


B16681 


343C3.TVB 
CIT978SKA1
Homo sapiens 
genomic clone A- 
343C03. 


1.10E-05 


COPP_YEAST


COATOMER BETA'
SUBUNIT (BETA'-
COAT PROTEIN)
(BETA'-
COP)>PIR2:B55123
coatomer complex beta' 
chain - yeast 
(Saccharomyces 

cerevisiae)>GP:SCY

GL137W_1 
S;cerevisiae 
chromosome VII 
reading frame ORF 
YGL137w>GP:SCU11
237_1 Saccharomyces 
cerevisiae 


0.081 


295 


Z16523


H. sapiens 
(D9S158) DNA
segment 
containing (CA) 
repeat; clone 
AFM073ybll; 
single read. 


1.00E-05 


MMSEMFJ 


M;musculus mRNA for 
semaphorin F; 
Semaphorin F


0.78 


296 


Z49704 


S.cerevisiae 
chromosome XIII 
cosmid 8021. 


5.60E-06 


<NONE> 


<NONE> 


<NONE> 


297 


AC003071 


Human BAC 
clone BK085E05 
from 22q12.1-
qter, complete
sequence. 


3.00E-06 


HSRCAER_1 


H;sapiens mRNA for 
red cell anion 
exchanger (EPB3,AE1, 
Band 3) 3' non-coding 
region 


0.21 


298 


U20428 


Human SNC19 
mRNA sequence. 


1.40E-06


HUMMUC2A_1


Human mucin-2 gene, 
partial cds 


4.40E-06 


299 


U51903 


Human RasGAP- 
related protein 
(IQGAP2) 
mRNA, complete
cds. 


6.60E-07 


IQGA_HUMAN


RAS GTPASE- 
ACTIVATING-LIKE 
PROTEIN IQGAP1 

Ras GTPase activating- 
related protein - 
human>GP:HUMIQG 
A_1 Homo sapiens ras
GTPase-activating-like 
protein (IQGAP1) 
mRNA, complete cds; 
Amino acid feature: IQ 
calmodulin-binding do 


1.60E-14 


300 




r.ruonpcs uoj 
sequence, clone 
021G08aAl. 


4.70E-07


1V1 1 1 J lvi T i Ci 

D 


METALLOTHIONEIN
10-III (MT-10-
III)>PIR2:S39418 
metallothionein 10-III - 
blue mussel 


7.70E-10


301 




Human BAC
clone RG134C19 
from 8q21, 
complete 
sequence. 


A ^OP fi7 


jiLj / r\ 1 U J 


S;pombe chromosome I
cosmid c57A10; 
Unknown; 
SPAC57A10;05;c, 
unknown, len:606aa, 
similar to A; nidulans 
Q00659, sulfur 
metabolite repression 
control, (678aa), fasta 
scores, opt: 1355, 




302 


AC003089 


Human BAC
clone
RG180F08A,
complete
sequence.


3.80E-07 


HPBPRECK 
1 


Hepatitis B virus type 
1 1 precore protein (pre- 
C region, C) gene, 5' 
end 


0.41 


303 


AC002074 


Human BAC 
clone GS056H18 
from 7q31-q32, 
complete 
sequence. 


2.40E-07 


A47021_1


Sequence 23 from 
Patent WO9527787;
Unnamed protein 
product; Author-given 
protein sequence is in 
conflict with the 
conceptual 

translation>GP:A51260 

1 Sequence 23 from 
Patent WO9614416;
Unnamed protein 
product; Author-given 


0.0016 



protein sequence is i




304 


U04980 


Rattus norvegicus 
fetal troponin T 3 
(fetal TnT3) 
mRNA, partial 
cds. 


2.20E-07 


HUMFSHDJ 


Human 

facioscapulohumeral 
muscular dystrophy 
(FSHD) gene region, 
D4Z4 tandem repeat 

unit* fW) 17 

unit, \Js\r 


3.30E-08 


305 


U68704 


Human 
chromosome 
21q22.3 PI -clone 
3804 subclone 4- 


2.00E-07 


HHV6AGNM 
_96 


Human herpesvirus-6 
(HHV-6)U1102, 
variant A, complete 
virion genome; U88; 

V^Yd ICUCdlo, 11113 lULrl lo 

open in all six reading 
frames, part of IE-A 


2.70E-05 


306 


U51583 


Rattus norvegicus 
zinc finger 
homeodomain 

enhancer-binding
protein-1 (Zfhep-
1) mRNA, partial 
cds. 


8.70E-08 


AF005370_67 


Alcelaphine 
herpesvirus 1 L-DNA, 
complete sequence; 

Putative immediate

early protein; ORF73; 
similar to H; saimiri 
and KSHVORF73 


6.10E-07 


307 


M80206 


Mus domesticus 
poliovirus 
receptor homolog 
(MPH) mRNA, 
complete cds. 


8.10E-08 


I53960


PRR2 alpha - human 


1.70E-28 


308 


M60854 


Human ribosomal 
protein S16 
mRNA, complete 
cds. 


5.70E-08 


OLVPOLJ 


Caprine arthritis 
encephalitis virus 
(isolate OVLV-Nl)pol 
protein gene, 3' end of 
cds; Nt 2497-2695 
from CAEV Co 


0.27 


309 


U82828 


Homo sapiens 
ataxia 

telangiectasia 
(ATM) gene, 
complete cds. 


1.50E-08 


C40201 


artifact-warning
sequence (translated 
ALU class C) - human 


0.00044 
310




Human DNA
sequence from 
PAC 111J24 on 
chromosome 
22ql2-qter 
contains ESTs. 


1.40E-08


UCI AH1 1 

HdUo44/J_l 


Human rheumatoid 
arthritis synovium 
immunoglobulin heavy 
chain variable region 
mRNA, partial 
cds>GP:HSU64498_1
Human rheumatoid 
arthritis synovium 
immunoglobulin heavy 
chain variable region 
mRNA, partial cds 




311 


Z50029 


Caenorhabditis 
elegans cosmid 
ZC504, complete 
sequence. 


1.40E-08


MMU88984_1


Mus musculus NIK 
mRNA, complete cds 


1.70E-50


312 


AC002351 


Homo sapiens; 
HTGS phase 1, 
17 unordered 
pieces. 


1.20E-08 


D41132 


collagen-related protein 
4 - Hydra 
magnipapillata 
(fragment)>PIR2:S219 
32 mini-collagen - 
Hydra 

sp>GP:HSNCOL4_1
Hydra N-COL4
mRNA for mini-
collagen; No start
codon 


0.02 


313 


B65763 


CIT-HSP- 
2023A12.TR 
CIT-HSP Homo
sapiens genomic 
clone 2023A12. 


3.60E-09 


S18106 


type II site-specific 
deoxyribonuclease (EC 
3.1.21.4) AbrI -
Azospirillum brasilense


0.045 


314 


Z93021 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
516C23; HTGS
phase 1. 


2.00E-09 


AB001684_134


Chlorella vulgaris C-27 
chloroplast DNA, 
complete sequence; 
RNA polymerase 
gamma subunit 


0.6 


315 


D88035 


Rat mRNA for 
glycoprotein 
specific UDP- 
glucuronyltransfe 
rase, complete 
cds. 


1.50E-09 


D88035_1


Rat mRNA for 
glycoprotein specific 
UDP- 

glucuronyltransferase, 
complete cds 


1.00E-33 
316


U85193 


Human nuclear 
factor I-B2 
(NFIB2) mRNA, 
complete cds. 


1.30E-10 


VGF1_IBVB


F1
PROTEIN>PIR1:VFIH
B1 F1 protein - avian
infectious bronchitis
virus (strain
Beaudette)>GP:IBACG
B_1 Avian infectious
bronchitis virus pol
protein, spike protein,
small virion-associated
protein, membrane
protein, and
nucleocapsid protein
gen


1 


317 


B04719 


cSRL-42G12-u 
cSRL flow sorted 
Chromosome 11
specific cosmid 
Homo sapiens 
genomic clone 
cSRL-42G12. 


7.90E-11 


JC5238 


galactosylceramide-like 
protein, GCP - human 


0.31 


318 


M73506 


Mouse Tcp-10c (t
allele) gene. 


2.80E-11 


A39487 


T-complex protein 10a 
(allele 129) - mouse 


4.10E-16 


319 


U71148 


Human Xq28 
cosmids U225B5 
and U236A12, 
complete 
sequence. 


1.20E-11 


A56547 


sex-peptide precursor - 
Drosophila suzukii 


0.4 


320 


Z95116 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
57G9; HTGS 
phase 1.


9.90E-13 


ALU2_HUMAN


!!!! ALU
SUBFAMILY SB
WARNING ENTRY
!!!!


0.0017 


321 


M64795 


Rat MHC class I 
antigen gene 
(RT1-u
haplotype), 
complete cds. 


1.70E-14 


STC_DROME


SHUTTLE CRAFT 
PROTEIN>GP:DMU0 
9306_1 Drosophila
melanogaster shuttle 
craft protein (stc) 
mRNA, complete cds; 
C-terminal 222 amino 
acids encode a novel 
single-stranded DNA
binding domain 


1.40E-13 
322


Y09036 


H.sapiens 
NTRK1 gene, 
exon 17. 


4.20E-15 


AF010403_1


Homo sapiens ALR 
mRNA, complete cds; 
Alternatively spliced; 
similarity to ALL-1 and
Drosophila trithorax 


1 


323 


U12523


Rattus norvegicus 
ultraviolet B 
radiation- 
activated UV98 
mRNA, partial 
sequence. 


2.90E-15 


SPBC30D10_4


S;pombe chromosome 
II cosmid C30D10; 
Hypothetical protein; 
SPBC30D10;04, 
unknown, len:148aa 


2.40E-09 


324 


Z98755 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
76C18; HTGS 
phase 1. 


2.20E-15 


RPON_HALMA


DNA-DIRECTED 
RNA POLYMERASE 
SUBUNIT N (EC
2.7.7.6)>PIR2:D41715 
DNA-directed RNA 
polymerase II chain 
RPB10 homolog -
Haloarcula
marismortui>GP:
HMAENOA_4 
H;marismortui tRNA- 
Leu, HL29, HmaL13, 
HmaS9, OrfMMV, 
OrfMNA 2- 
phosphoglycerate 
dehydr 


0.019 


325 


M86917 


Human oxysterol- 
binding protein 
(OSBP) mRNA, 
complete cds. 


1.60E-15 


CEF14H8_2 


Caenorhabditis elegans 
cosmid F14H8, 
complete sequence; 
F14H8;1; Similarity to 
Human oxysterol- 
binding protein 
(SW:OXYB_HUMAN) 


2.10E-18 


326 


AC001231 


Genomic 
sequence from 
Human 17, 
complete 
sequence. 


1.30E-15 


AC002397_1


Mouse BAC284H12 
Chromosome 6, 
complete sequence; 
DRPLA 


0.0016 


327 


AL008626 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
1114G22; HTGS 
phase 1.


5.30E-16 


TAU48227_1


Triticum aestivum 
soluble starch synthase 
mRNA, partial cds 


5.90E-05 
328


L04483 


Human ribosomal 
protein S21
(RPS21) mRNA,
complete cds. 


7.60E-17 


RS21_HUMAN


40S RIBOSOMAL 
PROTEIN 
S21>PIR2:S34108 
ribosomal protein S21 - 
human>GP:SSZ84015_ 
1 S;scrofa mRNA; 
expressed sequence tag 
(3'; clone cllglO); 40S 
ribosomal protein S21; 
Similar to human 40S 
ribosomal protein 
S21>GP:HUMRPS21X 
_1 Human ribosomal 


1.40E-09 


329 


AB001899 


Homo sapiens 
PACL4 gene, 
exon 2. 


6.70E-17 


LRP1_HUMAN


LOW-DENSITY
LIPOPROTEIN
RECEPTOR- 
RELATED PROTEIN 

1 PRECURSOR (LRP) 
(ALPHA-2- 
MACROGLOBULIN 
RECEPTOR) (A2MR) 

(APOLIPOPROTEIN
E RECEPTOR)
(APOER)>PIR2:S0239 

2 LDL receptor-related 
protein precursor - 
human>GP:HSLDLRR 
L_1 Human mRNA for
LDL-recept 


1 


330 


Z98755 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
76C18; HTGS 
phase 1.


4.40E-17 


U97553_59 


Murine herpesvirus 68 
strain WUMS, 
complete genome; 
Ribonucleotide 
reductase large 


0.06 


331 


AF017187 


Homo sapiens 
LTR HERV-K 
repetitive element 
fragment 
ltr_19_9a 
sequence. 


3.90E-18 


D84255_1


Ovophis okinavensis 
mitochondrial DNA for 
NADH dehydrogenase 
subunit 1, partial cds,
Ile-tRNA, Pro-tRNA, 
Phe-tRNA, Gln-tRNA,
Met-tRNA and control 
region (D-loop region); 
This cds 


0.007 
332


B36252 


HS-1038-A2-
G01-MR.abi CIT
Human Genomic

Sperm Library C 
Homo sapiens 
genomic clone 
Plate=CT 820 
Col=2 Row=M. 


3.10E-18 


PGBM_MOUSE


BASEMENT 
MEMBRANE-
SPECIFIC HEPARAN
SULFATE 
PROTEOGLYCAN 
CORE PROTEIN 
PRECURSOR (HSPG) 
(PERLECAN) 
(PLC)>PIR2:S18252 
heparan sulfate 
proteoglycan - 
mouse>GP:MUSPERP
A_1 Mouse perlecan
mRNA, complete cds 


0.00015 


333 


D78255 


Mouse mRNA for 
PAP-1, complete 
cds. 


2.70E-18 


MUSPAP1_1


Mouse mRNA for 
PAP-1, complete cds 


3.50E-18 


334 


AC003046 


Human Xp22 
PACs RPC11- 
263P4 and 

R PP 11-1 fAK ^ 

complete 
sequence. 


1.40E-18 


CEC34F6J 


Caenorhabditis elegans 
cosmid C34F6; 
C34F6;1; cDNA EST

vlf 1 *?* S rnmpQ frnm 

J'NtUU IX,J vlsliJ^.3 1IVJ11I 

this gene; cDNA EST 
yk44c4;5 comes from 
this gene; cDNA EST 
yk46bl2;3 comes from 
this gene 


0.0015 


335 


Amnion? 


Human DNA 

from overlapping 
chromosome 19- 
specific cosmids 
R29515 and
R28253, genomic 
sequence, 
complete 
sequence. 


1.40E-18 


MUSZFPO 1 


Mouse mRNA for zinc

finger protein, partial 
sequence 


1.30E-19 


336 


Y15054


Rattus norvegicus 
mRNA for 70 
kDa tumor 
specific antigen, 
partial. 


3.40E-19 


HS4U2IR2J 


Epstein-Barr virus 
(AG876 isolate) U2- 
IR2 domain encoding 
nuclear protein 
EBNA2, complete cds; 
Nuclear antigen 2 


2.00E-06 


337 


Z97876 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
295C6; HTGS 


1.30E-19 


AF003535_1


Homo sapiens L1
element ORF2-like 
protein gene, partial cds 


7.00E-05



phase 1.










338 


M97159 


Mouse (clone 
pIL2) B1
dispersed repeat 
unit. 


1.10E-19 


A26882 


pIL2 hypothetical 
protein - rat 
(fragment)>GP:RATT
DR_1 Rat growth and
transformation- 
dependent mRNA, 3' 
end; Growth and 
transformation 
dependent protein 


0.2 


339 


U30817 


Bos taurus very-
long-chain acyl-
CoA
dehydrogenase
mRNA, nuclear
gene encoding
mitochondrial 
protein, complete 
cds. 


4.70E-20 


ACDV_RAT


ACYL-COA
DEHYDROGENASE,
VERY-LONG-CHAIN
SPECIFIC
PRECURSOR (EC
1.3.99.-)
(VLCAD)>PIR2:A548
72 acyl-CoA
dehydrogenase (EC
1.3.99.-) very-long-
chain-specific
precursor -
rat>GP:RATVLCAD_
1 Rat mRNA for very- 
long-chain Acyl-CoA 
dehydrogenase, compl 


8.10E-25 


340 


Y11535 


H.sapiens mRNA 
for SHOXb 
protein. 


2.80E-20 


ALU1_HUMAN


!!!! ALU
SUBFAMILY J
WARNING ENTRY
!!!!


0.00027 


341 


AL008730 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
487J7; HTGS 
phase 1. 


7.10E-21 


C40201 


artifact-warning 
sequence (translated 
ALU class C) - human 


0.001 


342 


U96629 


Human 

chromosome 8 
BAC clone 
CIT987SK-2A8 
complete 
sequence. 


5.30E-23 


ALU1_HUMAN


!!!! ALU 
SUBFAMILY J 
WARNING ENTRY 
!!!! 


3.80E-10 
343


U95743 


Homo sapiens 
chromosome 16 
BAC clone 
CIT987-SK65D3, 
complete 
sequence. 


2.10E-24 


UROM_HUMAN


UROMODULIN 
PRECURSOR 
(TAMM-HORSFALL 
URINARY 
GLYCOPROTEIN) 
(THP)>PIR2:A30452 
uromodulin precursor - 
human>GP:HUMUMO 
D_1 Human
uromodulin (Tamm- 
Horsfall glycoprotein) 
mRNA, complete cds; 
Uromodulin precursor 


1 


344 


U15972


Mus musculus 
homeobox 
(Hoxa7) gene, 
complete cds. 


4.00E-25 


S20790 


extensin - 

almond>GP:PAEXTS_ 
1 P;amygdalus mRNA 
for extensin 


0.34 


345 


U15972


Mus musculus 
homeobox 
(Hoxa7) gene, 
complete cds. 


4.00E-25 


CA24_CAEEL


COLLAGEN ALPHA 
2(IV) CHAIN 
PRECURSOR>GP:CE 
COLA2IV_2 C;elegans 
a2(IV) collagen gene; 
Alternatively spliced 
transcript 


0.1 


346 


Z66242 


H. sapiens CpG 
island DNA 
genomic Mse1
fragment, clone
84a4, reverse read
cpg84a4.rt1a.


4.80E-26 


CEC35A5_8 


Caenorhabditis elegans 
cosmid C35A5, 
complete sequence; 
C35A5;8; CDNA EST 
yk31f6;5 comes from
this gene; cDNA EST
yk38h1;3 comes from
this gene; cDNA EST
yk38h1;5 comes from
this gene; 


7.70E-19 


347 


L25331 


Rattus norvegicus 
lysyl hydroxylase
mRNA, complete 
cds. 


3.90E-26 


LYSH_CHICK


PROCOLLAGEN- 
LYSINE,2- 
OXOGLUTARATE 5- 
DIOXYGENASE 
PRECURSOR (EC 
1.14.11.4) (LYSYL 
HYDROXYLASE)>PI 
R2:A23742 
procollagen-lysine 5- 
dioxygenase (EC 
1.14.11.4) precursor -
chicken>GP:CHKLYH
_1 Chicken lysyl


1.10E-43 
hydroxylase mRNA,
complete cds 




348 




Drosophila 
melanogaster 
(subclone 2 d7 
from PI DS04260 
(D68)) DNA 
sequence, 
complete 
sequence. 






Caenorhabditis elegans 
cosmid C52B9; Coded 
for by C; elegans 
cDNA cm11d6; weakly
similar to S; cerevisiae
PTM1 precursor 
(SP:P32857) 




349 


U78082 


Human RNA 
polymerase 
transcriptional 
regulation 
mediator (h- 
MED6) mRNA, 
complete cds. 


2.30E-26 


HSU78082_1


Human RNA 
polymerase 
transcriptional 
regulation mediator (h- 
MED6) mRNA, 
complete cds; H- 
Med6p 


1.50E-16 


350 


U43381 


Human Down 
Syndrome region 
of chromosome 
21 DNA. 


2.10E-28 


HSMRNAEB 
1 


H;sapiens genomic 
DNA, integration site 
for Epstein-Barr virus; 
Hypothetical protein 


0.18 


351 


D50416 


Mouse mRNA for 
AREC3, 
complete cds. 


2.50E-29 


A29947 


prostaglandin- 
endoperoxide synthase 
(EC 1.14.99.1) 
precursor - 

sheep>GP:SHPCOXA_ 
1 Sheep prostaglandin 
endoperoxide 
synthetase 
(cyclooxygenase), 
complete cds; 
Cyclooxygenase 
precursor (EC 
1;14;99;1) 


0.81 
352


U85193


Human nuclear
factor I-B2
(NFIB2) mRNA,
complete cds.


2.20E-29




Crithidia fasciculata 
fully edited ATPase 
subunit 6 (MURF4) 
mRNA, partial cds; 
Cryptogene 




353 


Z92826 


Caenorhabditis 
elegans DNA *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
C18D11;HTGS 
phase 1.


1.10E-30 


SPAC1B3J 


S;pombe chromosome I 
cosmid c1B3;
Hypothetical protein; 
SPAC1B3;05, probable 
transcriptional 
regulator, len:630aa, 
similar eg; to YIL038C, 
NOT3_YEAST, 
P06102, general 
negative regulator, 


3.20E-35 


354 


L09604 


Homo sapiens 
differentiation- 
dependent A4 
protein mRNA, 
complete cds. 


3.70E-32 


PVU72769_1


Phaseolus vulgaris 
PvPRP-12(Pvprpl-12) 
mRNA, partial cds; 
Similar to cell wall 
proline rich 

protein>GP:PVU72769_

1 Phaseolus vulgaris 
PvPRP-12(Pvprpl-12) 
mRNA, partial cds; 
Similar to cell wall 
proline rich protein 


0.00049 


355


B42455 


HS-1055-B2- 
G03-MR.abi CIT 
Human Genomic 
Sperm Library C 

Homo sapiens

genomic clone 
Plate=CT 777 
Col=6 Row=N. 


1.30E-32 


CELT05H4J 


Caenorhabditis elegans 
cosmid T05H4; Similar 
to the beta transducin 
family; coded for by C; 

elegans cDNA

yk156e11;3; coded for
by C; elegans cDNA
yk14c8;3; coded for by
C; elegans cDNA 


6.90E-14 


356 


AF001905 


Homo sapiens 
cosmids E079, 
B0920 and A8 
from Xq25 X- 
linked 

lymphoproliferati 
ve disease gene 
candidate region, 
complete 
sequence. 


1.80E-33 


I38344


titin - human 


1 
357


E03743 


DNA sequence 
including male 
hormone 
dependent gene 
derived from 
hamster 
flank organ.


1.10E-34 


CELC03A7J2 


Caenorhabditis elegans 
cosmid C03A7; Weak 
similarity to serotonin
receptors 


0.59 


358 


U31199 


Human laminin 
gamma2 chain 
gene (LAMC2), 
exon 22 and 
flanking 
sequences. 


1.20E-35 


B44018 


laminin B2t chain - 
human>GP:HSLAMB2 
TB_1 H;sapiens mRNA 
for laminin 


1.20E-14 


359 


D14678 


Human mRNA 
for kinesin- 
related protein, 
partial cds. 


2.00E-36 


D49544_1


Mouse mRNA for 
KIFC1, complete cds 


1.20E-23 


360 


AB000425 


Porcine DNA for 
endopeptidase 
24.16, exon 16 
and complete cds. 


8.20E-38 


POL4_DROME


RETROVIRUS-
RELATED POL
POLYPROTEIN
(PROTEASE (EC
3.4.23.-); REVERSE
TRANSCRIPTASE
(EC 2.7.7.49);
ENDONUCLEASE)
(TRANSPOSON

412)>PIR1:0NFF42 
retrovirus-related pol 
polyprotein - fruit fly 
(Drosophila 
melanogaster) 
transposon 

412>GP:DMRT412G 
4 


0.65 


361 


U39875 


Rattus norvegicus 
EF-hand Ca2+- 
binding protein 
p22 mRNA, 
complete cds. 


8.80E-42 


I56333


apolipoprotein B - rat 
(fragment)>GP:RATA 
POLPB_l Rattus 
norvegicus (clone 
rb9E) apolipoprotein B 
apoB mRNA, 3' end 


0.23 
362




Rattus norvegicus
hepatocyte 
nuclear factor 3a 
(HNF-3 beta) 
mRNA, complete 
cds. 


f\ 60F 49 


HN3B_RAT


HEPATOCYTE

NUCLEAR FACTOR
3-BETA (HNF-
3B)>GP:RATHNF3B_
1 Rattus norvegicus
hepatocyte nuclear
factor 3a (HNF-3 beta)
mRNA, complete
cds>TFD:TFDP01611
- Polypeptides entry for
factor HNF-3 (beta) 


8.10E-25


363 


D25538 


Human mRNA 
for KIAA0037 
gene, complete 
cds. 


4.10E-43 


CELC34D4 1 
2 


Caenorhabditis elegans 
cosmid C34D4 


0.018 


364 


Z56764 


H.sapiens CpG 
island DNA 
genomic Mse1
fragment, clone
13f7, reverse read
cpg13f7.rt1a.


1.40E-43 


S75263 


hypothetical protein - 
Synechocystis sp. (PCC 
6803)>GP:D90904_29 
Synechocystis sp; 

PCC6803 complete

genome, 6/27, 630555- 
781448; Hypothetical 
protein; 

ORF_ID:sll0983


0.0028 


365 


AC002636 


*** 

SEQUENCING 
IN PROGRESS 
*** Drosophila 
melanogaster
(subclone 2_g4
from P1 DS03323
(D127)) DNA
sequence; HTGS 
phase 2. 


8.40E-44 


DMU95760_1


Drosophila 
melanogaster
strawberry notch (sno) 
mRNA, complete cds; 
Notch pathway 
component; nuclear 
protein 


3.40E-51 


366 


J05499 


Rattus norvegicus 
L-glutamine 
amidohydrolase 
mRNA, complete 
cds. 


8.00E-44 


GLSL_RAT


GLUTAMINASE, 
LIVER ISOFORM 
PRECURSOR (EC 
3.5.1.2) 

(GLS)>GP:RATGAH_ 
1 Rattus norvegicus L- 
glutamine 
amidohydrolase 
mRNA, complete cds 


8.00E-29 
367


U95760 


Drosophila 
melanogaster 
strawberry notch 
(sno) mRNA, 
complete cds. 


5.00E-45 


DMU95760_1


Drosophila 
melanogaster 
strawberry notch (sno) 
mRNA, complete cds; 
Notch pathway 
component; nuclear 
protein 


4.80E-45 


368 


L10106


Mus musculus 
protein tyrosine 
phosphate 
mRNA, complete 
cds. 


4.10E-45 


PTPK_HUMAN


PROTEIN-TYROSINE
PHOSPHATASE
KAPPA PRECURSOR
(EC 3.1.3.48) (R-PTP-
KAPPA)>GP:HSPTPK
AP_1 H;sapiens mRNA
for phosphotyrosine 
phosphatase kappa; 
Human 

phosphotyrosine 
phosphatase kappa 


4.70E-16 


369 


D17218 


Human HepG2 3' 
region MboI
cDNA, clone 
hmd3g02m3. 


9.40E-47 


MMU53563_1


Mus musculus Brg1
mRNA, partial cds; N- 
terminal region of the 
protein 


0.00012 


370 


U78310 


Homo sapiens 
pescadillo 
mRNA, complete 
cds. 


8.10E-48 


HSU78310_1


Homo sapiens 
pescadillo mRNA, 
complete cds 


1.10E-21 


371 


AC000399 


Genomic 
sequence from 
Mouse 9, 
complete 
sequence. 


7.40E-48 


KIP2_YEAST 


KINESIN-LIKE 
PROTEIN 
KIP2>PIR1:C42640
kinesin-related protein 
KIP2 - yeast 
(Saccharomyces 

2XVI 2 S;cerevisiae 
PEP4 and KIP2 genes 
encoding PEP4 
proteinase (partial) and 
kinesin-related protein 
KIP2>GP:SCLACHX 
VI_17S;cerev 


0.14 


372 


AC002327 


*** 

SEQUENCING 
IN PROGRESS 
*** Genomic 
sequence from 
Mouse 7; HTGS 
phase 1, 3


1.40E-48 


CHKC1A205 
l 


Chicken alpha-2 type-1 
collagen; amino acids - 
16 to 3; Precollagen 
alpha-2 


0.024 
unordered pieces.










373 


X67016 


H.sapiens mRNA 
for amphiglycan.


9.00E-49 


CED2085_2 


Caenorhabditis elegans 
cosmid D2085, 
complete sequence; 
D2085;1; Similar to
glutamine-dependent 
carbamoyl-phosphate 
synthase, aspartate 
carbamoyltransferase, 
dihydroorotase; cDNA 
EST 

cm16f3>GP:CED2085
_2 Caenorhabditis 
elegans cosmid D2085; 
D 


0.14 


374 


L10409


Mouse fork head 
related protein 
(HNF-3beta) 
mRNA, complete 
cds. 


1.50E-49 


MMU04197_1


Mus musculus HNF3 
beta transcription factor 
(HNF3b) mRNA, 
partial cds; Sequence of 
this partial cDNA 
begins in the first third 
of the conserved
HNF3/forkhead DNA 
binding domain 


1.20E-30 


375 


U01139 


Mus musculus 
B6D2F1 clone 
2C11B mRNA. 


1.20E-49 


SPBC3D5_14


S;pombe chromosome 
II cosmid c3D5; 
Unknown; 
SPBC3D5;14c, 
unknown; partial; 
serine rich, len:309aa, 
similar eg; to 
YNL283C, 
YN23_YEAST, 
P53832, hypothetical 
52;3 kd protein, 
(503aa), 


0.00091 


376 


Z82170 


Human DNA 
sequence from 
PAC 326L13 
containing brain- 
4 mRNA ESTs 
and polymorphic 
CA repeat. 


9.00E-50 


BSU55043J 


Bacillus subtilis 
plasmid pPOD2000 
Rep, RapAB, RapA, 
ParA, ParB, and ParC 
genes, complete cds; 
ORF3 


0.025 
377


Z99289 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
142L7; HTGS 
phase 1.


7.70E-50 


A64431 


hypothetical protein 
MJ1050- 
Methanococcus 
jannaschii>GP:MJU67 
548_12 Methanococcus
jannaschii from bases 
986219 to 996377 
(section 90 of 150) of
the complete genome; 
M; jannaschii predicted 
coding region MJ1050; 
Identified by 
GeneMark; putativ 


5.60E-05 


378 


X98260 


H.sapiens mRNA 
for M-phase 
phosphoprotein, 
mpp11.


6.20E-50 


ZRF1_MOUSE


ZUOTIN RELATED 
FACTOR>GP:MMU53 
208_1 Mus musculus 
zuotin related factor 
(ZRF1) mRNA,
complete cds; Similar 
to DnaJ encoded by 
GenBank Accession 
Number L16953


3.90E-30 


379 


M18981


Human prolactin 
receptor- 
associated protein 
(PRA) gene,
complete cds. 


9.00E-52 


S106_HUMAN


CALCYCLIN
(PROLACTIN
RECEPTOR
ASSOCIATED
PROTEIN) (PRA)
(GROWTH FACTOR-
INDUCIBLE
PROTEIN 2A9) (S100
CALCIUM-BINDING 
PROTEIN 
A6)>PIR1:BCHUY 
calcyclin - 

human>GP:HUMCAC 
Y_1 Human calcyclin
gene, complete 
cds>GP:HUMCACYA 
_1 Human prolactin 
recept 


8.80E-24 


380 


AB006622 


Homo sapiens 
mRNA for 
KIAA0284 gene, 
partial cds. 


1.60E-53 


S33015 


hypothetical protein - 
human herpesvirus 4 


0.00088 
381


U53225 


Human sorting 

nexin 1 (SNX1)

mRNA, complete 
cds. 


1.80E-55 


G02522 


sorting nexin 1 - 
human>GP:HSU53225_

1 Human sorting 
nexin 1 (SNX1) 
mRNA, complete cds 


9.20E-50 


382 


Z92844 


Human DNA 
sequence from 
PAC 435C23 on 
chromosome X. 
Contains ESTs. 


6.50E-56 


D14487_1


Lentinus edodes 
Le;MFB1 mRNA,
complete cds 


1 


383 


D87450 


Human mRNA 
for KIAA0261
gene, partial cds. 


4.30E-56 


D87450_1


Human mRNA for 
KIAA0261 gene,
partial cds; Similar to 
D;melanogaster parallel 
sister chromatids 
protein 


4.30E-30 


384 


AC002301 


*** 

SEQUENCING 
IN PROGRESS 
*** Human 
chromosome 
16p11.2 BAC
clone CIT987SK- 

n JiiOAj, 1 1 1 vjo 

phase 2, 1 
ordered pieces. 


9.80E-57 


S62328 


kinesin-like DNA 
binding protein KID - 
human>GP:HUMKID_ 
1 Human mRNA for 
Kid (kinesin-like DNA 
binding protein), 
complete cds 


2.60E-27 


385 


L29766 


Homo sapiens 
epoxide hydrolase 
(EPHX) gene, 
complete cds. 


7.30E-57 


HSBCTCF4_1


Homo sapiens mRNA 
for hTCF-4 


2.30E-05 


386 


U58884 


Mus musculus 
SH3-containing 
protein SH3P7 
mRNA, complete 
cds. similar to 
Human Drebrin. 


3.30E-58 


MMU58884_1


Mus musculus SH3- 
containing protein 
SH3P7 mRNA, 
complete cds; similar to 
Human Drebrin; SH3- 
containing protein; 
similar to human 
drebrin 


6.00E-43 


387 


Y15054


Rattus norvegicus 
mRNA for 70 
kDa tumor 
specific antigen, 
partial. 


9.50E-59 


RNY15054_1


Rattus norvegicus 
mRNA for 70 kDa 
tumor specific antigen, 
partial; 70 kD tumor- 
specific antigen 


4.70E-45 
388


AC000406 


*** 

SEQUENCING 
IN PROGRESS 
*** Human 
Chromosome 11
overlapping pacs
pDJ235k10 and
pDJ239b22;
HTGS phase 1,
17 unordered
pieces. 


7.40E-59 


<NONE> 


<NONE> 


<NONE> 


389 


L42612 


Homo sapiens 
keratin 6 isoform 
K6f (KRT6F) 
mRNA, complete 
cds. 


3.60E-59 


KRHUEA 


keratin, type II 
cytoskeletal - human 
(fragment)>GP:HSKE 
RA_1 Human 
messenger fragment 
encoding cytoskeletal 
keratin (type II); 
mRNA from cultured 
epidermal cells from 
human 

foreskin>GP:HUMKE
R56K_1 Human 56k
cytoskeletal type II 
keratin mRNA 


7.60E-30 


390 


L29766 


Homo sapiens 
epoxide hydrolase
(EPHX) gene,
complete cds. 


2.70E-60 


EGR2_HUMAN


EARLY GROWTH
RESPONSE PROTEIN

2 (EGR-2) (KROX-20 
PROTEIN) 

(AT591)>GP:HUMEG 
R2A_1 Human early 
growth response 2 
protein (EGR2) 
mRNA, complete 
cds>TFD:TFDP00485 
- Polypeptides entry for 
factor Egr-2 


7.80E-06 


391 


L08758 


Mus musculus 
homeobox protein 
(Hox A 10) gene, 
5' end of cds. 


1.40E-60 


PAALGYGE 
N_l 


P;aeruginosa algY 
gene; Alginate lyase 


0.00031 


392 


I29058


Sequence 3 from 
patent US 
5576423. 


4.20E-61 


JC5106 


stromal cell-derived 
factor 2 - 

human>GP:D50645_1
Human mRNA for 
SDF2, complete cds; 
Stroma cell-derived 


1.50E-32 
factor-2




393 


I29058


Sequence 3 from 
patent US 
5576423. 


4.20E-61 


JC5106 


stromal cell-derived 
factor 2 - 

human>GP:D50645_1
Human mRNA for 
SDF2, complete cds; 
Stroma cell-derived 
factor-2 


1.50E-32 


394 




beta-mannosidase 
mRNA, complete
cds. 






Capra hircus beta-
mannosidase mRNA, 
complete cds 


I. /Ub-39 


395 


U40747 


Mus musculus 
formin binding 
protein 11
mRNA, partial 
cds. 


6.90E-63 


S64713 


formin binding protein 
11 - mouse

(fragment)>GP:MMU4 
0747_1 Mus musculus 
formin binding protein 
11 mRNA, partial cds;

FBP 11; Formin

binding protein 11; 
tandem WWP/WW 
domains separated by 
15 amino acid linker 


3.00E-46 


396 


M36164 


Human 

glyceraldehyde-3- 
phosphate 
dehydrogenase 
mRNA, 3' flank. 


1.10E-63 


BHT1UL_12 


Bovine herpesvirus 
type 1 UL22-35 genes; 
UL26;5>GP:BHU3180 
9_2 Bovine herpesvirus
1 maturational 
proteinase (UL26) 
gene, complete cds, and 
scaffold protein 
(UL26;5) gene, 
complete cds 


0.003 


397 


Y09036 


H.sapiens 
NTRK1 gene, 
exon 17. 


7.30E-65 


MMU39060_1 


Mus musculus 
glucocorticoid receptor 
interacting protein 1 
(GRIP1) mRNA,
complete cds; 
Hormone-dependent 
interaction with 
hormone binding 
domains of steroid 
receptors; 


0.0054 
transactivation




398 


U17901 


Rattus norvegicus 
phospholipase A- 
2-activating 
protein (plap) 
mRNA, complete 
cds. 


2.70E-70 


JC4239 


phospholipase A2- 
activating protein - rat 


8.40E-17 


399 


D12646


Mouse kif4 
mRNA for 
microtubule- 
based motor 
protein KIF4, 
complete cds.


1.70E-74 


KIF4_MOUSE


KINESIN-LIKE
PROTEIN
KIF4>PIR2:A54803
microtubule-associated
motor KIF4 -
mouse>GP:MUSKIF4_
1 Mouse kif4 mRNA 
for microtubule-based 
motor protein KIF4, 
complete cds; ATP- 
binding site: base980- 
1037, motor domain: 
base732-1781, alpha-
betical CO 


1.10E-44 


400 


AF007860 


Xenopus laevis 
xl-Mago mRNA, 
complete cds. 


4.60E-75 


AF007862_1


Mus musculus mm- 
Mago mRNA, 
complete cds; Similar 
to Drosophila 
melanogaster Mago 
protein 


6.50E-68 


401 


I45565


Sequence 15 from 
patent US 
5637463. 


2.30E-82 


RNU57391_1


Rattus norvegicus 
FceRI gamma-chain 
interacting protein 
SH2- B (SH2-B) 
mRNA, complete cds; 
Putative FceRI gamma 
ITAM interacting 
protein; SH2 domain- 
containing protein B; 
Method: conceptual 


9.90E-42 
402


U29156 


Mus musculus 
eps15R mRNA,
complete cds. 


1.00E-85


MMU29156_1


Mus musculus eps15R
mRNA, complete cds; 
Involved in signaling 
by the epidermal 
growth factor receptor; 
Method: conceptual 
translation supplied by 
author 


4.90E-62 


403 


U70139 


Mus musculus 
putative CCR4 
protein mRNA, 
partial cds


1.00E-85


MMU70139_1


Mus musculus putative 
CCR4 protein mRNA, 
partial cds; Similar to 
yeast transcription
factor CCR4; 
transcriptional 
readthrough occurs 
with transcription being 
initiated at the IAP and 
continues 


7.20E-66 


404 


U82626 


Rattus norvegicus 
basement 
membrane- 
associated 
chondroitin 
proteoglycan 
Bamacan mRNA, 
complete cds. 


7.60E-96 


RNU82626_1 


Rattus norvegicus 
basement membrane- 
associated chondroitin 
proteoglycan Bamacan 
mRNA, complete cds; 
Chondroitin sulfate 
proteoglycan; CSPG 


8.20E-58 


405 


L09604 


Homo sapiens 
differentiation- 
dependent A4 
protein mRNA, 
complete cds. 


2.00E-35 


<NONE> 


<NONE> 


<NONE> 


406 


AB000516


Homo sapiens 
mRNA for DSIF 
p160, complete
cds 


0.41 


POLG_TUMVQ


GENOME 
POLYPROTEIN 
(CONTAINS: N- 
TERMINAL 
PROTEIN; HELPER 
COMPONENT 
PROTEINASE (EC 
3.4.22.-) (HC-PRO); 
42-50 KD 
PROTEIN; 
CYTOPLASMIC 
INCLUSION 
PROTEIN (CI); 6 
KD PROTEIN; VPG 
PROTEIN; 
NUCLEAR 
INCLUSION 
PROTEIN A (NI-A) 


2.9 


407 


Z94753 


Human DNA 
sequence from 
PAC 465G10 on
chromosome X 
contains Menkes 
disease (ATP7A)
putative Cu++- 
transporting P- 
type ATPase 
exons 22 21 and 
STS 


0.004 


<NONE> 


<NONE> 


<NONE> 


408 


AB011123 


Homo sapiens 
mRNA for 
KIAA0551 
protein, partial 
cds 


0 


MI15_CAEEL 


Q23356 

caenorhabditis 

elegans, 

serine/threonine- 
protein kinase mig- 
15 (ec 2.7.1.-). 11/98 


2.00E-51 


409 


D17218 


Human HepG2 3' 
region MboI
cDNA, clone 
hmd3g02m3 


e-123 


NARG_BACSU 


NITRATE 
REDUCTASE 
ALPHA CHAIN (EC 
1.7.99.4) 


9.9 


410 


M95098 


Bos taurus 
lysozyme gene 
(cow 2), complete 
cds 


1.1 


HAIR_MOUSE


HAIRLESS 
PROTEIN 


8.00E-10 
411


Z60048 


H.sapiens CpG 
DNA, clone 
187a9, reverse
read

cpg187a9.rt1a.


4.00E-54 


HN3B_MOUSE 


HEPATOCYTE 
NUCLEAR 

FACTOR 3-BETA

(HNF-3B) 


4.00E-21 


412 


Z48975 


P.magnus gene 

for nrotein urPAR 

Ivl pi v IV 111 (ill rlLI 


0.014 


YPT2_CAEEL 


HYPOTHETICAL 

£. i .0 rKU 1 cliN 
F37A4.2 IN 
CHROMOSOME III 


2.00E-12 


413 


AJ001296 


Notophthalmus 

viridescens
mRNA for 
cytokeratin 8 


0.37 


YA53_SCHPO 


HYPOTHETICAL 
24.2 KD PROTEIN
C13A11.03 IN
CHROMOSOME I


5.00E-21 


414 


J03831 


Xenopus laevis 
(clone pXEC1.3) 
C protein mRNA, 
complete cds. 


0.37 


PDR5_YEAST 


SUPPRESSOR OF 
TOXICITY OF 
SPORIDESMIN 


3.3 


415 


AB007157 


Homo sapiens 
gene for 

ribosomal protein 
S21, partial cds 


e-142 


RS21_HUMAN 


40S RIBOSOMAL 
PROTEIN S21 


0.002 


416 


X86340 


H.sapiens C7 
gene, exon 13 


3.3 


STC_DROME


SHUTTLE CRAFT

PROTEIN 


H.J 


417 


U12404


Human Csa-19 
mRNA, complete 
cds. 


0 


R10AJMG 


60S RIBOSOMAL 
PROTEIN L10A
(CSA-19) 


9.00E-57 


418 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


8.00E-08 


<NONE> 


<NONE> 


<NONE> 


419 


M80198 


Human FKBP-12 
pseudogene, clone 
lambda-512, 5' 
flank and 
complete cds. 


5.00E-14 


RCO1_NEUCR


TRANSCRIPTIONA 
L REPRESSOR 


0.008 


420 


AF052573 


Homo sapiens 
DNA polymerase 
eta (POLH)
mRNA, complete 
cds 


0 


<NONE> 


<NONE> 


<NONE> 


421 


AF035940 


Homo sapiens 
MAGOH mRNA, 
complete cds


e-131 


MGN_DROME


MAGONASHI 
PROTEIN 


4.00E-39 


422 


AF054994


Homo sapiens
clone 23832
mRNA sequence


0.12 


<NONE> 


<NONE> 


<NONE> 
423


U95098 


Xenopus laevis 
mitotic 

phosphoprotein 

44 mRNA, partial

cds 


6.00E-05 


<NONE> 


<NONE> 


<NONE> 


424 


U95094 


Xenopus laevis 

XL-INCENP 

(XL-INCENP) 

mRNA, complete
cds 


7.00E-07 


<NONE> 


<NONE> 


<NONE> 


425 


D43952 


Mouse gene for 
reticulocalbin, 
exon 1 and
promoter region 


0.36 


<NONE> 


<NONE> 


<NONE> 


426 




C.elegans 
repetitive DNA 
sequence 


0.4 


TCB1_RABIT 


T-CELL 

RECEPTOR BETA 
CHAIN 

PRECURSOR (ANA

id 


0.11


427 


M83314


Tomato 
phenylalanine 
ammonia lyase
(pal) gene, 
complete cds and 
promoter region. 


3.3 


SMB2_HUMAN 


DNA-BINDING 
PROTEIN SMUBP-2 

(GLIAL FACTOR-1)
(GF-1) 


0.65 


428 


AF070636 


Homo sapiens 
clone 24686 
mRNA sequence 


5.00E-23 


<NONE> 


<NONE> 


<NONE> 


429 


<NONE> 


<NONE> 


<NONE> 


IQGA_HUMAN 


RAS GTPASE- 
ACTIVATING-
LIKE PROTEIN
IQGAP1 (P195)


2.00E-06 


430 


AF068627 


Mus musculus 
DNA cytosine-5 

methyltransferase
3B2 (Dnmt3b) 
mRNA, 
alternatively 
spliced, complete 
cds 


5.00E-04 


LOX1_LENCU


LIPOXYGENASE 
(EC 1.13.11.12) 


9.9 


431 


AF020043 


Homo sapiens 
chromosome- 
associated 
polypeptide 


0 


YJH4_YEAST


HYPOTHETICAL 
141.3 KD PROTEIN 
IN SCP160-MRPL8 
INTERGENIC 
REGION 


4.00E-16 


432 


K00046 


ross river virus 
26s subgenomic 
rna and junction 
region. 


0.12 


CUL2_HUMAN 


CULLIN 
HOMOLOG 2 
(CUL-2) 


7.4 
433


AF005664 


Homo sapiens 
properdin (PFC) 
gene, complete 
cds 


0.005 


UL88_HCMVA 


PROTEIN UL88 


5.8 


434 


Z70705 


H. sapiens mRNA 
(fetal brain cDNA 
com5) 


2.00E-05


PH87_YEAST


INORGANIC

PHOSPHATE 

TRANSPORTER 

PHO87


i c 

1 .D 


435 


U29156 


eps15R mRNA,
complete cds. 




EP15_HUMAN


GROWTH FACTOR
RECEPTOR
SUBSTRATE 15

(PROTEIN EPS15)

(AF-1P PROTEIN)


1 .UUt>i J 


436 


AE000750 


Aquifex aeolicus 
section 82 of 109 
of the complete 


0.37 


<NONE> 


<NONE> 


<NONE> 


437 


U49169 


Dictyostelium 
discoideum V- 
ATPase A subunit 
(vatA) mRNA, 
complete cds 


0.12 


VCAP_HSV6U 


MAJOR CAPSID 
PROTEIN (MCP) 


5.6 


438 


AF032871 


Homo sapiens 
uncoupling 
protein 3 (UCP3) 
gene, exon 1 and 
partial exon 2 


0.13 


WEEl_SCHPO 


MITOSIS 
INHIBITOR 
PROTEIN KINASE 
WEE1 (EC 2.7.1.-) 


3.7 


439 


AB000425 


Porcine DNA for 
endopeptidase 
24.16, exon 16 
and complete cds 


4.00E-32 


<NONE> 


<NONE> 


<NONE> 


440 


U51037 


Mus musculus 11- 
zinc-finger 
transcription 
factor 


0.04 


<NONE> 


<NONE> 


<NONE> 


441 


AF032456 


Homo sapiens 
ubiquitin 
conjugating 
enzyme G2 


e-110 


<NONE> 


<NONE> 


<NONE> 


442 


AF009288 


Homo sapiens 
clone HEB8 Cri- 
du-chat region 
mRNA 


2.00E-14 


LMG1_HUMAN 


LAMININ 
GAMMA-1 CHAIN
PRECURSOR 
(LAMININ B2 
CHAIN) 


8.1 
443


AF024578 


Homo sapiens 
type-1 protein 
phosphatase 
skeletal muscle 
glycogen 
targeting subunit 
(PPP1R3) gene,
exon 4, and 
complete cds 


1.1 


<NONE> 


<NONE> 


<NONE> 


444 


M24486 


Human prolyl 4- 
hydroxylase alpha 
subunit mRNA, 
complete cds, 
clone PA-11. 


0 


DACHA 


<NONE> 


4.00E-58 


445 


X96400 


P.tetraurelia 
alpha-51D gene 


0.37 


<NONE> 


<NONE> 


<NONE> 


446 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


447 


X84996 


X.laevis mRNA 

for selenocysteine

tRNA acting 
factor (Staf) 


0.12 


POL_MLVRD


POL

POLYPROTEIN

(PROTEASE (EC
3.4.23.-); REVERSE
TRANSCRIPTASE
(EC 2.7.7.49);
RIBONUCLEASE H
(EC 3.1.26.4)) 


2.00E-08 


448 


AF019980


Dictyostelium 
discoideum ZipA 
(zipA) gene, 
partial cds 


3.4 


HMDL_BRAFL


HOMEOBOX 
PROTEIN DLL 
HOMOLOG 


0.23 


449 


X78424 


D.carota (Queen 
Anne's Lace) 
Inv*Dc2 gene, 
3432bp 


0.38 


<NONE> 


<NONE> 


<NONE> 




<NONE> 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


451 


X89886 


P.patens mRNA 
for 5- 

aminolevulinate 
dehydratase 


1.1 


CKR6_HUMAN 


C-C CHEMOKINE 
RECEPTOR TYPE 6 
(C-C CKR-6) 
(CCR6) 


9.9 


452 


U67471 


Methanococcus 
jannaschii section 
13 of 150 of the 
complete genome 


0.12 


YR72_ECOLI 


HYPOTHETICAL 
53.2 KD PROTEIN 
(ORF2) (RETRON 
EC67) 


5.8 


453 


AF060246 


Mus musculus 
strain C57BL/6 
zinc finger protein 
106 (Zfpl06) 
mRNA, H3a-a 
allele, complete 
cds 


1.00E-62 


YOJ8_CAEEL 


HYPOTHETICAL 
51.6 KD PROTEIN 
ZK353.8 IN 
CHROMOSOME III 


1.7 
454


U70667 


Human Fas-ligand 
associated factor 
1 mRNA, partial 
cds 


0 


YKB2_YEAST 


HYPOTHETICAL 
69.1 KD PROTEIN 
IN PUT3-CCE1 
INTERGENIC 
REGION 


3.00E-09 


455 


M95858 


Bos taurus 
recoverin mRNA, 
complete cds. 


0.35 


GIDA_MYCGE


GLUCOSE 
INHIBITED 
DIVISION 
PROTEIN A 


1.4 


456 


U67594 


Methanococcus 
jannaschii section 
136 of 150 of the
complete genome 


0.36 


<NONE> 


<NONE> 


<NONE> 


457 


X06747 


Human hnRNP 
core protein A1


3.00E-31 


<NONE> 


<NONE> 


<NONE> 


458 


Z65575


H.sapiens CpG 
DNA, clone 47c5, 

reverse read

cpg47c5.rt1a.


1.3 


<NONE> 


<NONE> 


<NONE> 


459 


X88893 


C.jacchus intron 4 
of visual pigment 
gene 


5.00E-15 


<NONE> 


<NONE> 


<NONE> 


460 


M57426 


Maize stripe virus 
RNA3 
nonstructural 
protein 


0.33 


DSC2_MOUSE 


DESMOCOLLIN 
2A/2B 

PRECURSOR 
(EPITHELIAL 
TYPE 2 

DESMOCOLLIN) 


6.5 


461 


X01638 


Yeast TEF1 gene 
for elongation 
factor EF-1 alpha


1.1 


PPOL_DROME


POLY (ADP- 
RIBOSE) 

POLYMERASE (EC 
2.4.2.30) (PARP) 


3.5 


462 


M60064 


S.typhimurium 
glutamate 1- 
semialdehyde 
aminotransferase 
(hemL) gene, 
complete cds. 


1.1 


EPB4_MOUSE 


EPHRIN TYPE-B 
RECEPTOR 4 
PRECURSOR (EC 
2.7.1.112) KINASE
2) (TYROSINE
KINASE MYK-1)


2.5 


463 


X51508 


Rabbit mRNA for 
aminopeptidase N 
(partial) 


0.36 


ACHG_XENLA 


ACETYLCHOLINE 
RECEPTOR 
PROTEIN, GAMMA 
CHAIN 
PRECURSOR 


1.5 


464 


L10106 


Mus musculus 
protein tyrosine 
phosphate 
mRNA, complete 
cds. 


2.00E-58 


VG13_BPML5 


GENE 13 PROTEIN 
(GP13) 


2.5 


465 


M77235


Human cardiac 
tetrodotoxin- 
insensitive 
voltage-dependent 
sodium channel 
alpha subunit 
(HH1) mRNA,
complete cds. 


3.8 


ZPBOC1 


<NONE> 


6.9 


466 


M58330 


C.maltosa 
autonomously 
replicating 
sequence. 


0.004 


EPB4_MOUSE 


EPHRIN TYPE-B 
RECEPTOR 4 
PRECURSOR (EC 
2.7.1.112) KINASE 
2) (TYROSINE 
KINASE MYK-1) 


2.4 


467 


X51508 


Rabbit mRNA for 
aminopeptidase N 
(partial) 


0.35 


ACHG_XENLA 


ACETYLCHOLINE 
RECEPTOR 
PROTEIN, GAMMA 
CHAIN 
PRECURSOR 


2.4 


468 


L10106 


Mus musculus 
protein tyrosine 
phosphate 
mRNA, complete 
cds. 


7.00E-59 


VGLI_PRVRI 


GLYCOPROTEIN 
GP63 PRECURSOR 


4.3 


469 


U65939 


Azotobacter 
vinelandii GTPase 
(ftsA) gene, 
partial cds, and 
ATP binding 
protein (ftsZ) 
gene, complete 
cds 


1.1 


TRUA_BACSP 


Q45557 bacillus sp. 
(strain ksm-64). trna 
pseudouridine 
synthase a (ec 
4.2.1.70) 
(pseudouridylate 
synthase i) 
(pseudouridine
synthase i) (uracil 
hydrolyase). 11/98 


0.001 


470 


U51037 


Mus musculus 11-
zinc-finger 
transcription 
factor 


0.041 


<NONE> 


<NONE> 


<NONE> 


471 


M32685 


Human platelet 
glycoprotein IIIa,
exon 14. 


3.6 


<NONE> 


<NONE> 


<NONE> 


472 


U82691 


Phrynocephalus 
raddei CAS 
1 ly l l\j IN/YUri 
dehydrogenase 
subunit 1 (ND1), 
partial cds, tRNA- 
Gln, tRNA-Ile 
and tRNA-Met, 
NADH 

dehydrogenase 
subunit 2 tRNA- 
Cys and tRNA- 
Tyr and c... 


1.1 


<NONE> 


<NONE> 


<NONE> 


473 


D85430 


Mouse Murr1
mRNA, exon 


0.12 


EPA5_CHICK 


EPHRIN TYPE-A 
RECEPTOR 5 
PRECURSOR (EC 
2.7.1.112) 


2.5 


474 


U20661 


Dictyostelium 
discoideum 
unknown internal 
repeat protein 
gene, complete 
cds, and unknown 
orf1, orf2 and
orf3 genes, partial 
cds 


0.36 


YHL1_EBV


HYPOTHETICAL 
BHLF1 PROTEIN 


4.00E-04 


475 


AJOjj / 


Human novel
homeobox mRNA 
for a DNA 
binding protein 


0.04


FA5_HUMAN


COAGULATION 
FACTOR V 
PRECURSOR 
(ACTIVATED 
PROTEIN C 
COFACTOR) 


9.5 


476 


U32843 


Haemophilus 
influenzae Rd 
section 158 of 163 
of the complete 
genome 


5 


<NONE> 


<NONE> 


<NONE> 


477 


U67554 


Methanococcus 
jannaschii section 
96 of 150 of the 
complete genome 


0.36 


<NONE> 


<NONE> 


<NONE> 


478 


AB004244 


Narke japonica
mRNA for Nj-
synaphin 1b,
complete cds 


1.1 


NIA1_ORYSA


NITRATE 
REDUCTASE 1 (EC 
1.6.6.1) (NR1) 


1.00E-07 


479 


AF075079 


Homo sapiens full
length insert 
cDNA YQ80A08 


1.00E-12 


<NONE> 


<NONE> 


<NONE> 


480 


AE000723 


Aquifex aeolicus 
section 55 of 109 
of the complete 
genome


1 


YKK0_YEAST 


HYPOTHETICAL 
67.5 KD PROTEIN 
IN APE1/LAP4- 
CWP1 

INTERGENIC 
REGION 


9.1 


481 


X73902 


H. sapiens mRNA 
for nicein B2 
chain 


0 


LMG2_HUMAN 


LAMININ 
GAMMA-2 CHAIN 
PRECURSOR 


3.00E-93 


482 


U95094 


Xenopus laevis 
XL-INCENP 

(XL-INCENP)

mRNA, complete 
cds 


3.00E-10 


P53_CRIGR 


CELLULAR 
TUMOR ANTIGEN 


5.7 


483 


AL010240


Plasmodium 

falciparum DNA 
*** 

SEQUENCING 
IN PROGRESS 
*** from contig 
4-64, complete 
sequence 


1.2 


<NONE> 


<NONE> 


<NONE> 


484 


U49919 


Arabidopsis 
thaliana lupeol 
synthase mRNA, 
complete cds 


0.54 


YA53_SCHPO 


HYPOTHETICAL 
24.2 KD PROTEIN 
C13A11.03 IN 
CHROMOSOME I 


6.00E-10 


485 


AF077618 


Homo sapiens 
p73 gene, exon 3 


0.39 


MYOD_MOUSE


MYOBLAST 
DETERMINATION 
PROTEIN 1 


2.1 


486 


AF054994 


Homo sapiens 
clone 23832 
mRNA sequence 


0.13 


<NONE> 


<NONE> 


<NONE> 


487 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein

90 mRNA, 
complete cds 


3.00E-10 


<NONE> 


<NONE> 


<NONE> 


488 


AF068627 


Mus musculus 
DNA cytosine-5 
methyltransferase 
3B2 (Dnmt3b) 
mRNA, 
alternatively 
spliced, complete 
cds 


5.00E-04 


ACE2_YEAST 


METALLOTHIONE 
IN EXPRESSION 
ACTIVATOR 


1.5 


489 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


3.00E-07 


RINI_PIG 


RIBONUCLEASE 
INHIBITOR 


0.19 






WO 99/33982 



PCT/US98/27610 



Table 2 





Nearest Neighbor (BlastN vs. Genbank) 


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


490 


L77886 


Human protein 
tyrosine 
phosphatase 
mRNA, complete 
cds 


1.00E-21 


VS48_TBRVS 


SATELLITE RNA 
48 KD PROTEIN 


1.6 


491 


U95098 


Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 


5.00E-04 


CRP3_LIMPO 


C-REACTIVE 
PROTEIN 3.3 
PRECURSOR 


3.5 


492 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


8.00E-08 


EPA5_CHICK 


EPHRIN TYPE-A 
RECEPTOR 5 
PRECURSOR (EC
2.7.1.112)


2.7 


493 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


3.00E-09 


<NONE> 


<NONE> 


<NONE> 


494 


U28153 


Caenorhabditis 
elegans UNC-76 
(unc-76) gene, 
complete cds. 


0.37 


<NONE> 


<NONE> 


<NONE> 


495 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


0.37 


NCPR_YEAST 


NADPH- 
CYTOCHROME 
P450 REDUCTASE 
(EC 1.6.2.4) (CPR) 


7.00E-05 


496 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


0.013 


YMB3_CAEEL 


PROBABLE 
INTEGRIN ALPHA 
CHAIN F54G8.3 
PRECURSOR 


3.3 


497 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


7.00E-07 


<NONE> 


<NONE> 


<NONE> 


498 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


1.00E-10 


<NONE> 


<NONE> 


<NONE> 


499 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


2.00E-07 


VGLY_LYCVW 


GLYCOPROTEIN
POLYPROTEIN
PRECURSOR
(CONTAINS:
GLYCOPROTEINS
G1 AND G2)


3.2 


500 


U95098 


Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 


8.00E-06 


HR78_DROME


NUCLEAR
HORMONE
RECEPTOR HR78
(DHR78)
(NUCLEAR
RECEPTOR
XR78E/F)


2.5 


501 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


9.00E-10 


MYSH_BOVIN 


MYOSIN I HEAVY 
CHAIN-LIKE 
PROTEIN (MIHC) 
(BRUSH BORDER 
MYOSIN I) (BBMI) 


4.00E-04 


502 


U95094 


Xenopus laevis 
XL-INCENP 

(XL-INCENP)

mRNA, complete 
cds 


2.00E-04 


BAL_HUMAN 


BILE-SALT- 
ACTIVATED 
LIPASE 

PRECURSOR (EC 
3.1.1.3) (EC 
3.1.1.13) (BAL) 
(BILE-SALT- 
STIMULATED 
LIPASE) (BSSL) 
ESTERASE)
(PANCREATIC
LYSOPHOSPHOLIP
ASE)


2.6 


503 


AF080399 


Drosophila 
melanogaster 
mitotic 
checkpoint 
control protein 
kinase BUB1
(Bub1) mRNA,
complete cds 


1.1


NAT1_YEAST


N-TERMINAL 
ACETYLTRANSFE 
RASE 1 (EC 
2.3.1.88) 


2.00E-23 


504 


U59706 


Gallus gallus 
alternatively 
spliced AMPA
glutamate 
receptor, isoform 
GluR2 flop, 
(GluR2) mRNA, 
partial cds. 


0.014 


<NONE> 


<NONE> 


<NONE> 


505 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


2.00E-05 


<NONE> 


<NONE> 


<NONE> 


506 


U95098 


Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 


2.00E-04 


<NONE> 


<NONE> 


<NONE> 


507 


AF100661 


Caenorhabditis
elegans cosmid 
H20E11 


0.38 




<NONE>




508 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA,
complete cds 


3.00E-11 


CA1A_HUMAN 


COLLAGEN 
ALPHA 1(X) 
CHAIN 

PRECURSOR


0.024 


509 


U47322 


Cloning vector 
DNA, complete 
sequence. 


2.00E-38 


COA1_SV40 


COAT PROTEIN 
VP1


6.2 


510 


AF031924 


Homo sapiens 
homeobox 
transcription 
factor barx2 


e-156 


CCMA_HAEIN


HEME EXPORTER 
PROTEIN A 
(CYTOCHROME C- 
TYPE BIOGENESIS 
ATP-BINDING 
PROTEIN CCMA) 


3.5 


511 


AF010484


Homo sapiens ICI 
YAC 9IA12, right 
end sequence 


3.00E-10 


<NONE> 


<NONE> 


<NONE> 


512 


Z63829 


H.sapiens CpG
DNA, clone 90h2, 
forward read 
cpg90h2.ft1a.


5.00E-22


NFIR MP^Al I 
is r i rv_Lvii_,or\ \j 


NUCLEAR

FACTOR 1 CLONE 
PNF1/RED1 (NF-I) 
(CCAAT-BOX 
BINDING 
TRANSCRIPTION 
FACTOR) (CTF) 
(TGGCA-BINDING 
PROTEIN) 




513 


Z35094


H.sapiens mRNA 
for SURF-2


5.00E-97 


SUR2_HUMAN 


SURFEIT LOCUS 
PROTEIN 2 




514 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


7.00E-06 


<NONE> 


<NONE> 


<NONE> 


515 


D38417 


Mouse mRNA for 
arylhydrocarbon 
receptor, 
complete cds 


e-154 


TEGU_EBV 


LARGE 

TEGUMENT 

PROTEIN 


3.4 


516 


L10911 


Homo sapiens 
splicing factor 
(CC1.4) mRNA, 
complete cds. 


e-117 


<NONE> 


<NONE> 


<NONE> 


517


X17093


Human HLA-F
gene for human
leukocyte antigen
F


0.009 


YEN1_SCHPO


O13695

schizosaccharomyces 
pombe (fission 
yeast), hypothetical 
52.9 kd serine-rich 
protein c11g7.01 in
chromosome i. 11/98


5.4 


518 


AB017026


Mus musculus 
mRNA for 
oxysterol-binding 
protein, complete 
cds 


0 


OXYB_HUMAN


OXYSTEROL- 
BINDING PROTEIN 


1.00E-40 


519 


X55038 


Mouse mCENP-B 
gene for 
centromere 
autoantigen B 


0.001 


YNW7_YEAST 


HYPOTHETICAL 
68.8 KD PROTEIN 
IN URE2-SSU72 
INTERGENIC 
REGION


3.00E-04 


520 


AB018323


Homo sapiens 
mRNA for 
KIAA0780
protein, partial
cds


3.00E-41 


LBR_CHICK 


LAMIN B 
RECEPTOR 


2.3 


521 


U95094 


Xenopus laevis
XL-INCENP
(XL-INCENP)
mRNA, complete
cds


1.00E-10 


CA25_HUMAN 


PROCOLLAGEN 
ALPHA 2(V) 
CHAIN 

PRECURSOR


0.002 


522 


X03558 


Human mRNA 
for elongation 
factor 1 alpha
subunit


0 


EF11_HUMAN


ELONGATION 
FACTOR 1-ALPHA
1 (EF-1-ALPHA-1)


e-110 


523 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


3.00E-11 


YMT8_YEAST 


HYPOTHETICAL 
36.4 KD PROTEIN 
IN NUP116-FAR3 
INTERGENIC 
REGION 


8.00E-07 


524 


AB014591 


Homo sapiens 
mRNA for 
KIAA0691 
protein, complete 
cds 


0 


NOT2_YEAST


GENERAL 
NEGATIVE 
REGULATOR OF 
TRANSCRIPTION 
SUBUNIT 2 


8.00E-05 


525 


AB019488 


Homo sapiens 
DNA for TRKA, 
exon 17 and 
complete cds 


0 


TRKA_HUMAN 


HIGH AFFINITY
NERVE GROWTH
FACTOR
RECEPTOR
PRECURSOR
PROTEIN) (P140-
TRKA)


2.00E-27 


526 


U95102


Xenopus laevis
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


c nnp i ^ 

j.UUc- 1 j 


CNG4_BOVIN


240K PROTEIN OF
ROD
PHOTORECEPTOR
CNG-CHANNEL
CYCLIC-
NUCLEOTIDE-
GATED CATION
CHANNEL 4 (CNG
CHANNEL 4)
MODULATORY
SUBUNIT))




527 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


2.00E-06 


HMZ1_DROME


ZERKNUELLT 
PROTEIN 1 (ZEN-1)


0.88 


528 


J03750 


Mouse single 
stranded DNA 
binding protein p9 
mRNA, complete 
cds. 


e-135 


P15_HUMAN 


ACTIVATED RNA 
POLYMERASE II 
TRANSCRIPTIONA 
L COACTIVATOR 
P15(PC4) (P14) 


3.00E-21 


529 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete
cds


1.00E-12 


RS5_DROME 


40S RIBOSOMAL 
PROTEIN S5 


0.42 


530 


Z57610 


H.sapiens CpG 
DNA, clone 
187a10, reverse
read 


8.00E-61 


HN3B_MOUSE 


HEPATOCYTE 
NUCLEAR 
FACTOR 3-BETA 
(HNF-3B) 


4.00E-15 


531 


U95760 


Drosophila 
melanogaster 
strawberry notch 
(sno) mRNA,

complete cds 


3.00E-60 


<NONE> 


<NONE> 


<NONE> 


532 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


4.00E-11 


<NONE> 


<NONE> 


<NONE> 


533 


U50535 


Human BRCA2 
region, mRNA 
sequence CG006 


4.00E-12 


ALU1_HUMAN 


!!!! ALU 
SUBFAMILY J 
WARNING ENTRY 
!!!! 


1.1 


534 


X92841 


H.sapiens MICA 
gene 


1.00E-55 


LIN1_HUMAN 


LINE-1 REVERSE 
TRANSCRIPTASE 
HOMOLOG 


6.00E-09 


535 


U60337 


Homo sapiens 
beta-mannosidase 
mRNA, complete 
cds 


0 


NODC_BRAEL 


N- 

ACETYLGLUCOSA 
MINYLTRANSFER 
ASE (EC 2.4.1.-) 


1.4 


536 


M21731 


Human lipocortin-
V mRNA,
complete cds.


e-169 


ANX5_HUMAN 


ANNEXIN V 

(ENDONEXIN II) 
(CALPHOBINDIN 
I) (CBP-I) 

ANTICOAGULANT 
PROTEIN I) (PAP-I) 
ANTICOAGULANT 
-ALPHA) (VAC- 
ALPHA) 

(ANCHORIN CII) 


1.00E-05 


537 


Y08013 


S.salar DNA 
segment 
containing GT 
repeat 


0.006 


<NONE> 


<NONE> 


<NONE> 


538 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


539 




Mus musculus

protein encoding 
twelve zinc finger 
proteins (pMLZ- 
4) mRNA, 
complete cds. 




DYNA_CHICK


DYNACTIN 117
KD ISOFORM


7.4


540 


U95102 


Xenopus laevis
mitotic

phosphoprotein
90 mRNA, 
complete cds 


6.00E-05 


HXA3_HAEIN 


HEME:HEMOPEXI 
N-BINDING 
PROTEIN 
PRECURSOR 


2.6 


541 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


1.00E-13 


AMO_KLEAE 


AMINE OXIDASE 
PRECURSOR (EC
1.4.3.6) 

(MONAMINE 
OXIDASE) 
(TYRAMINE 
OXIDASE) 


1.5 


542 


AF083322 


Homo sapiens 
centriole 

associated protein 
CEP110 mRNA,
complete cds 


e-133 


CA34_HUMAN 


PROCOLLAGEN 
ALPHA 3(IV) 
CHAIN 
PRECURSOR 


1.5 


543 


J03746 


Human 
glutathione S- 
transferase 
mRNA, complete 
cds. 


e-170 


GTMI_HUMAN 


GLUTATHIONE S-
TRANSFERASE, 
MICROSOMAL (EC 
2.5.1.18) 


5.00E-39 


544 


U67522 


Methanococcus 
jannaschii section 
64 of 150 of the 
complete genome 


0.37 


A1AA_HUMAN 


ALPHA-1A
ADRENERGIC
RECEPTOR


4.3 


545 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


2.00E-07 


<NONE> 


<NONE> 


<NONE> 


546 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


547 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


548 


D87001


Human (lambda) 
DNA for 
immunoglobulin 
light chain 


0.35 


VAL3_TYLCU 


AL3 PROTEIN (C3 
PROTEIN) 


3.2 


549 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


3.00E-08 


TEGU_HSV11 


LARGE 
TEGUMENT 
PROTEIN (VIRION 
PROTEIN UL36) 


0.004 


550 


D16991 


Human HepG2 
partial cDNA, 
clone 

hmd2d01m5 


8.00E-09 


PTM1_YEAST 


PROTEIN PTM1 
PRECURSOR 


0.033 


551 


M34025 


Human fetal Ig 
heavy chain 
variable region 


3.2 


<NONE> 


<NONE> 


<NONE> 


552 


M98502 


Mus musculus 
protein encoding 
twelve zinc finger 
proteins (pMLZ- 
4) mRNA, 
complete cds. 


5.00E-14 


<NONE> 


<NONE> 


<NONE> 


553 


U95098 


Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 


0.002 


<NONE> 


<NONE> 


<NONE> 


554 


Z78730 


H.sapiens flow- 
sorted 

chromosome 6 
Hindlll fragment, 
SC6pA15C3 


3.00E-20 


ALU1_HUMAN


!!!! ALU 
SUBFAMILY J 
WARNING ENTRY 
!!!!


5.00E-06 


555 


U74496 


Human 

chromosome 4q35 

subtelomeric 

sequence 


8.00E-08 


ICP4_VZVD 


TRANS-ACTING 
TRANSCRIPTIONA 
L PROTEIN ICP4 


0.39 


556 


U39875 


Rattus norvegicus 
EF-hand Ca2+- 
binding protein
p22 mRNA, 
complete cds. 


2.00E-56 


YHFK_ECOLI 


HYPOTHETICAL 
79.5 KD PROTEIN 

INTERGENIC 
REGION (0696) 


9.8 


557 


U65416 


Human MHC 
class I molecule 
(MICB) gene,
complete cds 


0.12 


<NONE> 


<NONE> 


<NONE> 


558 


AG000037 


Homo sapiens 
genomic DNA, 
21q region, clone:

QUI 1 A99 


5.00E-25 


<NONE> 


<NONE> 


<NONE> 


559 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


5.00E-05 


<NONE> 


<NONE> 


<NONE> 


560 


AB007918


Homo sapiens
mRNA for 
KIAA0449 
protein, partial 
cds 


u.U 1 J 


VGLE_HSV11


GLYCOPROTEIN E 
PRECURSOR 


2.2 


561 


U58884 


Mus musculus 
SH3-containing 
protein SH3P7 
mRNA, complete 
cds similar to 
Human Drebrin 


1.00E-73 


YCV2_YEAST 


HYPOTHETICAL 
13.8 KD PROTEIN 
IN PWP2-SUP61 
INTERGENIC 


2.6 


562 


AB007878 


Homo sapiens 
KIAA0418 
mRNA, complete 
cds 


e-110 


GLU2_MAIZE 


GLUTELIN 2
PRECURSOR
(ZEIN-GAMMA)
(27 KD ZEIN)


0.72 


563 


AF065482 


Homo sapiens 
sorting nexin 2 
(SNX2) mRNA, 
complete cds 


0 


YJD6_YEAST


HYPOTHETICAL 

4Q 0 PROTFTM 
H7.U iv lv i rvvy 1 E/I IN 

IN NSP1-KAR2 

INTERGENIC 

REGION 


1.4 


564 


U27873 


Stealth virus 1 
clone 3B 11 T7 


0.002 


SYN1_HUMAN 


SYNAPSINS IA 
AND IB (BRAIN 
PROTEIN 4.1) 


1.6 


565 


L38951 


Homo sapiens 
importin beta 
subunit mRNA, 
complete cds 


2.00E-68 


VP2_BRD 


STRUCTURAL 
CORE PROTEIN 
VP2 


1.1 


566 


AF007155 


Homo sapiens 
clone 23763 
unknown mRNA, 
partial cds 


e-165 


YOHIjVZOVI 


HYPOTHETICAL 
33.2 KD PROTEIN 
IN IBPB 5'REGION 


7.5 





567 


Z56295 


H.sapiens CpG 
DNA, clone 10c2, 
forward read 
cpg10c2.ft1a.


0.12 


A1AB_CANFA


ALPHA-1B
ADRENERGIC 
RECEPTOR 
(FRAGMENT) 


0.85 


568 


Z83792 


G.gallus 
microsatellite 
DNA (LEI0222 


0.12 


<NONE> 


<NONE> 


<NONE> 


569 


U11820 


Feline 

immunodeficienc 
y virus 

USIL2489_7B
gag polyprotein 
(gag) gene, 
complete cds, 
polymerase 
polyprotein (pol) 
gene, partial cds, 
vif protein (vif), 
complete cds, and 
envelope 
glycoprotein 
(env), complete 
cds, complete g... 


1.1 


<NONE> 


<NONE> 


<NONE> 


570 


M18065


Mouse 18S and
28S ribosomal
DNA, 5'
hypervariable
(Vr) region, clone
M1.


6.00E-04 


CC40_YEAST 


CELL DIVISION 
CONTROL 
PROTEIN 40 


3.7 


571 


/vl UJjUtJ 


Homo sapiens

cellular apoptosis 
susceptibility 
protein (CSE1) 
gene, exons 3 
through 10 


o hop n7 




HYPOTHETICAL
25.8 KD PROTEIN
K02D10.4 IN
CHROMOSOME III 


4.3 


572 


X04588 


Human 2.5 kb 
mRNA for 
cytoskeletal 
tropomyosin 
TM30(nm) 


0 


<NONE> 


<NONE> 


<NONE> 


573 


AC001159


Homo sapiens 
(subclone 1 h9 
from PAC H92) 
DNA sequence 


5.00E-04 


XYND_CELFI


ENDO-1,4-BETA-
XYLANASE D
PRECURSOR (EC 
3.2.1.8) 


7.3 


574 


Z60625 


H.sapiens CpG 
DNA, clone 2c10,
forward read 
cpg2c10.ft1aa.


4.00E-13 


<NONE> 


<NONE> 


<NONE> 


575 


AF070640 


Homo sapiens
clone 24781
mRNA sequence


e-164


<NONE>


<NONE>


<NONE>


576


Y11306


Homo sapiens 
mRNA for hTCF- 
4 


2.00E-48 


TCF1_HUMAN


T-CELL-SPECIFIC 
TRANSCRIPTION 
FACTOR 1 (TCF-1) 


2.00E-15 


577 


X65279 


pWE15 cosmid 
vector DNA 


7.00E-69 


OCLN_POTTR 


Q28793 potorous 
tridactylus (potoroo). 
occludin. 11/98


0.71 


578 


M10296


Mouse DNA with 
homology to EBV 
IR3 repeat, 
segment 1, clone 
Mu2. 


0.001 


LMB1_HYDAT 


LAMININ BETA-1 
CHAIN 
PRECURSOR 
(FRAGMENTS) 


1.9 


579 


X53744 


Canine mRNA for 
68kDA subunit of 
signal recognition 
particle (SRP68) 


e-162 


SR68_CANFA 


SIGNAL 
RECOGNITION 
PARTICLE 68 KD 
PROTEIN (SRP68) 


5.00E-16 


580 


AF086438 


Homo sapiens full 
length insert 
cDNA clone 
ZD80G11 


2.00E-04 


<NONE> 


<NONE> 


<NONE> 


581 


U15140


Mycobacterium
bovis ribosomal 
proteins IF-1 
complete cds, and 
S4 (rpsD) gene, 
partial cds 


i .j 


<NONE>


<NONE> 


<NONE> 


582 


D13292


Human mRNA 
for ryudocan core
protein 


e-166 


RSP4_ARATH 


40S RIBOSOMAL 

PROTEIN SA (P40)
(LAMININ 
RECEPTOR 
HOMOLOG) 


1.4 


583 


S71022 


neoplasm-related
C140 product
[human, thyroid
carcinoma cells,
mRNA, 670 nt]




RL6_HUMAN


PROTEIN L6 (TAX- 
RESPONSIVE 
ENHANCER 
ELEMENT 
BINDING PROTEIN 
107) (TAXREB107) 


!>.0 


584 


L20934 


Anopheles 
gambiae complete 
mitochondrial 
genome 


0.014 


<NONE> 


<NONE> 


<NONE> 


585 


Z49269 


H. sapiens gene 
for chemokine 
HCC-1. 


1.1 


AMY1_DICTH 


ALPHA-AMYLASE 
1 (EC 3.2.1.1) (1,4- 
ALPHA-D- 
GLUCAN 

GLUCANOHYDRO 
LASE) 


2.5 


586 


U95098


Xenopus laevis
mitotic

phosphoprotein 
44 mRNA, partial 
cds 


9.00E-04








587 




Homo sapiens
beta-1,3-N-
acetylglucosamin
yltransferase
mRNA, complete


ft 1 T 


ucurj pi/-; 


ri tM Ur c A i N 

PRECURSOR 
(HYALURONIDAS 
E) (EC 3.2.1.35) 


1 < 

J.J 


588 


J05109 


T.thermophila 
calcium-binding 
25 kDa (TCBP 
25) protein gene, 
complete cds. 


0.014 


<NONE> 


<NONE> 


<NONE> 


589 


U95098 


Xenopus laevis 
mitotic

phosphoprotein 
44 mRNA, partial 
cds 


6.00E-04 


<NONE> 


<NONE> 


<NONE> 


590 


AF060246 


Mus musculus 
strain C57BL/6 
zinc finger protein
106 (Zfp106)
mRNA, H3a-a
allele, complete 
cds 


1.00E-83 


SCRB_PEDPE 


SUCROSE-6- 
PHOSPHATE 
HYDROLASE (EC 
3.2.1.26)
(SUCRASE) 


10 


591 


Y11966 


B.aphidicola (host 
T.suberi) plasmid 
pBTsl genes 
leuA, hspA, 
repA2, repAl, 
leuB leuC leuD 
leuA 


0.37 


<NONE> 


<NONE> 


<NONE> 


592 


U20428 


Human SNC19
mRNA sequence 


1.00E-64 


YY22_MYCTU


HYPOTHETICAL 
30.8 KD PROTEIN 
CY49.22 


0.29 


593 


AF043084 


Lycopersicon 
esculentum 
ethylene receptor 
homolog (ETR1) 
mRNA, complete 
cds 


0.37 


KNIR_DROME 


ZYGOTIC GAP 
PROTEIN KNIRPS 


9.9 


594 


X65279 


pWE15 cosmid 
vector DNA 


5.00E-66 


COA1_SV40 


COAT PROTEIN 
VP1 


0.001 


595 


U95098 


Xenopus laevis
mitotic 

phosphoprotein 
44 mRNA, partial
cds


0.041


UL88_HSV7J


PROTEIN U59


5.8


596 


M91452 


Sus scrofa 
ryanodine 
receptor (RYR1) 
gene, complete 
cds. 


3.2 


<NONE> 


<NONE> 


<NONE> 


597 


U77327 


Human Ki-1/57
intracellular
antigen mRNA,
partial cds


e-158 


GAT1_CHICK 


ERYTHROID
TRANSCRIPTION
FACTOR (GATA-1)
(ERYF1)


1.2 


598 


U77327 


Human Ki-1/57
intracellular
antigen mRNA,
partial cds


0 


RPB7_ARATH


DNA-DIRECTED
RNA
POLYMERASE II
19 KD
POLYPEPTIDE (EC
2.7.7.6) (RNA
POLYMERASE II
SUBUNIT 5)


6.2 


599 


Y16964


Saccharomyces 
sp. mitochondrial 
DNA for OLI1
gene strain ClOl


0.37 


NMD5_YEAST


NONSENSE- 
MEDIATED MRNA 
DECAY PROTEIN 5 


1.9 


600 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


6.00E-06 


<NONE> 


<NONE> 


<NONE> 


601 


U95098 


Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 


8.00E-08 


<NONE> 


<NONE> 


<NONE> 


602 


AF091046 


Brugia pahangi 
nuclear hormone 
receptor (bhr-1) 
gene, partial cds 


1.1 


INVO_PONPY 


INVOLUCRIN 


0.23 


603 


M87339 


Human 

replication factor 
C, 37-kDa subunit
mRNA, complete 
cds 


0 


AC12_HUMAN 


ACTIVATOR 1 37 
KD SUBUNIT 
(REPLICATION 
FACTOR C 37 KD 
SUBUNIT) (A1 37
KD SUBUNIT) (RF- 
C 37 KD SUBUNIT) 
(RFC37) 


1.00E-38


604 


D28116 


Human genes for 
collagen type IV 
alpha 5 and 6, 
exon 1 and exon 


0.39 


<NONE> 


<NONE> 


<NONE> 


605 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein

90 mRNA, 
complete cds 


2.00E-06 


<NONE> 


<NONE> 


<NONE> 


606 


AE001149 


Borrelia 
burgdorferi 

^bCCLIUIl J J Ul f\Jj 

of the complete 
genome 


0.13 


<NONE> 


<NONE> 


<NONE> 


607 


X14168 


Human pLC46 
with DNA 
replication origin 


6.00E-16 


Z136_HUMAN 


ZINC FINGER 
PROTEIN 136 


0.31 


608 


Z57610 


H. sapiens CpG 
DNA, clone 
187a10, reverse
read 

cpg187a10.rt1a.


7.00E-90 


HN3B_RAT 


HEPATOCYTE 
NUCLEAR 
FACTOR 3-BETA 
(HNF-3B) 


1.00E-19 


609 


U95098


Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 


0.04j 


PGCV_MOUSE


VERSICAN CORE
PROTEIN
PRECURSOR
(LARGE
FIBROBLAST
PROTEOGLYCAN)
(CHONDROITIN
SULFATE
PROTEOGLYCAN
CORE PROTEIN 2)
(PG-M)


J.J 


610 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


7.00E-07 


CA11_CHICK 


PROCOLLAGEN 
ALPHA 1(I) CHAIN
PRECURSOR 


0.4 


611 


AB007956 


Homo sapiens 
mRNA, 
chromosome 1 
specific transcript 
KIAA0487 


e-106 


RRPB_CVMA5 


RNA-DIRECTED 
RNA 

POLYMERASE (EC 
2.7.7.48) (ORF1B) 


9.7 


612 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


0.005 


<NONE> 


<NONE> 


<NONE> 


613 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


6.00E-05 


UL52_EBV 


HELICASE/PRIMA 
SE COMPLEX 
PROTEIN 
(PROBABLE DNA 
REPLICATION 
PROTEIN BSLF1) 


5.9 


614 


U95760 


Drosophila 
melanogaster 
strawberry notch 
(sno) mRNA, 
complete cds 


3.00E-71 


POLG_PVYHU 


GENOME 
POLYPROTEIN 
(CONTAINS: N- 
TERMINAL 
PROTEIN; HELPER 
COMPONENT 
PROTEINASE (EC 
3.4.22.-) (HC-PRO); 
42-50 KD 
PROTEIN; 
CYTOPLASMIC 
INCLUSION 
PROTEIN (CI); 6 
KD PROTEIN; 
NUCLEAR 
INCLUSION 
PROTEIN A (NI-A)
(EC 3.4.22.-) (49K 
PROTEINASE) (49 


4.3 


615 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


9.00E-09 


VP3_ROTPC 


INNER CORE 
PROTEIN VP3 


7.7 


616


J05499 


Rattus norvegicus 
L-glutamine 
amidohydrolase 
mRNA, complete 
cds 


e-143 


GLSL_RAT 


GLUTAMINASE, 
LIVER ISOFORM 
PRECURSOR (EC 
3.5.1.2) (GLS) 


7.00E-67 


617 


M19262


Rat clathrin light 
chain (LCB3) 
mRNA, complete 
cds. 


0.37 


Y642_METJA 


HYPOTHETICAL 
PROTEIN MJ0642 


5.8 


618 


M21191 


Human aldolase 
pseudogene 
mRNA, complete 
cds. 


1.00E-32 


LIN1_NYCCO


LINE-1 REVERSE 
TRANSCRIPTASE 
HOMOLOG 


6.00E-17 


619 


U95094


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


1.00E-11 


NUCM_BOVIN 


NADH- 
UBIQUINONE 
OXIDOREDUCTAS 
E 49 KD SUBUNIT
(EC 1.6.5.3) (EC 
1.6.99.3) 
(COMPLEX I-
49KD) (CI-49KD) 


0.044 


620 


U95098 


Xenopus laevis 
mitotic 

phosphoprotein
44 mRNA, partial
cds


0.005 


HEMZ_RHOCA 


FERROCHELATAS
E (EC 4.99.1.1)
(PROTOHEME
FERRO-LYASE)


4.4 


621 


AF041428 


Homo sapiens 
ribosomal protein 
s4 X isoform
gene, complete
cds


0.002 


<NONE> 


<NONE> 


<NONE> 


622 


X07158 


Chironomus 
thummi DNA for 
Cia repetitive 
element 


0.13 


<NONE> 


<NONE> 


<NONE> 


623 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


5.00E-04


<NONE>


<NONE>


<NONE>


624 


AF100470


Rattus norvegicus 
ribosome attached 
membrane protein 
4 (RAMP4) 
mRNA, complete 
cds 


1.00E-53


<NONE> 


<NONE> 


<NONE> 


625 


U85193 


Human nuclear 
factor I-B2 
(NFIB2) mRNA, 
complete cds 


2.00E-38 


<NONE> 


<NONE> 


<NONE> 


626 


M13452


Human lamin A
mRNA, 3'end. 


6.00E-16 


<NONE> 


<NONE> 


<NONE> 


627 


U95094


Xenopus laevis

XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


0.014


ACDV_RAT


ACYL-COA 
DEHYDROGENAS 
E, VERY-LONG- 
CHAIN SPECIFIC 
PRECURSOR (EC 
1.3.99.-) (VLCAD) 


4.00E-20


628 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


3.00E-10 


<NONE> 


<NONE> 


<NONE> 


629 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


630 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


2.00E-05 


<NONE> 


<NONE> 


<NONE> 
631


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA,
complete cds


6.00E-05 


<NONE> 


<NONE> 


<NONE> 


632 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


6.00E-05 


YS83_CAEEL 


HYPOTHETICAL 
86.9 KD PROTEIN 
ZK945.3 IN 
CHROMOSOME II 


0.65 


633 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


3.00E-09 


NRP_MOUSE 


NEUROPILIN 
PRECURSOR (A5 
PROTEIN) 


2.7 


634 


U95098 


Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 


2.00E-05 


Y4JN_RHISN 


HYPOTHETICAL 
16.3 KD PROTEIN 
Y4JN 


5.9 


635 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds


6.00E-05 


<NONE> 


<NONE> 


<NONE> 


636 


X64707 


H.sapiens BBC1 
mRNA 


e-179 


RL13_HUMAN 


60S RIBOSOMAL
PROTEIN L13 
(BREAST BASIC 
CONSERVED 
PROTEIN 1) 


5.00E-40 


637 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


3.00E-08 


<NONE> 


<NONE> 


<NONE> 


638 


X14168 


Human pLC46 
with DNA 
replication origin 


5.00E-14 


SP3_HUMAN 


TRANSCRIPTION 
FACTOR SP3 (SPR- 
2) (FRAGMENT) 


0.19 


639 


X90999 


H.sapiens mRNA 
for Glyoxalase II


9.00E-20 


GLO2_HUMAN


HYDROXYACYLG 
LUTATHIONE 
HYDROLASE (EC 
3.1.2.6) 


0.007 


640 


AF083322 


Homo sapiens 
centriole 

associated protein 
CEP110 mRNA,
complete cds 


9.00E-51 


KIF4_MOUSE 


KINESIN-LIKE 
PROTEIN KIF4 


0.005 


641 


Z12002


M.musculus Pvt-1
mRNA.


0.36 


CP5F_CANTR 


CYTOCHROME
P450 LIIA6
(ALKANE-
INDUCIBLE) (EC
1.14.14.1) (P450-
ALK3)


5.6 


642 


M10206


R.sphaeroides 
reaction center L 
subunit (complete
cds) and M
subunit (5' end)
genes.


1.1 


YGR1_YEAST 


HYPOTHETICAL 
34.8 KD PROTEIN 

IN SUT1-RCK1
INTERGENIC
REGION


0.006 


643 


K02668


E.coli ddl gene
encoding D-
alanine:D-alanine
ligase and ftsQ
and ftsA genes,
complete cds, and
ftsZ gene, 5' end.




ANKB_HUMAN


VARIANT 1
(ANKYRIN B)
(ANKYRIN,
NONERYTHROID)


7.00E-07


644 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


645 


X53616 


C.domesticus 
calnexin (pp90) 
mRNA 


1.1 


<NONE> 


<NONE> 


<NONE> 


646 


X57010 


Human COL2A1 
gene for collagen 
II alpha 1 chain, 
exons E2-E15 


3.3 


PRIO_PIG 


MAJOR PRION 
PROTEIN 

PRECURSOR (PRP)


1.9 


647 


U95097 


Xenopus laevis 
mitotic 

phosphoprotein 
43 mRNA, partial 
cds 


1.1 


UL07_HSV2H 


PROTEIN UL7 


7.3 


648 


X52956 


Human CAMII- 
psi3 calmodulin 
retropseudogene 


0.37 


PRTP_EBV 


PROBABLE 
PROCESSING AND 
TRANSPORT 
PROTEIN 


7.5 


649 




Human protein
tyrosine
phosphatase
(PTP-PEST)
mRNA, complete
cds.


A 
U 


PTNC_HUMAN


PROTEIN
TYROSINE
PHOSPHATASE G1
(EC 3.1.3.48) 
(PTPG1) 


e- i\j / 


650 


L47615 


Mus musculus 
DNA-binding 
protein (Fli-1) 
gene, 5' end of 
cds. 


0.13 


YA53_SCHPO 


HYPOTHETICAL 
24.2 KD PROTEIN 
C13A11.03 IN
CHROMOSOME I 


2.00E-07 


651 


U60337 


Homo sapiens 
beta-mannosidase 
mRNA, complete
cds


0


GIL1_ENTHI


GALACTOSE-
INHIBITABLE
LECTIN 170 KD
SUBUNIT


0.22


652 


U08813


Oryctolagus 

cuniculus 

Na+/glucose 

cotransporter- 

related protein 

mRNA, complete 

cds. 


1.00E-22


IN AO l_riUIV] AIN 


E 

COTRANSPORTER 
1 (NA(+)/GLUCOSE 
COTRANSPORTER 
1) (HIGH
AFFINITY 
SODIUM- 
GLUCOSE 
COTRANSPORTER 
) 


0 1 


653 


Y00282 


Human mRNA 
for ribophorin II 


2.00E-78 


RIB2_HUMAN 


DOLICHYL-
DIPHOSPHOOLIG
OSACCHARIDE-
PROTEIN
GLYCOSYLTRANS
FERASE 63 KD
SUBUNIT
PRECURSOR (EC
2.4.1.119)
(RIBOPHORIN II)


5.00E-19 


654 


D10051 


Human gene for 
92-kDa type IV 
collagenase, 5'-
flanking region


0.014 


TAGB_DICDI


PRESTALK- 
SPECIFIC 
PROTEIN TAGB 
PRECURSOR (EC 
3.4.21.-) 


7.6 


655 


M29930 


Human insulin 
receptor (allele 2) 
gene, exons 14, 
15, 16 and 17


8.00E-08 


<NONE> 


<NONE> 


<NONE> 


656 


U78310 


Homo sapiens 
pescadillo 
mRNA, complete 
cds 


0 


YG2S_YEAST 


HYPOTHETICAL 
69.9 KD PROTEIN 
IN MIC1-SRB5 
INTERGENIC
REGION 


0.002 


657 


X68792 


S.coelicolor 
A3(2) promoter 
sequence pth270 


3.2 


YBS0_YEAST 


HYPOTHETICAL 
27.0 KD PROTEIN 
IN VAL1-HSP26 
INTERGENIC 
REGION 


0.073 


658 


U50535 


Human BRCA2 
region, mRNA 
sequence CG006 


4.00E-12 


ALU1_HUMAN 


!!!! ALU 
SUBFAMILY J 
WARNING ENTRY 
!!!! 


1.2 


659 


U15522 


Sus scrofa clone 
pvglalgheavy 
chain variable 
v Lfj region 
mRNA, partial 
cds. 


3.2 


Z165_HUMAN 


ZINC FINGER 
PROTEIN 165 


3.2 


660 


M20918 


C.thummi piger 
haemoglobin (Hb) 
gene DNA, 
complete cds. 


0.12 


YT25_CAEEL 


HYPOTHETICAL 
59.9 KD PROTEIN 
B0304.5 IN 
CHROMOSOME II 


0.033 


661 


U60337 


Homo sapiens 
beta-mannosidase 
mRNA, complete 
cds 


0 


<NONE> 


<NONE> 


<NONE> 


662 


U95098 


Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 


0.001 


ENV_MLVFP 


ENV 

POLYPROTEIN 
PRECURSOR 
(CONTAINS: 
KNOB PROTEIN 
GP70; SPIKE 

PROTEIN P15E; R 

PROTEIN) 


3.3 


663 


M97287 


Human 

MAR/SAR DNA 
binding protein 

(SATB1) mRNA, 

complete cds. > :: 
gb|I58691|I58691 
Sequence 1 from 
patent US 
5652340 


0 


SAT1_HUMAN 


DNA-BINDING 
PROTEIN SATB1 
(SPECIAL AT-RICH 

BINDING PROTEIN 
1) 


2.00E-20 


664 


L42612 


Homo sapiens 
keratin 6 isoform 
K6f (KRT6F) 
mRNA, complete 
cds 


e-100 


von a Rrw/ixi 

KxL,4_dU V IfN 


CYTOSKELETAL 
59 KD, 

COMPONENT IV 


A ftflF-10 

*T.V/UI_* 1 V 


665 


U17901 


Rattus norvegicus 
phospholipase A- 
2-activating 
protein (plap) 
mRNA, complete 
cds. 


e-152 


PLAP_MOUSE 


PHOSPHOLIPASE 
A-2-ACTIVATING 
PROTEIN (PLAP) 


4.00E-13 


666 


M73047 


Homo sapiens 
tripeptidyl 
peptidase II 
mRNA, complete 
cds. 


0 


MERT_STRLI 


MERCURIC 

TRANSPORT 

PROTEIN 

(MERCURY ION 

TRANSPORT 

PROTEIN) 


4.4 


667 


U09954 


Human ribosomal 
protein L9 gene, 
5' region and 
complete cds. 


0 


RL9_HUMAN 


60S RIBOSOMAL 
PROTEIN L9 


2.00E-11 


668 


X98330 


H.sapiens mRNA 
for ryanodine 
receptor 2 


1.1 


HS74_MOUSE 


HEAT SHOCK 70 
KD PROTEIN AGP- 
2 


0.034 


669 


U95094 


Xenopus laevis 

XL-INCENP 

(XL-INCENP) 
mRNA, complete 
cds 


0.002 


RPC2_DROME 


DNA-DIRECTED 

RNA 

POLYMERASE III 
128 KD 

POLYPEPTIDE 


1.1 


670 


AF069250 


Homo sapiens 

okadaic acid- 

inducible 

phosphoprotein 

(OA48-18) 

mRNA, complete 

cds 


7.00E-80 


LEGB_PEA 


LEGUMIN B 
(FRAGMENT) 


0.011 


671 


Z71419 


S.cerevisiae 
chromosome XIV 
reading frame 
ORF YNL143C 


1.1 


FOCD_ECOLI 


OUTER 

MEMBRANE 

USHER PROTEIN 

FOCD 

PRECURSOR 




672 


AF044965 


Homo sapiens 
polio virus related 
protein 2 gene, 
alpha isoform, 
exon 6 and partial 
cds 


e-167 


PVR_MOUSE 


POLIOVIRUS 

RECEPTOR 

HOMOLOG 


1.00E-12 


673 


X65319 


Cloning vector 
pCAT-Enhancer 


2.00E-80 


S106_HUMAN 


CALCYCLIN 

(PROLACTIN 

RECEPTOR 
ASSOCIATED 
PROTEIN) 
CALCIUM- 
BINDING PROTEIN 
A6) 


3.00E-15 


674 




Pia mRNA for 
UMP-CMP 
kinase, complete 
cds 


e-103 


V319_ASFB7 


J319 PROTEIN 


4.3 


675 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


8.00E-08 


VEGR_RAT 


VASCULAR 
ENDOTHELIAL 
GROWTH FACTOR 
RECEPTOR 1 
PRECURSOR 
RECEPTOR FLT) 
(FLT-1) 


3.3 


676 


D90217 


S. cerevisiae gene 
for YmL33, 
mitochondrial 
ribosomal 
proteins of large 
subunit 


2.00E-07 


MALY_ECOLI 


MALY PROTEIN 
(EC 2.6.1.-) 


5.6 


677 


AF038952 


Homo sapiens 
cofactor A protein 
mRNA, complete 
cds 


e-160 


TlCA_MOUSE 


TCP1- 

CHAPERONIN 
COFACTOR A 


4.00E-19 


678 


Z96950 


Gorilla gorilla 
DNA sequence 
orthologous to the 
human Xp:Yp 
telomere-junction 
region 


j.uUb- 14 


YHBZ_ECOLI 


HYPOTHETICAL 
43.3 KD GTP- 
BINDING PROTEIN 
IN DACB-RPMA 
INTERGENIC 




679 


D50418 


Mouse mRNA for 
AREC3, partial 
cds 


2.00E-79 


CYGX_RAT 


OLFACTORY 
GUANYLYL 
CYCLASE GC-D 
PRECURSOR (EC 
4.6.1.2) 


1.1 


680 


U95098 


Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 


8.00E-08 


P2C2_SCHPO 


PROTEIN 

PHOSPHATASE 2C 
(EC 3.1.3.16) 


1.00E-04 


681 


AL010280 


Plasmodium 
falciparum DNA 
*** 

IN PROGRESS 
*** from contig 
4-106, complete 
sequence 


0.12 


<NONE> 


<NONE> 


<NONE> 


682 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


5.00E-04 


VSM2_TRYBB 


VARIANT 
SURFACE 
GLYCOPROTEIN 
MITAT 1.2 
PRECURSOR (VSG 
221) 


4.3 


683 


U00238 


Homo sapiens 
glutamine PRPP 
amidotransferase 
(GPAT) mRNA, 
complete cds 


0 


<NONE> 


<NONE> 


<NONE> 


684 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


0.005 


PRPR_SALTY 


PROPIONATE 

CATABOLISM 

OPERON 

REGULATORY 

PROTEIN 


1.5 



WO 99/33982 



PCT/US98/27610 



Table 2 



Nearest Neighbor (BlastN vs. Genbank) 


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


685 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 

90 mRNA, 

complete cds 


7.00E-07 


YAND_SCHPO 


HYPOTHETICAL 
30.4 KD PROTEIN 
C3H1.13 IN 
CHROMOSOME I 


0.38 


686 


D25538 


Human mRNA 
for KIAA0037 
gene, complete 
cds 


0 


<NONE> 


<NONE> 


<NONE> 


687 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


2.00E-07 


A1AA_RAT 


ALPHA- 1 A 
ADRENERGIC 
RECEPTOR (RA42) 


4.4 


688 


L26956 


Mesocricetus 
auratus stearyl- 
CoA desaturase 
sequence 
including male 
hormone 
dependent gene 
derived from 
hamster 
flank organ 


4.00E-33 


<NONE> 


<NONE> 


<NONE> 


689 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


3.00E-10 


<NONE> 


<NONE> 


<NONE> 


690 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


3.00E-09 


Y093_CAEEL 


HYPOTHETICAL 
58.5 KD PROTEIN 
T20B12.3 IN 
CHROMOSOME III 


2.00E-08 


691 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


8.00E-09 


<NONE> 


<NONE> 


<NONE> 


692 


AB017026 


Mus musculus 
mRNA for 
oxysterol-binding 
protein, complete 
cds 


0 


OXYB_RABIT 


OXYSTEROL- 
BINDING PROTEIN 


1.00E-34 


693 


U95098 


Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 


6.00E-04 


UFO2_MAIZE 


FLAVONOL 3-O- 
GLUCOSYLTRANS 
FERASE (EC 
2.4.1.91) 


3.1 


694 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


5.00E-04 


<NONE> 


<NONE> 


<NONE> 


695 


U34954 


Caenorhabditis 
elegans 
cyclophilin 
isoform 10 


5.00E-24 


CYPA_CAEEL 


PEPTIDYL- 
PROLYL CIS- 

TRANS 

ISOMERASE 10 
(EC 5.2.1.8) 


2.00E-29 


696 


AB011167 


Homo sapiens 
mRNA for 
KIAA0595 
protein, partial 
cds 


0 


RFX5_HUMAN 


BINDING 

REGULATORY 

FACTOR 


2.1 


697 


U03886 


Human GS2 
mRNA, complete 
cds. 


2.00E-28 


SKD1_MOUSE 


SKD1 PROTEIN 


4.00E-17 


698 


AF086275 


Homo sapiens full 
length insert 
cDNA clone 
ZD45C02 


3.00E-41 


SPT7_YEAST 


TRANSCRIPTIONA 
L ACTIVATOR 
SPT7 


0.82 


699 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


3.00E-10 


CA1E_HUMAN 


COLLAGEN 
ALPHA 1(XV) 
CHAIN 
PRECURSOR 


1.1 


700 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


4.00E-11 


E434_ADECC 


Q65962 canine 
adenovirus type 1 
(strain ell), early e4 
31 kd protein. 11/98 


4.4 


701 


L17340 


Drosophila 
melanogaster 
germline 
transcription 
factor gene, 
complete cds. 


3.3 


CISY_TETTH 


CITRATE 
SYNTHASE, 
MITOCHONDRIAL 
PRECURSOR (EC 
4.1.3.7) (14 NM 
FILAMENT- 
FORMING 
PROTEIN) 


9.7 


702 


X58170 


M.musculus 
mRNA for t- 
Complex Tcp-10a 
gene 


2.00E-45 


PME2_LYCES 


PECTINESTERASE 
2 PRECURSOR (EC 
3.1.1.11) (PECTIN 
METHYLESTERAS 
E) (PE 2) 


7.4 


703 


Z96207 


H.sapiens 
telomeric DNA 
sequence, clone 
12PTEL049, read 
12PTELOO049.seq 


8.00E-08 


<NONE> 


<NONE> 


<NONE> 


704 


X58430 


Human Hox1.8 
gene 


e-146 


HXAA_HUMAN 


HOMEOBOX 
PROTEIN HOX-A10 
(HOX-1H) (HOX- 
1.8) (PL) 


4.00E-05 


705 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


6.00E-06 


YN39_SYNP7 


HYPOTHETICAL 
9.2 KD PROTEIN 
IN CYST-CYSR 
INTERGENIC 
REGION (ORF 81) 


0.89 


706 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


1.00E-11 


MYSH_BOVIN 


MYOSIN I HEAVY 
CHAIN-LIKE 
PROTEIN (MIHC) 
(BRUSH BORDER 
MYOSIN I) (BBM1) 


0.001 


707 


M19961 


Human 
cytochrome c 
oxidase subunit 
Vb (coxVb) 
mRNA, complete 
cds. 


e-123 


OTHU5B 


<NONE> 


3.00E-30 


708 


X68380 


M.musculus gene 
for cathepsin D, 
exon 3 


5.00E-04 


42_MOUSE 


ERYTHROCYTE 
MEMBRANE 
PROTEIN BAND 
4.2 (P4.2) 

(PALLIDIN) 


9.9 


709 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


1.00E-11 


TCPA_DROME 


T-COMPLEX 
PROTEIN 1, 
ALPHA SUBUNIT 
(TCP-1-ALPHA) 


4.3 


710 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 

complete cds 


3.00E-10 


<NONE> 


<NONE> 


<NONE> 


711 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


4.00E-12 


<NONE> 


<NONE> 


<NONE> 


712 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


0.002 


<NONE> 


<NONE> 


<NONE> 


713 


AB018323 


Homo sapiens 
mRNA for 
KIAA0780 
protein, partial 
cds 


3.00E-41 


LBR_CHICK 


LAMIN B 
RECEPTOR 


3.4 


714 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


6.00E-06 


YM8LYEAST 


HYPOTHETICAL 
71.1 KD PROTEIN 
IN DSK2-CAT8 
INTERGENIC 


3.00E-08 


715 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


4.00E-13 


PSC_DROME 


POSTERIOR SEX 
COMBS PROTEIN 


0.6 


716 


L28101 


Homo sapiens 
kallistatin (PI4) 
gene, exons 1-4, 
complete cds 


7.00E-07 


IRKXRAT 


INWARD 
RECTIFIER 
POTASSIUM 
CHANNEL BIR9 
(KIR5.1) 


5.4 


717 


AC001038 


Homo sapiens 
(subclone 2 h2 
from P1 H49) 
DNA sequence 


8.00E-09 


MGMT_YEAST 


METHYLATED- 
DNA--PROTEIN- 
CYSTEINE 
METHYLTRANSFE 
RASE 




718 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


1.00E-11 


YWDE_BACSU 


HYPOTHETICAL 
19.9 KD PROTEIN 
IN SACA-UNG 

REGION 
PRECURSOR 


1.8 


719 


U01139 


Mus musculus 
B6D2F1 clone 

1 R mTJtO A 
ZV*. 1 1 D IIli\.lN/\. 


e-110 


GSC_DROME 


HOMEOBOX 
PROTEIN 


7.2 


720 


AB017430 


Homo sapiens 
mRNA for 
kinesin-like DNA 
binding protein, 


0 


YBAV_ECOLI 


HYPOTHETICAL 
12.7 KD PROTEIN 
IN HUPB-COF 
INTERGENIC 
REGION 


0.17 


721 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


0.001 


CPCF_SYNP2 


PHYCOCYANOBIL 
IN LYASE BETA 
SUBUNIT (EC 4.-.-.-) 


2.4 


722 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


9.00E-10 


<NONE> 


<NONE> 


<NONE> 


723 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


0.04 


YKK7_CAEEL 


HYPOTHETICAL 
54.9 KD PROTEIN 
C02F5.7 IN 
CHROMOSOME III 


0.057 


724 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


8.00E-08 


H5JTAIMO 


HISTONE H5 


0.39 


725 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


3.00E-09 


DED1_YEAST 


PUTATIVE ATP- 

DEPENDENT RNA 

HELICASE DED1 


0.5 


726 


J04617 


Human elongation 
factor EF-1-alpha 
gene, complete 
cds. > :: 

9DNA of human 
polypeptide chain 
elongation factor- 

1 alpha 


5.00E-36 


ALU7_HUMAN 


!!!! ALU 

SUBFAMILY SQ 
WARNING ENTRY 
!!!! 


0.84 


727 


X54859 


Porcine TNF- 
alpha and TNF- 
beta genes for 
tumour necrosis 
factors alpha and 
beta, respectively. 


3.3 


Z165_HUMAN 


ZINC FINGER 
PROTEIN 165 


5.6 


728 


D49911 


Thermus 
thermophilus 
UvrA gene, 
complete cds 


0.014 


CC48_CAPAN 


CELL DIVISION 
CYCLE PROTEIN 
48 HOMOLOG 


9.9 


729 


U95098 


Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 


2.00E-06 


CA25_HUMAN 


PROCOLLAGEN 
ALPHA 2(V) 
CHAIN 
PRECURSOR 


0.011 


730 


D15057 


Human mRNA 
for DAD-1, 
complete cds 


0 


DAD1_HUMAN 


DEFENDER 
AGAINST CELL 
DEATH 1 (DAD-1) 


8.00E-16 


731 


U95098 


Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 


6.00E-06 


ANFD_RHOCA 


NITROGENASE 
IRON-IRON 
PROTEIN ALPHA 
CHAIN (EC 
1.18.6.1) 

(NITROGENASE 
COMPONENT I) 
(DINITROGENASE) 


9.6 


732 


U95098 


Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 


7.00E-07 


EFTU_CHLVI 


ELONGATION 
FACTOR TU (EF- 
TU) 


2.5 


733 


AB018335 


Homo sapiens 
mRNA for 
KIAA0792 
protein, complete 
cds 


0 


TRYM_RAT 


MAST CELL 
TRYPTASE 
PRECURSOR (EC 
3.4.21.59) 


5.6 


734 


X98743 


H.sapiens mRNA 
for RNA helicase 
(Myc-regulated 
dead box protein) 


0.04 


<NONE> 


<NONE> 


<NONE> 


735 


U95098 


Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 


2.00E-07 


<NONE> 


<NONE> 


<NONE> 


736 


Z49314 


S.cerevisiae 
chromosome X 
reading frame 
ORF YJL039c 


3.2 


<NONE> 


<NONE> 


<NONE> 


737 


D12646 


Mouse kif4 
mRNA for 
microtubule- 
based motor 
protein KIF4, 
complete cds 


0 


KIF4_MOUSE 


KINESIN-LIKE 
PROTEIN KIF4 


2.00E-76 


738 


J04038 


Human 

glyceraldehyde-3- 

phosphate 

dehydrogenase 


2.00E-47 


SDC1_HUMAN 


SYNDECAN-1 
PRECURSOR 
(SYND1)(CD138) 


3.5 


739 


AF010238 


Homo sapiens 
von Hippel- 
Lindau tumor 
suppressor 


1.00E-09 


LIN1_HUMAN 


LINE-1 REVERSE 
TRANSCRIPTASE 
HOMOLOG 


0.001 


740 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


2.00E-06 


YQJX_BACSU 


HYPOTHETICAL 
13.2 KD PROTEIN 
IN GLNQ-ANSR 
INTERGENIC 
REGION 


9.9 


741 


T 1 1 1 %f\ 

LZ 1 1 OO 


Human lysyl 
oxidase-like 
protein mRNA, 
complete cds. 


e- ihj 


OYRTl 
v/Alv 1 L> 




1.00E-34 


742 


U95094 


Xenopus laevis 
XL-INCENP 

(XL-INCENP) 

mRNA, complete 
cds 


2.00E-05 


CC48_SOYBN 


CELL DIVISION 
CYCLE PROTEIN 

(VALOSIN 
CONTAINING 
PROTEIN 
HOMOLOG)(VCP) 


7.6 


743 


AF009203 


Homo sapiens 

VAT pinna 

ial cione 

377A1 unknown 

mRNA, 

3 'untranslated 

region 


3.3 


<NONE> 


<NONE> 


<NONE> 


744 


77/1 QQA 


S.cerevisiae 
chromosome XV 
reading frame 
ORF YOL152w 


f\ 1 O 


CD14_RABIT 


i^zooou orycioiagus 
cuniculus (rabbit), 
monocyte 
differentiation 
antigen cdl4 

precursor. 11/98 


1 Q 

1 Jy 


745 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


9.00E-10 


KIN3_YEAST 


SERINE/THREONI 
NE-PROTEIN 
KINASE KIN3 (EC 
2.7.1.-) 


2.5 


746 


U95102 


Xenopus laevis 

mitotic 

phosphoprotein 
90 mRNA, 
complete cds 






HYPOTHETICAL 

24.2 KD PROTEIN 
C13A11.03 IN 
CHROMOSOME I 


7 OnF-17 

/ .\J\J 1 / 


747 




ALDH3=aldehyd 

e dehydrogenase 
isozyme 3 
[human, stomach, 
mRNA Partial, 
1362 nt] 


o 

V 


DHAP_HUMAN 


ALDEHYDE 
DEHYDROGENAS 
E, DIMERIC NADP- 
PREFERRING (EC 
1.2.1.5) (CLASS 3) 


2.00E-71 


748 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


2.00E-08 


CA1E_CHICK 


COLLAGEN 
ALPHA 1(XIV) 
CHAIN 
PRECURSOR 
(UNDULIN) 


0.36 


749 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


7.00E-06 


<NONE> 


<NONE> 


<NONE> 


750 


L14815 


Entamoeba 
histolytica HM- 
1:IMSS galactose- 
specific adhesin 
170kD subunit 
(hgl3) gene, 
complete cds. 


0.12 


<NONE> 


<NONE> 


<NONE> 


751 


X63785 


T.thermophila 
gene for snRNA 
U2-2 


1.1 


<NONE> 


<NONE> 


<NONE> 


752 


M83756 


Mytilus edulis 
mitochondrial 
NADH 

dehydrogenase 
subunit 5 (ND5) 
gene, 3' end; 
NADH 

dehydrogenase 
subunit 6 (ND6) 
gene, complete 
cds; and 

cytochrome b (cyt 
b), 5' end. 


0.042 


DSC1_HUMAN 


DESMOCOLLIN 
1A/1B 


PRECURSOR 
(DESMOSOMAL 
GLYCOPROTEIN 
2/3) (DG2/DG3) 


2.6 


753 


AB001066 


Brown trout 
microsatellite 
DNA sequence 


0.38 


IMB3_HUMAN 


IMPORTIN BETA-3 
SUBUNIT 
(KARYOPHERIN 
BETA-3 SUBUNIT) 


1.2 


754 


AF064787 


Lotus japonicus 
rac GTPase 
activating protein 

complete cds 


0.51 


<NONE> 


<NONE> 


<NONE> 


755 


U20608 


Dictyostelium 
discoideum 
unknown spore 
germination- 
specific protein- 
like protein orf1, 
orf2 and orf3 
genes, complete 
cds 


0.043 


<NONE> 


<NONE> 


<NONE> 


756 


M77812 


Rabbit myosin 
heavy chain 
mRNA, complete 
cds. 


1.2 


RBL1_HUMAN 


RETINOBLASTOM 
A-LIKE PROTEIN 1 
(107 KD 

RETINOBLASTOM 
A-ASSOCIATED 
PROTEIN) (PRB1) 
(P107) 


4.9 


757 


AOJ foy 


T.thermophila 
genes for snRNA 
U5-1, snRNA U5- 
2 


U.UDo 






<NONE> 


758 




Mouse mRNA for 
SDF2, complete 
cds 




PMT1_YEAST 


DOLICHYL- 

PHOSPHATE- 

MANNOSE-- 

PROTEIN 

MANNOSYLTRAN 

SFERASE 1 (EC 

2.4.1.109) 


0.002 


759 


L81583 


Homo sapiens 
(subclone 3_g2 
from P1 H11) 
DNA sequence 


3.00E-J9 


ALU5_HUMAN 


!!!! ALU 

SUBFAMILY SC 
WARNING ENTRY 

!!!! 


0.86 


760 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


2.00E-06 


SYFA_YEAST 


PHENYLALANYL- 
TRNA 

SYNTHETASE 
ALPHA CHAIN 


5.7 


761 


AF000370 


Homo sapiens 
polymorphic CA 
dinucleotide 
repeat flanking 
region 


6.00E-89 


APP1_MOUSE 


AMYLOID-LIKE 
PROTEIN 1 
PRECURSOR 
(APLP) 


5.7 


762 


U95098 


Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 


0.002 


<NONE> 


<NONE> 


<NONE> 


763 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


7.00E-06 


PSF_HUMAN 


PTB-ASSOCIATED 
SPLICING FACTOR 
(PSF) 


0.72 


764 


AB018288 


Homo sapiens 
mRNA for 
KIAA0745 
protein, partial 
cds 


0 


TC2A_CAEBR 


TRANSPOSABLE 
ELEMENT TCB2 
TRANSPOSASE 


1.5 


765 


AF020282 


Dictyostelium 
discoideum 
DG2033 gene, 
partial cds 


0.38 


PMT2_YEAST 


DOLICHYL- 

PHOSPHATE- 

MANNOSE- 

PROTEIN 

MANNOSYLTRAN 

SFERASE 2 (EC 

2.4.1.109) 


0.18 


766 


AF017357 


Oryza sativa low 
molecular early 
light-inducible 
protein mRNA, 
complete cds 


0.38 


RGS3_HUMAN 


REGULATOR OF 
G-PROTEIN 
SIGNALLING 3 
(RGS3) (RGP3) 


0.23 


767 


U67599 


Methanococcus 
jannaschii section 
141 of 150 of the 
complete genome 


0.13 


<NONE> 


<NONE> 


<NONE> 


768 


X74178 


B.taurus 
microsatellite 
DNA INRA153 


0.13 


FAG1_SYNY3 


P73574 

synechocystis sp. 
(strain pcc 6803). 3- 
oxoacyl-[acyl-carrier 
protein] reductase 1 
(ec 1.1.1.100) (3- 
ketoacyl- acyl carrier 
protein reductase 1). 
11/98 


5.00E-16 


769 


AF041858 


Mus musculus 
synaptojanin 2 
isoform delta 
mRNA, partial 
cds 


0.043 


CA44_HUMAN 


COLLAGEN 
ALPHA 4(IV) 
CHAIN 
PRECURSOR 


0.24 


770 


J01404 


Drosophila 
melanogaster 
mitochondrial 
cytochrome c 
oxidase subunits, 
ATPase6, 7 
tRNAs (Trp, Cys, 
Tyr, Leu(UUR), 
Lys, Asp, Gly) 
genes, and 
unidentified 
reading frames 
A6L, 2 and 3. 


0.021 


NU1MJTITLA 


NADH- 
UBIQUINONE 
OXIDOREDUCTAS 

E CHAIN 1 (EC 
1.6.5.3) 


7.2 


771 


AL022317 


Human DNA 
sequence from 
clone 140L1 on 
chromosome 
22q13.1-13.31, 
complete 
sequence [Homo 
sapiens] 


3.00E-41 


ALU7_HUMAN 


!!!! ALU 

SUBFAMILY SQ 
WARNING ENTRY 

!!!! 


4.00E-08 


772 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


1.00E-09 


<NONE> 


<NONE> 


<NONE> 


773 


AF095927 


Rattus norvegicus 
protein 

phosphatase 2C 
mRNA, complete 
cds 


0 


P2C_PARTE 


PROTEIN 

PHOSPHATASE 2C 
(EC 3.1.3.16) (PP2C) 


1.00E-16 


774 


X87212 


H.sapiens mRNA 
for cathepsin C 


0 


CATC_HUMAN 


DIPEPTIDYL- 
PEPTIDASE I 
PRECURSOR (EC 
3.4.14.1) 


2.00E-46 


775 


X05283 


Drosophila 
melanogaster 
PKCG7 gene 
exons 7-14 for 
protein kinase C 


4.5 


<NONE> 


<NONE> 


<NONE> 


776 




Human mRNA 
for elongation 
factor 1 alpha 
subunit 


u 


EF11_HUMAN 


ELONGATION 

FACTOR 1-ALPHA 
1 (EF-1-ALPHA-1) 


1 .UUt-oJ 


777 


X06960 


Aspergillus 

nidulans 

mitochondrial 

DNA for 

cytochrome 

oxidase subunit 3, 

tRNA-Tyr 


0.23 


<NONE> 


<NONE> 


<NONE> 


778 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


3.00E-09 


YMT8_YEAST 


HYPOTHETICAL 
36.4 KD PROTEIN 
IN NUP116-FAR3 
INTERGENIC 
REGION 


5.00E-07 


779 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


2.00E-07 


NAT1_YEAST 


N-TERMINAL 
ACETYLTRANSFE 
RASE 1 (EC 
2.3.1.88) 


5.00E-23 


780 


U59706 


Gallus gallus 
alternatively 
spliced AMPA 
glutamate 
receptor, isoform 
GluR2 flop, 
(GluR2) mRNA, 
partial cds. 


0.014 


PPOL_SARPE 


POLY (ADP- 
RIBOSE) 

POLYMERASE (EC 
2.4.2.30) (PARP) 


0.021 


781 


U57391 


Rattus norvegicus 
FceRI gamma- 
chain interacting 
protein SH2-B 
(SH2-B) mRNA, 
complete cds 


1.00E-84 


<NONE> 


<NONE> 


<NONE> 


782 


AB014591 


Homo sapiens 
mRNA for 
KIAA0691 
protein, complete 
cds 


7.00E-57 


SSGP_VOLCA 


SULFATED 
SURFACE 
GLYCOPROTEIN 
185 (SSG 185) 


5.3 


783 


AJ008065 


Chrysolina bankii 
16S rRNA gene, 
mitotype B2 


0.043 


<NONE> 


<NONE> 


<NONE> 


784 


AF067212 


Caenorhabditis 
elegans cosmid 
F37F2 


0.005 


MEK1_RAT 


MAPK/ERK 
KINASE KINASE 1 
(EC 2.7.1.-) (MEK 

rv 1 1> ay j j 


4.5 


785 


U95094 


Xenopus laevis 

XL-INCENP 

(XL-INCENP) 

mRNA, complete 

cds 


0.042 


<NONE> 


<NONE> 


<NONE> 


786 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


9.00E-09 


<NONE> 


<NONE> 


<NONE> 


787 


Y13401 


Homo sapiens 
CD3 delta gene, 

enhancer 

sequence 


8.00E-08 


<NONE> 


<NONE> 


<NONE> 


788 


AE001038 


Archaeoglobus 
fulgidus section 
69 of 172 of the 
complete genome 


0.13 


<NONE> 


<NONE> 


<NONE> 


789 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


2.00E-06 


<NONE> 


<NONE> 


<NONE> 


790 


AF041463 


Manihot esculenta 
elongation factor 
1-alpha 


1.4 


<NONE> 


<NONE> 


<NONE> 


791 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


0.002 


HXA3_HAEIN 


HEME:HEMOPEXI 
N-BINDING 
PROTEIN 
PRECURSOR 


2.7 


792 


Z12112 


pWE15A cosmid 
vector DNA 


3.00E-29 


PKWA_THECU 


PUTATIVE 
SERINE/THREONI 
NE-PROTEIN 
KINASE PKWA 
(EC 2.7.1.-) 


2.00E-04 


793 


U85193 


Human nuclear 
factor I-B2 
(NFIB2) mRNA, 
complete cds 


4.00E-44 


<NONE> 


<NONE> 


<NONE> 


794 


U89331 


Human 

pseudoautosomal 
homeodomain- 
containing protein 
(PHOG) mRNA, 
complete cds 


7.00E-06 


NRL_HUMAN 


NEURAL RETINA- 
SPECIFIC 
LEUCINE ZIPPER 
PROTEIN (NRL) 


6.3 


795 


AF055666 


Mus musculus 
kinesin light chain 
2 (Klc2) mRNA, 
complete cds 


0.52 


PSPD_BOVIN 


PULMONARY 
SURFACTANT- 
ASSOCIATED 
PROTEIN D 
PRECURSOR 


0.33 


796 


L13321 


Homo sapiens 
iduronate-2- 
sulfatase (IDS) 
gene, exon 1, 
incomplete 5' end. 


0.14 


YRP2_YEAST 


HYPOTHETICAL 
84.4 KD PROTEIN 

IN Kr L,Z/ivC 1 1 

3'REGION 


0.27 


797 


AL010270 


Plasmodium 
falciparum DNA 
*** 

SEQUENCING 
IN PROGRESS 
*** from contig 
4-96, complete 
sequence 


0.37 


YTH3_CAEEL 


HYPOTHETICAL 
75.5 KD PROTEIN 

Pi A A A 1 TT\I 

CHROMOSOME II 


2 


798 


U95098 


Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 


0.015 


IMB3_HUMAN 


IMPORTIN BETA-3 
SUBUNIT 
(KARYOPHERIN 
BETA-3 SUBUNIT) 


0.063 


799 




Mus musculus 
putative CCR4 
protein mRNA, 
partial cds 


U 




REPRESSIBLE 
ALCOHOL 
DEHYDROGENAS 
E 

TRANSCRIPTIONA 
L EFFECTOR 
(CARBON 
CATABOLITE 
REPRESSOR 
PROTEIN 4) 




800 


L26507 


Mouse myocyte 
nuclear factor 
(MNF) mRNA, 
complete cds. 


3.00E-41 


MNF_MOUSE 


MYOCYTE 
NUCLEAR 
FACTOR (MNF) 


4.00E-18 


801 


U20527 


Mus musculus 
chemokine KC 
gene, 5' region. 


0 


GRO_MOUSE 


GROWTH REGULATED 
PROTEIN PRECURSOR 
(PLATELET-DERIVED 
GROWTH FACTOR- 
INDUCIBLE PROTEIN 
KC) (SECRETORY 
PROTEIN N51) 


1.00E-28 


802 


AF065482 


Homo sapiens 
sorting nexin 2 
(SNX2) mRNA, 
complete cds 


0 


MYSA_DROME 


MYOSIN HEAVY 
CHAIN, MUSCLE 


0.089 


803 


U05823 


Mus musculus 

pericentrin 

mRNA, 

complete cds. 


1.00E-94 


M84D_DROME 


MALE SPECIFIC 
SPERM PROTEIN 
MST84DD 


0.099 




804 


U67468 


Methanococcus 

jannaschii 

section 10 of 150 

of the complete 

genome 


0.4 


<NONE> 


<NONE> 


<NONE> 


805 


U14178 


Human type II 
IL-1 receptor 
gene, exon IB 


1.00E-19 


AMPH_HUMAN 


AMPHIPHYSIN 


o o 
z.v 


806 


L40411 


Homo sapiens 
thyroid receptor 
interactor 


0 


TRI8_HUMAN 


THYROID RECEPTOR 
INTERACTING 
PROTEIN 8 (TRIP8) 


4.00E-86 


807 


D17218 


Human HepG2 
3' region MboI 
cDNA, clone 
hmd3g02m3 


e-136 


CA1A_HUMAN 


COLLAGEN ALPHA 
1(X) CHAIN 
PRECURSOR 




808 


Z57610 


H.sapiens CpG 
DNA, clone 
187a10, reverse 
read 

cpg187a10.rt1a. 


C" J v« 




HEPATOCYTE 
NUCLEAR FACTOR 3- 
BETA (HNF-3B) 


1.00E-24 


809 


D14678 


Human mRNA 
for kinesin- 
related protein, 
partial cds 


0 


NCD_DROME 


CLARET 

SEGREGATIONAL 
PROTEIN 


1.00E-70 


810 


X56317 


Xiphophorus 
maculatus 
Xmrk(proto- 
oncogene) gene 
for receptor 
tyrosine kinase. 


0.49 


urs.ii d x a/~\\ to rr 
WNlD_MUUbb 


WNT-10B PROTEIN 

PRECURSOR (WNT-12) 


7 9 


811 


M36200 


Human 

synaptobrevin 1 

(SYB1) gene, 

exon 5. 


0.2 


VE2_HPV14 


REGULATORY 
PROTEIN E2 


3.1 


812 


M18157 


Human glandular 
kallikrein gene, 
complete cds. 


1.5 


EKLF_MOUSE 


ERYTHROID 

KRUEPPEL-LIKE 
TRANSCRIPTION 
FACTOR (EKLF) 


1.1 


813 


D25215 


Human mRNA 
for KIAA0032 
gene, complete 
cds 


1.9 


YXIS_SACER 


HYPOTHETICAL 28.9 
KD PROTEIN IN XIS 
5'REGION (ORF1) 


1.3 


814 


M96628 


Human gene 
sequence, 5' end. 


2.00E-06 


AGRI_DISOM 


AGRIN (FRAGMENT) 


9.5 




815 


Z57610 


H.sapiens CpG 
DNA, clone 
187a10, reverse 
read 

cpg187a10.rt1a. 


e-102 


HN3B_MOUSE 


HEPATOCYTE 
NUCLEAR FACTOR 3- 
BETA (HNF-3B) 


1.00E-19 


816 


X14165 


Human pLC46 
with DNA 
replication origin 


c nrit: i a 
3.UUb-lo 


ZN44_HUMAN 


44 (ZINC FINGER 
PROTEIN KOX7) 


i .0 


817 


M19262 


Rat clathrin light 
chain (LCB3) 
mRNA 
complete cds. 


0.28 


LMA_DROME 


LAMININ ALPHA 
CHAIN PRECURSOR 


4.7 


818 


AF058055 


Mus musculus 
monocarboxylate 
transporter 1 


0.2 


<NONE> 


<NONE> 


<NONE> 


819 


AB014570 


Homo sapiens 
mRNA for 
KIAA0670 
protein, partial 
cds 


0.16 


YGR1_YEAST 


HYPOTHETICAL 34.8 
KD PROTEIN IN SUT1- 
RCK1 INTERGENIC 
REGION 


4.00E-06 


820 


M19262 


Rat clathrin light 
chain (LCB3) 
mRNA, 
complete cds. 


0.27 


LMA_DROME 


LAMININ ALPHA 
CHAIN PRECURSOR 


4.5 





821 


Z54367 


H.sapiens gene 
for plectin 


0.29 


Y093_CAEEL 


HYPOTHETICAL 58.5 
KD PROTEIN T20B12.3 
IN CHROMOSOME III 


1.00E-14 


822 


AB017026 


Mus musculus 
mRNA for 
oxysterol- 
binding protein, 
complete cds 


0 


OXYB_HUMAN 


OXYSTEROL-BINDING 
PROTEIN 


2.00E-49 


823 


X58170 


M.musculus 
mRNA for t- 
Complex Tcp- 
10a gene 


1.00E-20 


UL52JBV11 


DNA 

HELICASE/PRIMASE 
COMPLEX PROTEIN 
(DNA REPLICATION 
PROTEIN UL52) 


5.3 


824 


X58430 


Human Hox1.8 
gene 


0 


HXAA_HUMAN 


HOMEOBOX PROTEIN 

HOX-A10 (HOX-1H) 

(HOX-1.8) (PL) 


1.00E-44 


825 


X53754 


Porcine 

sarcoplasmic/end 
oplasmic- 
reticulum Ca(2+) 
pump gene 2 3'- 
end region 


1.3 


<NONE> 


<NONE> 


<NONE> 




826 


AB005786 


Arabidopsis 
thaliana tRNA- 
Glu gene 


0.46 


<NONE> 


<NONE> 


<NONE> 


827 


AB012130 


Homo sapiens 
SBC2 mRNA for 

bicarbonate 
cotransporter2, 
complete cds 


1.9 


<NONE> 


<NONE> 


<NONE> 


828 


AB017430 


Homo sapiens 
mRNA for 
kinesin-like 
DNA binding 
protein, complete 
cds 


0 


YBAV_ECOLI 


HYPOTHETICAL 12.7 
KD PROTEIN IN HUPB- 
COF INTERGENIC 
REGION 


0.063 


829 


AB007886 


Homo sapiens 
KIAA0426 
mRNA, 
complete cds 


0.042 


YDF3_SCHPO 


PROBABLE 
EUKARYOTIC 
INITIATION FACTOR 
C17C9.03 


0.52 


830 


AB018335 


Homo sapiens 
mRNA for 
KIAA0792 
protein, complete 
cds 


e-172 


UROT_BOVIN 


TISSUE PLASMINOGEN 
ACTIVATOR 
PRECURSOR (EC 
3.4.21.68) 


0.86 


831 


D12646 


Mouse kif4 
mRNA for 

microtubule- 

based motor 
protein KIF4, 
complete cds 


0 


KIF4_MOUSE 


KINESIN-LIKE 
PROTEIN KIF4 


9.00E-96 




832 


U38376 


Rattus 

norvegicus 

cytosolic 

phospholipase 
A2 mRNA, 
complete cds 


0.048 


<NONE> 


<NONE> 


<NONE> 


833 


L40411 


Homo sapiens 
thyroid receptor 
interactor 


0 


TRI8_HUMAN 


THYROID RECEPTOR 
INTERACTING 
PROTEIN 8 (TRIP8) 


4.00E-86 


834 


U08110 


Mus musculus 
RNA1 homolog 
(Fug1) mRNA, 
complete cds. 


8.00E-04 


YNW7_YEAST 


HYPOTHETICAL 68.8 
KD PROTEIN IN URE2- 
SSU72 INTERGENIC 
REGION 


0.02 


835 


D50646 


Mouse mRNA 
for SDF2, 
complete cds 


1.00E-40 


YB64_YEAST 


HYPOTHETICAL 57.2 
KD PROTEIN IN MET8- 
HPC2 INTERGENIC 
REGION 


4.9 


836 


D50646 


Mouse mRNA 
for SDF2, 
complete cds 


1.00E-40 


YB64_YEAST 


HYPOTHETICAL 57.2 
KD PROTEIN IN MET8- 
HPC2 INTERGENIC 
REGION 


4.9 


837 


U67459 


Methanococcus 
jannaschii 
section 1 of 150
of the complete 
genome 


5.00E-05 


GCS1_HUMAN 


MANNOSYL- 
OLIGOSACCHARIDE 
GLUCOSIDASE(EC 
3.2.1.106) 


9.2 


838 


U18657


Haemophilus
influenzae LeuA
(leuA) gene,
partial cds, DprA
(dprA+), orf272
and orf193
genes, complete 
cds, and PfkA 
(pfkA) gene, 
partial cds. 


0.01 


STE6_YEAST


MATING FACTOR A 
SECRETION PROTEIN 
STE6 (MULTIPLE 
DRUG RESISTANCE 
PROTEIN HOMOLOG) 
(P-GLYCOPROTEIN) 


7 
Table 2





Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


839 


U12523 


Rattus 
norvegicus 
ultraviolet B 
radiation- 
activated UV98 
mRNA, partial 
sequence. 


1.00E-10 


YMT8_YEAST 


HYPOTHETICAL 36.4 
KD PROTEIN IN 
NUP116-FAR3
INTERGENIC REGION 


2.00c-0o 


840 


D78255 


Mouse mRNA 
for PAP-1, 
complete cds 


e-175 


<NONE> 


<NONE> 


<NONE> 


841 


D17263


Human HepG2 
3' region MboI
cDNA, clone 
hmd5f07m3 


1.00E-58 


<NONE> 


<NONE> 


<NONE> 


842 


AF006751 


Homo sapiens 
ES/130 mRNA, 
complete cds


0.061 


YRP2_YEAST 


HYPOTHETICAL 84.4 
KD PROTEIN IN 


2.00E-07 


843


U67459 


Methanococcus 
jannaschii 
section 1 of 150 
of the complete 
genome 


6.00E-05 


YC14_METJA 


HYPOTHETICAL 
PROTEIN MJ1214 


8.1 


844 


D88689 


Mus musculus 
mRNA for flt-1, 
complete cds 


0.084 


ICP0_HSV2H 


TRANS-ACTING 
TRANSCRIPTIONAL 
PROTEIN ICP0 
(VMW118 PROTEIN)


0.014 
Table 5 All Differential Data for Libs 1-4 and 8-9



Clone Name


Cluster 


Clones in 


Clones in 


Clones in 


Clones in 


Clones in 


Clones in




ID


Lib1


Lib2


Lib3


Lib4


Lib8


Lib9


M00001340B:A06


17062


i 
J 


A 
U 


A 
U 


A 

\J 


A 

\J 


o


M00001340D:F10


11589


i 

z 


z 


1 
1 


1 
J 


1 
J 


8 


M00001341A:E12


4443


1 A 
IU 


0 


1 
Z 


O 


1 
J 


1 1 
i i 


M00001342B:E06


39805




i 

2 


A 
U 


A 
U 


A 
U 


1 
1 


A 
\} 


M00001343C:F10


2790


/ 


1 c 
1 J 


1 1 




O 


A 
U 


M00001343D:H07


23255


3


0


1


1


0


0


M00001345A:E01


6420


8


0


2


0


1


0


M00001346A:F09 


5007


A 

4 


Q 
O 


1 
3 


/: 
O 


1 

z 


0 


M00001346D:E03 


6806 


5 


1 

2 


i 
1 


i 
Z 


A 

u 


1 
3 


M00001346D:G06 


5779 


5 


A 

4 


i 
3 


4 


A 

u 


A 

U 


M00001346D:G06


5779


5 


A 

4 


i 
3 


4 


A 

u 


A 

u 


M00001347A:B10


13576


5


0


0


0


12


11


M00001348B:B04


16927


4 


A 

u 


0 


1 

z 


A 

u 


A 

u 


M00001348B:G06 


16985 


4 


A 

0 


A 

0 


A 

u 


A 

u 


A 

u 


M00001349B:B08


3584


5 


1 1 


5 


A 

u 


A 

u 


1 

z 


M00001350A:H01


7187 


5 


1 

3 


1 


A 
U 


1 


A 

u 


M00001351B:A08


3162


1 A 

10 


1 A 

14 


1 


0 


O 


c 


M00001351B:A08


3162


1 A 

IU 


1 A 

14 


I 


0 


0 


c 
J 


M00001352A:E02


16245


A 

4 


A 

0 


A 
U 


A 

u 


A 

u 


A 

u 


M00001353A:G12


8078 


A 

4 


i 
3 


1 


A 
U 


1 
1 


A 

u 


M00001353D:D10


14929 


A 

4 


A 
U 


A 
U 


1 
1 


ZJ 


i a 


M00001355B:G10


14391


3


1


0


0


0


0


M00001357D:D11


4059


n 

5 


/: 
0 


O 

o 


1 A 
10 


A 
U 


1 
1 


M00001361A:A05


4141


c 

J 


i 
Z 


in 

IU 


1 A 
10 




01 


M00001361D:F08


2379


26


13


4


2


2


3




M00001362B:D10


5622


1 


4 


1 
Z 


1 1 


1 

1 


0 


M00001362C:H11


945


A 

y 


Z 1 


i 
Z 


1 

1 


A 
u 


0 


M00001365C:C10


40132


i 

2 


A 
U 


A 
U 


A 
U 


*3 
J 


n 


M00001370A:C09


6867


/ 


i 
J 


A 
U 


A 

yj 


A 

V 




M00001371C:E09


7172


1 

3 


c 


J 


9 


A 

V 


1 


M00001376B:G06


17732


i 
1 


1 
j 


C 

J 


A 
1/ 


1 


4 


M00001378B:B02


39833


i 
Z 


A 
U 


A 
U 


A 
U 


A 
\J 


o 


M00001379A:A05


1334


z/ 


Jo 


JJ 


Zo 


1 
J 


A 
\J 


M00001380D:B09


39886


1 
Z 


A 
U 


A 


A 

\J 


A 
V/ 


o 

V 


M00001382C:A02


22979


1 

z 


1 
1 


A 
U 


A 
W 


A 


o 

\J 


M00001383A:C03


39648


2


0


0


0


0


0


M00001383A:C03


39648 


2 


0 


0 


0 


0 


0 


M00001386C:B12 


5178 


5 


5 


4 


2 


5 


2 


M00001387A:C05 


2464 


5 


19 


25 


16 


1 


0 


M00001387B:G03 


7587 


6 


2 


1 


0 


0 


0 


M00001388D:G05 


5832 


10 


3 


0 


1 


5 


0 


M00001389A:C08 


16269 


3 


0 


0 


0 


1 


1 


M00001394A:F01 


6583 


2 


7 


3 


2 


0 


0 


M00001395A:C03 


4016 


5 


14 


0 


6 


0 


0 


M00001396A:C03 


4009 


6 


4 


13 


5 


4 


10 


M00001402A:E08 


39563 


2 


0 


0 


0 


0 


0 
Table 5 All Differential Data for Libs 1-4 and 8-9



Clone Name 


Cluster 


Clones in 


Clones in 


Clones in 


Clones in 


Clones in 


Clones in




ID


Lib1


Lib2


Lib3


Lib4


Lib8


Lib9


M00001407B:D11 


5556 


8


1


5


0


2


0


M00001409C:D12 


9577 


5 


2 


0 


1


11


12


M00001410A:D07


7005


8


2


0


0


0


0


M00001412B:B10 


8551


4


4


0


3


0


0


M00001415A:H06 


13538 


5 


0 


0 


0 


9 


1


M00001416A:H01 


7674 


5 


2 


0 


5 


0 


0


M00001416B:H11 


8847 


4 


1 


3 


0 


6 


1 


M00001417A:E02 


36393 


2 


0 


0 


1 


0


0


M00001418B:F03


9952


4


2


1


1


0


0


M00001418D:B06 


8526 


3 


2 


1 


5 


1 


0 


M00001421C:F01 


9577 


5 


2 


0 


1 


11 


12 


M00001423B:E07 


15066 


4 


0 


0 


0 


0 


0 


M00001424B:G09


10470 


5 


1 


0 


2 


0 


1 


M00001425B:H08 


22195 


3 


0 


0 


0 


0 


0 


M00001426D:C08 


4261


4 


9 


7 


9 


12 


15 


M00001428A:H10 


84182 


1 


0 


0 


0 


0 


0 


M00001429A:H04 


2797 


15 


11 


18 


16 


1 


14 


M00001429B:A11 


4635 


7 


9 


2 


0 


0 


0 


M00001429D:D07 


40392 


2 


0 


1 


8 


12 


16 


M00001439C:F08 


40054 


1 


0 


0 


0 


0 


0 


M00001442C:D07 


16731 


3 


1 


0 


0 


0 


0 


M00001445A:F05 


13532 


3 


2 


1 


0 


1 


2 


M00001446A:F05 


7801 


5 


2 


4 


6 


1 


0 


M00001447A:G03 


10717 


7 


2 


0 


5 


8 


0 


M00001448D:C09


8 


1850 


2127 


1703 


3133 


1355 


122 


M00001448D:H01 


36313


2


0


0


0


1


30


M00001449A:A12 


5857 


6 


2 


3 


4 


0 


0


M00001449A:B12


41633


1


1


0


0


0


0


M00001449A:D12 


3681 


12 


5 


10 


1 


2 


5 


M00001449A:G10 


36535 


2 


0 


0 


0 


0


0


M00001449C:D06 


86110


1 


0 


0 




0


0


M00001450A:A02 


39304 


2 


0 


0


0


0


0


M00001450A:A11


32663


1


1


0


0


0


0


M00001450A:B12


82498


1


0


0


0


0


0


M00001450A:D08 


27250 


2 


0 


0


0


0


0


M00001452A:B04


84328


1


0


0


0


0


0


M00001452A:B12 


86859 


1 


0 


0 


0


0


0


M00001452A:D08 


1120 


44 


41 


5 


11


5


0


M00001452A:F05


85064


1


0


0


0


0


0


M00001452C:B06


16970 


4 


0 


0 


0 


3 


4 


M00001453A:E11


16130 


3 


1 


0 


0 


0 


1 


M00001453C:F06 


16653 


3 


1 


0 


0 


0 


0 


M00001454A:A09


83103 


1 


0 


0 


0 


0 


0 


M00001454B:C12 


7005 


8 


2 


0 


0 


0 


0 


M00001454D:G03


689 


58 


95 


17 


36 


66 


95 


M00001455A:E09


13238 


4 


1 


0 


0 


0 


0 


M00001455B:E12


13072 


4 


1 


0 


0 


0 


0 


M00001455D:F09 


9283 


4 


1 


0 


1 


0 


1 
Table 5 All Differential Data for Libs 1-4 and 8-9



Clone Name 


Cluster 


Clones in 


Clones in 


Clones in 


Clones in 


Clones in 


Clones in




ID


Lib1


Lib2 


Lib3 


Lib4 


Lib8 


Lib9 


M00001455D:F09 


9283 


4 


1 


0 


1 


0 


1 


M00001460A:F06 


2448 


23 


22 


2 


3 


3 


1 


M00001460A:F12


39498 


2 


0 


0 


0 


0 


0 


M00001461A:D06 


1531 


20 


23 


32 


17 


14 


14 


M00001463C:B11 


19 


1415 


1203 


1364 


525 


479 


774 


M00001465A:B11 


10145 


2 


0 


2 


0 


0 


0 


M00001466A:E07 


4275 


11 


2 


5 


0 


4 


2 


M00001467A:B07 


38759 


2 


0 


0 


0 


1 


1 


M00001467A:D04 


39508 


2 


0 


0 


0 


0 


0 


M00001467A:D08 


16283 


3 


0 


0 


0 


0 


0 


M00001467A:D08 


16283 


3 


0 


0 


0 


0 


0 


M00001467A:E10 


39442 


2 


0 


0 


0 


0 


0 


M00001468A:F05 


7589 


6 


2 


1 


1 


1 


0 


M00001469A:C10 


12081 


4 


0 


0 


0 


0 


0 


M00001469A:H12 


19105 


2 


0 


2 


0 


1 


0 


M00001470A:B10 


1037 


53 


48 


4 


22 


0 


0 


M00001470A:C04 


39425 


2 


0 


0 


0 


0 


0 


M00001471A:B01 


39478 


2 


0 


0 


0 


0 


0 


M00001481D:A05 


7985 


3 


1 


4 


0 


1 


0 


M00001490B:C04 


18699 


2 


1 


0 


0 


0 


3 


M00001494D:F06 


7206 


4 


3 


3 


1 


2 


0 


M00001497A:G02 


2623 


12 


4 


31 


4 


6 


1 


M00001499B:A11 


10539 


2 


1 


1 


0 


1 


0 


M00001500A:C05 


5336 


9 


2 


4 


8 


3 


15 


M00001500A:E11 


2623 


12 


4 


31 


4 


6 


1 


M00001500C:E04 


9443 


4 


2 


1 


1 


0 


0 


M00001501D:C02 


9685 


3 


2 


0 


7 


2 


3 


M00001504C:A07 


10185 


5 


1 


0 


0 


2 


4 


M00001504C:H06 


6974 


7 


3 


0 


1 


0 


0 


M00001504D:G06 


6420 


8 


0 


2 


0 


1 


0 


M00001507A:H05 


39168 


2 


0 


0 


0 


0 


0 


M00001511A:H06 


39412 


2 


0 


0 


0 


0 


0 


M00001512A:A09 


39186 


2 


0 


0 


0 


0 


0 


M00001512D:G09 


3956 


9 


9 


5 


2 


0 


0 


M00001513A:B06


4568 


10 


4 


0 


9 


2 


0 


M00001513C:E08 


14364 


1 


0 


0 


0 


0 


0 


M00001514C:D11 


40044 


2 


0 


0 


0 


0 


0 


M00001517A:B07 


4313 


13 


6 


1 


0 


1 


0 


M00001518C:B11 


8952 


3 


4 


0 


4 


2 


0 


M00001528A:C04 


7337 


4 


4 


3 


16 


12 


21 


M00001528A:F09 


18957 


3 


0 


0 


0 


0 


0 


M00001528B:H04 


8358 


3 


3 


2 


0 


0 


0 


M00001531A:D01 


38085 


2 


0 


0 


0 


0 


0 


M00001532B:A06 


3990 


6 


12 


4 


1 


3 


1 


M00001533A:C11


2428 


14 


14 


13 


9 


2 


19 


M00001534A:C04 


16921 


4 


0 


0 


1 


2 


1 


M00001534A:D09 


5097 


6 


5 


1 


1 


3 


2 


M00001534A:F09 


5321 


11 


7 


1 


5 


10 


26 
Table 5 All Differential Data for Libs 1-4 and 8-9



Clone Name 


Cluster 


Clones in 


Clones in 


Clones in 


Clones in 


Clones in 


Clones in 




ID 


Lib1


Lib2 


Lib3 


Lib4 


Lib8 


Lib9 


M00001534C:A01 


4119 


9 


4 


2 


2 


5 


3 


M00001535A:B01 


7665 


3 


1 


5 


0 


0 


0


M00001535A:C06 


20212 


2 


0 


1 


1 


0 


0


M00001535A:F10 


39423 


2 


0 


0 


0 


0


0


M00001536A:B07 


2696 


23 


11 


9 


18


10


21 


M00001536A:C08 


39392 


2 


0 


0 


0 


0 


0


M00001537A:F12 


39420 


2 


0 


0 


0 


0 


0


M00001537B:G07 


3389 


4 


11 


13 


2 


0 


0 


M00001540A:D06 


8286 


6 


1 


0 


3 


4 


0 


M00001541A:D02 


3765 


19 


6 


0 


0 


0 


0 


M00001541A:F07 


22085 


3 


0 


0 


0 


0 


1 


M00001541A:H03 


39174 


2 


0 


0 


0 


0 


0 


M00001542A:A09 


22113 


3 


0 


0 


0 


0 


0 


M00001542A:E06 


39453 


2 


0 


0 


0 


0 


0 


M00001544A:E03 


12170 


2 


1 


2 


0 


0 


0 


M00001544A:G02 


19829 


2 


0 


1 


0 


0 


0 


M00001544B:B07 


6974 


7 


3 


0 


1


0 


0 


M00001545A:C03 


19255 


2 


0 


0 


0 


0 


0 


M00001545A:D08 


13864 


3 


0 


2 


1 


2 


4 


M00001546A:G11 


1267 


43 


55 


5 


0 


0 


0 


M00001548A:E10 


5892 


5 


1 


4 


4 


1 


3 


M00001548A:H09 


1058 


40 


44 


37 


47 


39 


59 


M00001549A:B02 


4015 


10 


5 


8 


15 


2 


0 


M00001549A:D08 


10944 


3 


0 


3 


1 


0 


7 


M00001549B:F06 


4193 


12 


7 


2 


2 


0 


1 


M00001549C:E06 


16347 


4 


0 


0 


0 


0 


0 


M00001550A:A03 


7239 


5 


2 


1 


0 


2 


0


M00001550A:G01


5175


8


1


3


2


0


0


M00001551A:B10 


6268 


6 


4 


3 


18 


5 


0


M00001551A:F05


39180


2


0


0


0


0


0


M00001551A:G06 


22390 


2 


1 


0 


0 


0 


1


M00001551C:G09 


3266 


12 


14 


0 


1 


0


0


M00001552A:B12 


307 


73 


60 


196 


75 


79


27 


M00001552A:D11 


39458 


2 


0 


0 


0


0


0


M00001552B:D04 


5708 


5 


4 


4 


3 


1 


4 


M00001553A:H06 


8298 


4 


3 


1 


3 


0 


0


M00001553B:F12 


4573 


5 


7 


2 


5 


0


1 


M00001553D:D10 


22814 


3 


0 


0 


0 


0


0


M00001555A:B02 


39539 


2 


0 


0 


0


1


0


M00001555A:C01 


39195 


2 


0 


0 


0 


0 


0 


M00001555D:G10 


4561 


8 


4 


4 


8 


0 


0 


M00001556A:C09 


9244 


2 


0 


3 


2 


10 


17 


M00001556A:F11 


1577 


12 


40 


25 


3 


4 


0 


M00001556A:H01 


15855 


2 


1 


1 


2 


12 


213 


M00001556B:C08 


4386 


7 


8 


3 


1 


3 


21 


M00001556B:G02 


11294 


4 


0 


2 


0 


0 


1 


M00001557A:D02 


7065 


5 


3 


2 


1 


0 


0 


M00001557A:D02 


7065 


5 


3 


2 


1 


0 


0 
Table 5 All Differential Data for Libs 1-4 and 8-9



Clone Name 


Cluster 


Clones in 


Clones in 


Clones in 


Clones in 


Clones in 


Clones in




ID


Lib1


Lib2


Lib3


Lib4


Lib8


Lib9


M00001557A:F01 


9635 


3 


0 


2 


1 


0


0


M00001557A:F03 


39490 


2 


0 


0


0


1


0


M00001557B:H10


5192 


8 


5 


A 

0 


c 
J 


A 
U 


u 


M00001557D:D09 


8761 


3 


4 


0 


1 


0


J 


M00001558B:H11 


7514 


5 


3 


0 


0


0


0


M00001560D:F10 


6558 


4 


3 


4 


0 


0


C 

J 


M00001561A:C05 


39486 


2 


0 


0 


0 


0


0


M00001563B:F06 


102 


289 


233 


278 


116 


123 


184 


M00001564A:B12 


5053 


11


4


2


2


1


1


M00001571C:H06 


5749 


4 


1 


9 


0 


0


0


M00001578B:E04 


23001 


2 


1 


0 


2 


0


0


M00001579D:C03 


6539 


8 


3 


0 


0 


0


1


M00001583D:A10 


6293 


3 


5 


2 


6 


0


0


M00001586C:C05 


4623 


3 


4 


12 


2 


1 


1 


M00001587A:B11 


39380 


2 


0 


0 


0 


0 


0


M00001594B:H04 


260 


189 


188 


27 


2 


15 


0 


M00001597C:H02 


4837 


6 


2 


10 


0 


3 


1 


M00001597D:C05 


10470 


5 


1 


0 


2 


0 


1


M00001598A:G03 


16999 


4 


0 


0 


0 


0


0


M00001601A:D08 


22794 


2 


0 


0 


0 


0




M00001604A:B10 


1399 


49 


27 


19 


7 


10




M00001604A:F05 


39391 


2 


0 


0 


0 


0


0


M00001607A:E11 


11465 


5 


0 


0 


0 


0


0


M00001608A:B03 


7802 


5 


4 


0 


1 


0


0


M00001608B:E03 


22155 


3 


0 


0 


0 


0


0


M00001614C:F10 


13157 


4 


1 


0 


3 


1


0


M00001617C:E02 


17004 


4 


0 


1 


0 


1


0


M00001619C:F12 


40314 


2 


0 


0 


0


1


0


M00001621C:C08 


40044 


2 


0 


0 


0


0


0


M00001623D:F10 


13913 


2 


1 


2 


0


0


1 
J 


M00001624A:B06 


3277 


10 


11


8


3 


c 

J 


1 


M00001624C:F01 


4309 


4 


13 


3 


10


0


0


M00001630B:H09 


5214 


10 


2 


2 


1


4


-> 


M00001644C:B07 


39171 


2 


0 


0


0


0


0


M00001645A:C12 


19267 


2 


A 

0 




A. 

u 


A 
U 


J 


M00001648C:A01 


4665 


5 


9 


0


0


0


0


M00001657D:C03 


23201 


3 


0 


0


0


J 


A 
U 


M00001657D:F08 


76760 


1 


0 


2 


1


0


c 
D 


M00001662C:A09 


23218 


3 


0


0


0


0


0


M00001663A:E04 


35702 


2 


0 


0 


0 


0


0


M00001669B:F02 


6468 


4 


3 


3 


8 


1 


0 


M00001670C:H02 


14367 


3 


0 


0 


0 


0 


0 


M00001673C:H02 


7015 


6 


3 


1 


2 


1 


1 


M00001675A:C09 


8773 


4 


1 


4 


4 


4 


6 


M00001676B:F05 


11460


4 


2 


0 


0 


0 


0 


M00001677C:E10 


14627 


1 


2 


1 


0 


1 


0 


M00001677D:A07 


7570 


5 


3 


0 


0 


0 


0 


M00001678D:F12 


4416 


9 


5 


2 


6 


1 


3 
Table 5 All Differential Data for Libs 1-4 and 8-9



Clone Name 


Cluster 


Clones in 


Clones in 


Clones in 


Clones in 


Clones in 


Clones in




ID


Lib1


Lib2


Lib3


Lib4


Lib8


Lib9 


M00001679A:A06 


6660 


7 


0 


4 


2 


1 


0 


M00001679A:F10 


26875 


1 


0 


0 


0 


1 


0 


M00001679B:F01 


6298 


2 


4 


5 


3 


1 


0 


M00001679C:F01 


78091 


1 


0 


0 


0 


0 


0 


M00001679D:D03 


10751 


3 


2 


0 


1 


0 


1 


M00001679D:D03 


10751 


3 


2 


0 


1 


0 


1 


M00001680D:F08 


10539 


2 


1 


1 


0 


1 


0 


M00001682C:B12 


17055 


4 


0 


0 


0 


0 


0 


M00001686A:E06 


4622 


7 


6 


4 


2 


3 


0 


M00001688C:F09 


5382 


6 


2 


6 


2 


0 


3 


M00001693C:G01 


4393 


10 


6 


2 


4 


1 


1 


M00001716D:H05 


67252 


1 


0 


0 


1 


0 


0 


M00003741D:C09 


40108 


2 


0 


0 


0 


0 


0 


M00003747D:C05 


11476 


6 


0 


0 


0 


0 


0 


M00003759B:B09


697 


76 


52 


30 


72 


21 


30 


M00003762C:B08 


17076 


4 


0 


0 


0 


0 


0 


M00003763A:F06 


3108 


14 


11 


7 


5 


0 


1 


M00003774C:A03 


67907 


1 


0 


0 


0 


0 


0 


M00003796C:D05 


5619 


3 


5 


3 


3 


0 


4 


M00003826B:A06 


11350 


3 


3 


0 


0 


1 


0 


M00003833A:E05 


21877 


2 


1 


0 


0 


0 


1 


M00003837D:A01 


7899 


5 


4 


0 


2 


1 


0 


M00003839A:D08 


7798 


5 


2 


2 


0 


0 


1 


M00003844C:B11 


6539 


8 


3 


0 


0 


0 


1 


M00003846B:D06 


6874 


6 


3 


0 


0 


0 


0 


M00003851B:D10 


13595 


4 


0 


1 


0 


0 


1 


M00003853A:D04 


5619 


3 


5 


3 


3 


0 


4 


M00003853A:F12 


10515 


5 


1 


0 


1 


1 


2 


M00003856B:C02 


4622 


7 


6 


4 


2 


3 


0 


M00003857A:G10 


3389 


4 


11 


13 


2 


0 


0 


M00003857A:H03 


4718 


4 


5 


5 


2 


4 


6 


M00003871C:E02 


4573 


5 


7 


2 


5 


0 


1 


M00003875B:F04 


12977 


5 


0 


0 


0 


0 


0 


M00003875B:F04


12977 


5 


0 


0 


0 


0 


0 


M00003875C:G07 


8479 


4 


3 


1 


1 


2 


4 


M00003876D:E12 


7798 


5 


2 


2 


0 


0 


1 


M00003879B:C11 


5345 


7 


1 


7 


4 


6 


27 


M00003879B:D10 


31587 


1 


1 


0 


0 


1 


0 


M00003879D:A02 


14507 


3 


1 


0 


0 


3 


1 


M00003885C:A02 


13576 


5 


0 


0 


0 


12 


11 


M00003885C:A02 


13576 


5 


0 


0 


0 


12 


11 


M00003906C:E10 


9285 


4 


3 


0 


0 


1 


2 


M00003907D:A09 


39809 


1 


0 


0 


0 


2 


1 


M00003907D:H04 


16317 


3 


0 


0 


0 


0 


0 


M00003909D:C03 


8672 


4 


4 


0 


0 


0 


0 


M00003912B:D01 


12532 


4 


1 


0 


1 


0 


1 


M00003914C:F05 


3900 


9 


6 


8 


1 


7 


13 


M00003922A:E06 


23255 


3 


0 


1 


1 


0 


0 
Table 5 All Differential Data for Libs 1-4 and 8-9



Clone Name 


Cluster 


Clones in 


Clones in 


Clones in 


Clones in 


Clones in 


Clones in 




ID 


Lib1


Lib2 


Lib3 


Lib4 


Lib8 


Lib9 


M00003958A:H02 


18957 


3 


0 


0 


0 


0 


0 


M00003958A:H02 


18957 


3 


0 


0 


0 


0 


0 


M00003958C:G10 


40455 


2 


0 


0 


0 


0 


0 


M00003958C:G10 


40455 


2 


0 


0 


0 


0 


0 


M00003968B:F06 


24488 


2 


0 


1 


4 


0 


0 


M00003970C:B09 


40122 


2 


0 


0 


0 


0 


0 


M00003974D:E07 


23210 


3 


0 


0 


0 


0 


0 


M00003974D:H02


23358 


3 


0 


0 


0 


1 


0 


M00003975A:G11 


12439 


4 


0 


0 


0 


0 


0 


M00003978B:G05 


5693 


7 


4 


1 


3 


1 


1 


M00003981A:E10 


3430 


9 


10 


7 


3 


0 


0 


M00003982C:C02 


2433 


10 


13 


21 


18 


8 


8 


M00003983A:A05 


9105 


5 


1 


1 


1 


0 


0 


M00004028D:A06 


6124 


4 


8 


1 


9 


1 


0 


M00004028D:C05 


40073 


2 


0 


1 


0 


0 


1 


M00004031A:A12 


9061 


5 


2 


0 


0 


0 


0 


M00004031A:A12 


9061


5 


2 


0 


0 


0 


0 


M00004035C:A07 


37285 


2 


0 


0 


1 


0 


1 


M00004035D:B06 


17036 


4 


0 


0 


0 


0 


0 


M00004059A:D06 


5417 


10 


4 


0 


9 


2 


0 


M00004068B:A01 


3706 


7 


14 


4 


22 


1 


0 


M00004072B:B05 


17036 


4 


0 


0 


0 


0 


0 


M00004081C:D10 


15069


3 


0 


0 


1 


0 


0 


M00004081C:D12 


14391 


3 


1 


0 


0 


0 


0 


M00004086D:G06 


9285 


4 


3 


0 


0 


1 


2 


M00004087D:A01 


6880 


2 


6 


1 


1 


0 


0 


M00004093D:B12 


5325 


5 


5 


2 


0 


2 


1 


M00004093D:B12 


5325 


5 


5 


2 


0 


2 


1 


M00004105C:A04 


7221


5 


2 


2 


2 


0 


0 


M00004108A:E06 


4937 


4 


9 


3 


1 


3 


1 


M00004111D:A08 


6874 


6 


3 


0 


0 


0 


0 


M00004114C:F11 


13183 


2 


3 


0 


7 


0 


1 


M00004138B:H02 


13272


3 


2 


0 


3 


0 


0 


M00004146C:C11 


5257 


2 


8 


5 


5 


5 


25 


M00004151D:B08 


16977 


4 


0 


0 


0 


0 


0 


M00004157C:A09 


6455 


3 


1 


6 


0 


0 


0 


M00004169C:C12 


5319 


6 


2 


8 


2 


2 


3 


M00004171D:B03 


4908 


6 


7 


2 


2 


2 


0 


M00004172C:D08 


11494


4 


0 


0 


0 


0 


0 


M00004183C:D07


16392 


3 


0 


0 


0 


0 


0 


M00004185C:C03


11443


5 


1 


0 


0 


0 


0 


M00004197D:H01


8210 


2 


6 


0 


0 


0 


0 


M00004203B:C12


14311


4 


0 


0 


0 


1 


2 


M00004212B:C07


2379 


26 


13 


4 


2 


2 


3 


M00004214C:H05


11451


3 


2 


1 


2 


1 


1 


M00004223A:G10


16918 


4 


0 


0 


0 


0 


0 


M00004223B:D09 


7899 


5 


4 


0 


2 


1 


0 


M00004223D:E04 


12971 


4 


0 


0 


0 


1 


0 
Table 5 All Differential Data for Libs 1-4 and 8-9



Clone Name 


Cluster 


Clones in 


Clones in




ID


Lib1


Lib2 


M00004229B:F08 


6455 


3 


1 


M00004230B:C07 


7212 


3 


5 


M00004269D:D06 


4905 


7 


6 


M00004275C:C11


16914 


3 


0 


M00004283B:A04 


14286 


3 


1 


M00004285B:E08 


56020 


1 


0 


M00004295D:F12 


16921 


4 


0 


M00004296C:H07 


13046 


4 


1 


M00004307C:A06 


9457 


2 


0 


M00004312A:G03 


26295 


2 


0 


M00004318C:D10 


21847 


2 


1 


M00004372A:A03 


2030 


13 


10 


M00004377C:F05 


2102 


12 


20 



Clones in Clones in Clones in Clones in 



Lib3 


Lib4 


Lib8 


Lib9 


6 


0 


0 


0 


2 


1 


3 


0 


3 


1 


3 


1 


0 


1 


0 


0 


0 


1 


1 


1 


0 


0 


0 


0 


0 


1 


2 


1 


0 


1 


0 


0 


5 


0 


3 


0 


0 


0 


0 


0 


0 


0 


0 


0 


32 


4 


0 


0 


23 


21 


6 


5 
Table 6 All Differential Data for Libs 15-20



Clone Name 


Cluster 


Clones in 


Clones in 


Clones in 


Clones in 


Clones in 


Clones in




ID


Lib15


Lib16b


Lib17


Lib18


Lib19


Lib20 


M00001340B:A06


17062 


0 


0 


0 


0


0




M00001340D:F10 


11589


0


0


0


0


0


0


M00001341A:E12 


4443


0


0


0


1


0


0


M00001342B:E06 


39805 


0 


0 


0


0


0


0


M00001343C:F10 


2790 


0 


0 


0 


0


0


0


M00001343D:H07 


23255 


0 


0 


0 


0


0


0


M00001345A:E01 


6420 


0 


0 


0 


0


0


0


M00001346A:F09 


5007 


0 


0 


0 


0 


0 


0


M00001346D:E03 


6806 


0 


0 


0 


0


0


0


M00001346D:G06


5779 


0 


0 


0 


0 


0 


0


M00001346D:G06 


5779 


0 


0 


0 


0 


0 


0


M00001347A:B10 


13576 


0 


0 


0 


0 


0 


0 


M00001348B:B04 


16927 


0 


0 


0 


0 


0 


0 


M00001348B:G06 


16985 


0 


0 


0 


0 


0 


0 


M00001349B:B08 


3584 


0 


0 


0 


0 


0 


0 


M00001350A:H01 


7187 


0 


0 


0 


0 


0 


0 


M00001351B:A08 


3162 


0 


1 


0 


0 


1 


0 


M00001351B:A08 


3162 


0 


1 


0 


0 


1 


0 


M00001352A:E02 


16245 


0 


0 


0 


0 


0 


0 


M00001353A:G12 


8078 


0 


0 


0 


0 


0 


0 


M00001353D:D10 


14929 


0 


3 


1 


0 


5 


0


M00001355B:G10 


14391 


0 


0 


0 


0 


0 


0


M00001357D:D11 


4059 


0 


0 


0 


0 


0 


0 


M00001361A:A05 


4141 


0 


0 


0 


0 


0 


0


M00001361D:F08 


2379 


0 


0 


0 


0


0


0


M00001362B:D10 


5622 


0 


0 


0 


0


0


0


M00001362C:H11 


945 


0 


0 


0 


0


0


1


M00001365C:C10 


40132 


0 


0 


0 


0 


0 


0


M00001370A:C09 


6867 


0 


0 


0 


0 


0


0


M00001371C:E09 


7172 


0 


0 


0 


0


0


0


M00001376B:G06 


17732 


0 


0 


0 


0


0


1


M00001378B:B02 


39833 


0 


0


0


0


0


0


M00001379A:A05 


1334 


0


0


0


0


0


1


M00001380D:B09 


39886 


0 


0


0


0


0


0


M00001382C:A02


22979


0


0


0


0


0


0


M00001383A:C03 


39648


0


0


0


0


0


0


M00001383A:C03


39648


0


0


0


0


0


0


M00001386C:B12


5178


0


0


0


0


0


0


M00001387A:C05 


2464 


0 


0


0


0


0


0


M00001387B:G03 


7587 


0 


0 


0 


0 


0 


0 


M00001388D:G05


5832 


0 


0 


0 


0 


0 


0 


M00001389A:C08 


16269 


0 


1 


0 


0 


0 


0 


M00001394A:F01


6583 


1 


4 


1 


0 


0 


0 


M00001395A:C03 


4016 


0 


0 


0 


0 


0 


0 


M00001396A:C03 


4009 


0 


0 


0 


0 


0 


0 


M00001402A:E08


39563 


0 


0 


0 


0 


0 


0 


M00001407B:D11


5556 


0 


0 


0 


0 


0 


0 


M00001409C:D12


9577 


0 


0 


0 


0 


0 


0 






WO 99/33982 



PCT/US98/27610 



Table 6 All Differential Data for Libs 15-20 



Clone Name 


Cluster 


Clones in 


Clones in 


Clones in 


Clones in 


Clones in 


Clones in




ID


Lib15


Lib16b


Lib17


Lib18


Lib19


Lib20 


M00001410A:D07 


7005 


0 


0 


0 


0 


0 


0 


M00001412B:B10


8551 


0 


0 


0 


0 


0 


0 


M00001415A:H06 


13538 


0 


0 


0 


0 


0 


0 


M00001416A:H01 


7674 


0 


0 


0 


0 


0 


0 


M00001416B:H11 


8847 


0 


0 


0 


0 


0 


0 


M00001417A:E02 


36393 


0 


0 


0 


0 


0 


0 


M00001418B:F03 


9952 


0 


0 


0 


0 


0 


0 


M00001418D:B06 


8526 


0 


0 


0 


0 


0 


0 


M00001421C:F01 


9577 


0 


0 


0 


0 


0 


0 


M00001423B:E07 


15066 


0 


0 


0 


0 


0 


0 


M00001424B:G09 


10470 


0 


0 


0 


0 


0 


0 


M00001425B:H08 


22195 


0 


0 


0 


0 


0 


0 


M00001426D:C08 


4261 


0 


0 


1 


0 


0 


1 


M00001428A:H10 


84182 


0 


0 


0 


0 


0 


0 


M00001429A:H04 


2797 


0 


0 


0 


0 


0 


0 


M00001429B:A11 


4635 


0 


0 


0 


0 


0 


0 


M00001429D:D07 


40392 


0 


0 


0 


0 


0 


0 


M00001439C:F08 


40054 


0 


0 


0 


0 


0 


0 


M00001442C:D07 


16731 


0 


0 


0 


0 


0 


0 


M00001445A:F05 


13532 


0 


0 


0 


0 


0 


0 


M00001446A:F05 


7801 


0 


0 


0 


0 


0 


0 


M00001447A:G03 


10717 


0 


0 


0 


0 


0 


0 


M00001448D:C09 


8 


1 


6 


6 


1 


14 


1 


M00001448D:H01 


36313 


0 


3 


0 


0 


3 


0 


M00001449A:A12 


5857 


0 


0 


0 


0 


0 


0 


M00001449A:B12 


41633 


0 


0 


0 


0 


0 


0 


M00001449A:D12 


3681 


0 


0 


0 


0 


0 


0 


M00001449A:G10 


36535 


0 


0 


0 


0 


0 


0 


M00001449C:D06 


86110 


0 


0 


0 


0 


0 


0 


M00001450A:A02 


39304 


0 


0 


0 


0 


0 


0 


M00001450A:A11 


32663 


0 


0 


0 


0 


0 


0 


M00001450A:B12 


82498 


0 


0 


0 


0 


0 


0 


M00001450A:D08 


27250 


0 


0 


0 


0 


0 


0 


M00001452A:B04 


84328 


0 


0 


0 


0 


0 


0 


M00001452A:B12 


86859 


0 


0 


0 


0 


0 


0 


M00001452A:D08 


1120 


0 


0 


0 


0 


0 


0 


M00001452A:F05


85064 


0 


0 


0 


0 


0 


0 


M00001452C:B06 


16970 


0 


0 


2 


0 


1 


0 


M00001453A:E11 


16130 


0 


0 


0 


0 


0 


0 


M00001453C:F06 


16653 


0 


0 


0 


0 


0 


0 


M00001454A:A09 


83103 


0 


0 


0 


0 


0 


0 


M00001454B:C12 


7005 


0 


0 


0 


0 


0 


0 


M00001454D:G03 


689 


0 


2 


2 


0 


4 


2 


M00001455A:E09 


13238 


0 


0 


0 


0 


0 


0 


M00001455B:E12 


13072 


0 


0 


0 


0 


0 


0 


M00001455D:F09 


9283 


0 


0 


0 


0 


0 


0 


M00001455D:F09 


9283 


0 


0 


0 


0 


0 


0 


M00001460A:F06 


2448 


0 


0 


0 


0 


0 


0 


M00001460A:F12 


39498 


0 


0 


0 


0 


0 


0 

Table 6 All Differential Data for Libs 15-20

Clone Name      Cluster  Clones in  Clones in  Clones in  Clones in  Clones in  Clones in
                ID       Lib15      Lib16b     Lib17      Lib18      Lib19      Lib20
M00001461A:D06  1531     0          0          0          0          0          0
M00001463C:B11  19       2          13         13         0          69         10
M00001465A:B11  10145    0          0          0          0          0          0
M00001466A:E07  4275     0          0          0          0          0          0
M00001467A:B07  38759    0          0          0          0          0          0
M00001467A:D04  39508    0          0          0          0          0          0
M00001467A:D08  16283    0          0          0          0          0          0
M00001467A:D08  16283    0          0          0          0          0          0
M00001467A:E10  39442    0          0          0          0          0          0
M00001468A:F05  7589     0          0          0          0          0          0
M00001469A:C10  12081    0          0          0          0          0          0
M00001469A:H12  19105    0          0          0          0          0          0
M00001470A:B10  1037     0          0          0          0          0          0
M00001470A:C04  39425    0          0          0          0          0          0
M00001471A:B01  39478    0          0          0          0          0          0
M00001481D:A05  7985     0          0          0          0          0          0
M00001490B:C04  18699    0          0          0          0          0          0
M00001494D:F06  7206     0          0          0          0          0          0
M00001497A:G02  2623     0          0          0          0          0          0
M00001499B:A11  10539    0          0          0          0          0          0
M00001500A:C05  5336     0          0          0          0          0          0
M00001500A:E11  2623     0          0          0          0          0          0
M00001500C:E04  9443     0          0          0          0          0          0
M00001501D:C02  9685     0          0          0          0          0          0
M00001504C:A07  10185    0          0          0          0          0          0
M00001504C:H06  6974     0          0          0          0          0          0
M00001504D:G06  6420     0          0          0          0          0          0
M00001507A:H05  39168    0          0          0          0          0          0
M00001511A:H06  39412    0          0          0          0          0          0
M00001512A:A09  39186    0          0          0          0          0          0
M00001512D:G09  3956     0          0          1          0          0          0
M00001513A:B06  4568     0          0          0          0          0          0
M00001513C:E08  14364    0          0          0          0          0          0
M00001514C:D11  40044    0          1          0          0          0          0
M00001517A:B07  4313     0          0          0          0          0          0
M00001518C:B11  8952     0          0          0          0          0          0
M00001528A:C04  7337     0          0          0          0          0          0
M00001528A:F09  18957    0          0          0          0          0          0
M00001528B:H04  8358     0          0          0          0          0          0
M00001531A:D01  38085    0          0          0          0          0          0
M00001532B:A06  3990     1          1          0          0          0          0
M00001533A:C11  2428     0          0          1          0          0          0
M00001534A:C04  16921    0          0          0          0          0          0
M00001534A:D09  5097     0          0          0          0          0          0
M00001534A:F09  5321     0          1          0          0          2          0
M00001534C:A01  4119     0          0          0          0          0          0
M00001535A:B01  7665     0          0          0          0          0          0
M00001535A:C06  20212    0          0          0          0          0          0
M00001535A:F10  39423    0          0          0          0          0          0
M00001536A:B07  2696     0          0                     A          J          A
M00001536A:C08  39392    0          0          0          A          A          A
M00001537A:F12  39420    0          0          0          0          0          0
M00001537B:G07  3389     0          0          0          0          0          0
M00001540A:D06  8286     0          0          0          0          0          0
M00001541A:D02  3765     0          0          0          0          0          0
M00001541A:F07  22085    0          0          0          0          0          0
M00001541A:H03  39174    0          0          0          0          0          0
M00001542A:A09  22113    0          0          0          0          0          0
M00001542A:E06  39453    0          0          0          0          0          0
M00001544A:E03  12170    0          0          0          0          0          0
M00001544A:G02  19829    0          0          0          0          0          0
M00001544B:B07  6974     0          0          0          0          0          0
M00001545A:C03  19255    0          0          0          0          0          0
M00001545A:D08  13864    0          0          0          0          0          0
M00001546A:G11  1267     1          0          0          0          7          0
M00001548A:E10  5892     0          0          0          0          0          0
M00001548A:H09  1058     0          0          1          0          0          0
M00001549A:B02  4015     0          0          0          0          0          0
M00001549A:D08  10944    0          0          0          0          0          0
M00001549B:F06  4193     0          0          0          0          0          0
M00001549C:E06  16347    0          0          0          0          0          0
M00001550A:A03  7239     0          0          0          0          0          0
M00001550A:G01  5175     0          0          0          0          0          0
M00001551A:B10  6268     0          0          0          0          0          0
M00001551A:F05  39180    0          0          0          0          0          0
M00001551A:G06  22390    0          0          0          0          0          0
M00001551C:G09  3266     0          0          1          0          0          0
M00001552A:B12  307      0          0          0          0          3          0
M00001552A:D11  39458    0          0          0          0          0          0
M00001552B:D04  5708     0          1          0          0          0          0
M00001553A:H06  8298     0          0          0          0          0          0
M00001553B:F12  4573     0          0          0          0          0          0
M00001553D:D10  22814    0          0          0          0          0          0
M00001555A:B02  39539    0          0          0          0          0          0
M00001555A:C01  39195    0          0          0          0          0          0
M00001555D:G10  4561     0          0          0          0          0          0
M00001556A:C09  9244     0          0          0          0          0          0
M00001556A:F11  1577     0          0          0          0          0          0
M00001556A:H01  15855    3          5          5          0          3          1
M00001556B:C08  4386     1          2          0          0          0          0
M00001556B:G02  11294    0          0          0          0          0          0
M00001557A:D02  7065     0          0          0          0          0          0
M00001557A:D02  7065     0          0          0          0          0          0
M00001557A:F01  9635     0          0          0          0          0          0
M00001557A:F03  39490    0          0          0          0          0          0
M00001557B:H10  5192     0          0          0          0          0          0
M00001557D:D09  8761     0          0          0          0          0          0
M00001558B:H11  7514     0          0          0          0          0          0
M00001560D:F10  6558     0          0          0          0          0          0
M00001561A:C05  39486    0          0          0          0          0          0
M00001563B:F06  102      22         38         65         7          43         10
M00001564A:B12  5053     0          0          1          0          0          0
M00001571C:H06  5749     0          0          0          0          0          0
M00001578B:E04  23001    0          0          0          0          0          0
M00001579D:C03  6539     0          0          0          0          0          0
M00001583D:A10  6293     0          0          0          0          0          0
M00001586C:C05  4623     0          0          0          0          1          0
M00001587A:B11  39380    0          0          0          0          0          0
M00001594B:H04  260      0          0          0          0          1          0
M00001597C:H02  4837     0          0          0          0          0          0
M00001597D:C05  10470    0          0          0          0          0          0
M00001598A:G03  16999    1          f          1          0          0          0
M00001601A:D08  22794    0          0          0          0          0          0
M00001604A:B10  1399     0          0          0          0          0          0
M00001604A:F05  39391    0          0          0          0          0          0
M00001607A:E11  11465    0          0          0          0          0          0
M00001608A:B03  7802     0          0          0          0          0          0
M00001608B:E03  22155    0          0          0          0          0          0
M00001614C:F10  13157    0          0          0          0          0          0
M00001617C:E02  17004    0          0          0          0          1          0
M00001619C:F12  40314    0          0          0          0          0          0
M00001621C:C08  40044    0          1          0          0          0          0
M00001623D:F10  13913    0          0          0          0          0          0
M00001624A:B06  3277     0          0          0          0          0          0
M00001624C:F01  4309     0          0          0          0          0          0
M00001630B:H09  5214     1          0          0          1          1          0
M00001644C:B07  39171    0          0          0          0          0          0
M00001645A:C12  19267    0          0          0          0          1          0
M00001648C:A01  4665     0          0          0          0          0          0
M00001657D:C03  23201    0          0          0          0          0          0
M00001657D:F08  76760    0          0          0          0          0          0
M00001662C:A09  23218    0          0          0          0          0          0
M00001663A:E04  35702    0          0          0          0          0          0
M00001669B:F02  6468     0          0          0          0          0          0
M00001670C:H02  14367    0          0          0          0          0          0
M00001673C:H02  7015     0          0          0          0          0          0
M00001675A:C09  8773     0          0          0          0          0          0
M00001676B:F05  11460    0          0          0          0          0          0
M00001677C:E10  14627    0          1          0          0          0          0
M00001677D:A07  7570     0          0          0          0          0          0
M00001678D:F12  4416     0          0          0          0          0          0
M00001679A:A06  6660     0          0          0          0          0          0
M00001679A:F10  26875    0          0          0          0          0          0
M00001679B:F01  6298     0          0          0          0          0          0
M00001679C:F01  78091    0          0          0          0          0          0
M00001679D:D03  10751    0          0          0          0          0          0
M00001679D:D03  10751    0          0          0          0          0          0






WO 99/33982 



PCT/US98/27610 



Table 6 All Differential Data for Libs 15-20

Clone Name      Cluster  Clones in  Clones in  Clones in  Clones in  Clones in  Clones in
                ID       Lib15      Lib16b     Lib17      Lib18      Lib19      Lib20
M00001680D:F08  10539    0          0          0          0          0          0
M00001682C:B12  17055    0          0          0          0          0          0
M00001686A:E06  4622     0          0          0          0          0          0
M00001688C:F09  5382     0          0          0          0          0          0
M00001693C:G01  4393     0          0          0          0          0          0
M00001716D:H05  67252    0          0          0          0          0          0
M00003741D:C09  40108    0          0          0          0          0          0
M00003747D:C05  11476    0          0          0          0          0          0
M00003759B:B09  697      0          0          0          0          1          0
M00003762C:B08  17076    0          0          0          0          0          0
M00003763A:F06  3108     0          0          0          0          0          0
M00003774C:A03  67907    0          0          0          0          0          0
M00003796C:D05  5619     0          0          0          0          0          0
M00003826B:A06  11350    0          0          0          0          0          0
M00003833A:E05  21877    0          0          0          0          0          0
M00003837D:A01  7899     0          0          0          0          0          0
M00003839A:D08  7798     0          0          0          0          0          0
M00003844C:B11  6539     0          0          0          0          0          0
M00003846B:D06  6874     0          0          1          0          0          0
M00003851B:D10  13595    0          0          0          0          0          0
M00003853A:D04  5619     0          0          0          0          0          0
M00003853A:F12  10515    0          0          0          0          0          0
M00003856B:C02  4622     0          0          0          0          0          0
M00003857A:G10  3389     0          0          0          0          0          0
M00003857A:H03  4718     0          0          0          0          0          0
M00003871C:E02  4573     0          0          0          0          0          0
M00003875B:F04  12977    0          0          0          0          0          0
M00003875B:F04  12977    0          0          0          0          0          0
M00003875C:G07  8479     0          0          0          0          0          1
M00003876D:E12  7798     0          0          0          0          0          0
M00003879B:C11  5345     0          0          0          2          0          1
M00003879B:D10  31587    0          0          0          0          0          0
M00003879D:A02  14507    0          0          0          0          0          0
M00003885C:A02  13576    0          0          0          0          0          0
M00003885C:A02  13576    0          0          0          0          0          0
M00003906C:E10  9285     0          0          0          0          0          0
M00003907D:A09  39809    0          0          0          0          0          0
M00003907D:H04  16317    0          0          0          0          0          0
M00003909D:C03  8672     0          0          0          0          0          0
M00003912B:D01  12532    0          0          0          0          0          0
M00003914C:F05  3900     0          0          0          0          1          0
M00003922A:E06  23255    0          0          0          0          0          0
M00003958A:H02  18957    0          0          0          0          0          0
M00003958A:H02  18957    0          0          0          0          0          0
M00003958C:G10  40455    0          0          0          0          0          0
M00003958C:G10  40455    0          0          0          0          0          0
M00003968B:F06  24488    0          0          0          0          0          0
M00003970C:B09  40122    0          0          0          0          0          0
M00003974D:E07  23210    0          0          0          0          0          0
M00003974D:H02  23358    0          0          0          0          0          0
M00003975A:G11  12439    0          0          0          0          0          0
M00003978B:G05  5693     0          0          0          0          0          0
M00003981A:E10  3430     0          0          0          0          1          0
M00003982C:C02  2433     0          0          0          0          0          0
M00003983A:A05  9105     0          0          0          0          0          0
M00004028D:A06  6124     0          0          0          0          0          0
M00004028D:C05  40073    0          0          0          0          0          0
M00004031A:A12  9061     0          0          0          0          0          0
M00004031A:A12  9061     0          0          0          0          0          0
M00004035C:A07  37285    0          0          0          0          0          0
M00004035D:B06  17036    0          0          0          0          0          0
M00004059A:D06  5417     0          0          0          0          0          0
M00004068B:A01  3706     0          0          0          0          0          0
M00004072B:B05  17036    0          0          0          0          0          0
M00004081C:D10  15069    0          0          0          0          0          0
M00004081C:D12  14391    0          0          0          0          0          0
M00004086D:G06  9285     0          0          0          0          0          0
M00004087D:A01  6880     0          0          0          0          0          0
M00004093D:B12  5325     1          1          0          1          0          1
M00004093D:B12  5325     1          1          0          1          0          1
M00004105C:A04  7221     0          0          0          0          0          0
M00004108A:E06  4937     0          0          0          0          0          0
M00004111D:A08  6874     0          0          1          0          0          0
M00004114C:F11  13183    0          0          0          0          0          0
M00004138B:H02  13272    0          0          0          0          0          0
M00004146C:C11  5257     0          1          0          0          0          0
M00004151D:B08  16977    0          0          0          0          0          0
M00004157C:A09  6455     0          0          0          0          0          0
M00004169C:C12  5319     0          0          0          0          0          0
M00004171D:B03  4908     0          0          0          0          0          0
M00004172C:D08  11494    0          0          0          0          0          0
M00004183C:D07  16392    0          0          0          0          0          0
M00004185C:C03  11443    0          0          0          0          0          0
M00004197D:H01  8210     0          0          0          0          0          0
M00004203B:C12  14311    0          0          0          0          0          0
M00004212B:C07  2379     0          0          0          0          0          0
M00004214C:H05  11451    0          0          0          0          0          0
M00004223A:G10  16918    0          0          0          0          0          0
M00004223B:D09  7899     0          0          0          0          0          0
M00004223D:E04  12971    0          0          0          0          0          0
M00004229B:F08  6455     0          0          0          0          0          0
M00004230B:C07  7212     0          0          0          0          0          0
M00004269D:D06  4905     0          0          0          0          0          0
M00004275C:C11  16914    0          0          0          0          0          0
M00004283B:A04  14286    0          0          0          0          0          0
M00004285B:E08  56020    0          0          0          0          0          0
M00004295D:F12  16921    0          0          0          0          0          0
M00004296C:H07  13046    0          0          0          0          0          0
M00004307C:A06  9457     0          0          0          0          0          0
M00004312A:G03  26295    0          0          0          0          0          0
M00004318C:D10  21847    0          0          0          0          0          0
M00004372A:A03  2030     0          0          0          0          0          0
M00004377C:F05  2102     0          0          0          0          0          0

Table 7 All Differential Data for Libs 12-14

Clone Name      Cluster ID  Clones in  Clones in  Clones in
                            Lib12      Lib13      Lib14
M00001340B:A06  17062       0          0          0
M00001340D:F10  11589       0          0          0
M00001341A:E12  4443        4          2          0
M00001342B:E06  39805       0          0          0
M00001343C:F10  2790        0          0          0
M00001343D:H07  23255       0          0          0
M00001345A:E01  6420        0          0          0
M00001346A:F09  5007        0          0          0
M00001346D:E03  6806        0          1          1
M00001346D:G06  5779        0          0          0
M00001346D:G06  5779        0          0          0
M00001347A:B10  13576       0          0          0
M00001348B:B04  16927       0          0          0
M00001348B:G06  16985       0          0          0
M00001349B:B08  3584        0          0          0
M00001350A:H01  7187        0          0          0
M00001351B:A08  3162        0          0          1
M00001351B:A08  3162        0          0          1
M00001352A:E02  16245       0          0          0
M00001353A:G12  8078        0          0          0
M00001353D:D10  14929       0          1          0
M00001355B:G10  14391       0          0          0
M00001357D:D11  4059        0          0          0
M00001361A:A05  4141        1          2          1
M00001361D:F08  2379        0          0          0
M00001362B:D10  5622        0          2          1
M00001362C:H11  945         0          0          0
M00001365C:C10  40132       0          0          0
M00001370A:C09  6867        0          0          0
M00001371C:E09  7172        0          0          1
M00001376B:G06  17732       2          0          0
M00001378B:B02  39833       0          0          0
M00001379A:A05  1334        0          0          0
M00001380D:B09  39886       0          0          0
M00001382C:A02  22979       1          0          0
M00001383A:C03  39648       0          0          0
M00001383A:C03  39648       0          0          0
M00001386C:B12  5178        0          0          0
M00001387A:C05  2464        0          0          0
M00001387B:G03  7587        0          0          0
M00001388D:G05  5832        0          0          0
M00001389A:C08  16269       2          0          0
M00001394A:F01  6583        0          0          0
M00001395A:C03  4016        0          0          0
M00001396A:C03  4009        2          0          0
M00001402A:E08  39563       0          0          0
M00001407B:D11  5556        0          0          0
M00001409C:D12  9577        0          0          0
M00001410A:D07  7005        0          0          0
M00001412B:B10  8551        0          0          0
M00001415A:H06  13538       0          0          0
M00001416A:H01  7674        0          0          0
M00001416B:H11  8847        1          0          0
M00001417A:E02  36393       0          0          0
M00001418B:F03  9952        0          0          0
M00001418D:B06  8526        0          0          0
M00001421C:F01  9577        0          0          0
M00001423B:E07  15066       0          0          0
M00001424B:G09  10470       0          0          0
M00001425B:H08  22195       0          0          0
M00001426D:C08  4261        0          0          0
M00001428A:H10  84182       0          0          0
M00001429A:H04  2797        0          0          0
M00001429B:A11  4635        0          0          0
M00001429D:D07  40392       0          0          0
M00001439C:F08  40054       0          0          0
M00001442C:D07  16731       0          0          0
M00001445A:F05  13532       0          0          0
M00001446A:F05  7801        0          1          0
M00001447A:G03  10717       0          0          0
M00001448D:C09  8           7          6          9
M00001448D:H01  36313       1          0          0
M00001449A:A12  5857        0          0          0
M00001449A:B12  41633       0          0          0
M00001449A:D12  3681        1          0          0
M00001449A:G10  36535       0          0          0
M00001449C:D06  86110       0          0          0
M00001450A:A02  39304       0          1          0
M00001450A:A11  32663       0          0          0
M00001450A:B12  82498       0          0          0
M00001450A:D08  27250       0          0          0
M00001452A:B04  84328       0          0          0
M00001452A:B12  86859       0          0          0
M00001452A:D08  1120        0          0          0
M00001452A:F05  85064       0          0          0
M00001452C:B06  16970       1          0          0
M00001453A:E11  16130       0          0          0
M00001453C:F06  16653       0          0          0
M00001454A:A09  83103       0          0          0
M00001454B:C12  7005        0          0          0
M00001454D:G03  689         0          0          1
M00001455A:E09  13238       0          0          0
M00001455B:E12  13072       0          0          0
M00001455D:F09  9283        0          0          0
M00001455D:F09  9283        0          0          0
M00001460A:F06  2448        0          0          0
M00001460A:F12  39498       0          0          0
M00001461A:D06  1531        0          0          1
M00001463C:B11  19          17         32         31
M00001465A:B11  10145       0          0          0
M00001466A:E07  4275        0          0          0
M00001467A:B07  38759       0          0          0
M00001467A:D04  39508       0          0          0
M00001467A:D08  16283       0          0          0
M00001467A:D08  16283       0          0          0
M00001467A:E10  39442       0          0          0
M00001468A:F05  7589        0          0          0
M00001469A:C10  12081       0          0          0
M00001469A:H12  19105       0          0          0
M00001470A:B10  1037        0          0          0
M00001470A:C04  39425       0          0          0
M00001471A:B01  39478       0          0          0
M00001481D:A05  7985        0          0          0
M00001490B:C04  18699       0          0          0
M00001494D:F06  7206        0          0          0
M00001497A:G02  2623        1          0          0
M00001499B:A11  10539       0          1          0
M00001500A:C05  5336        0          0          0
M00001500A:E11  2623        1          0          0
M00001500C:E04  9443        0          0          0
M00001501D:C02  9685        0          0          0
M00001504C:A07  10185       0          0          0
M00001504C:H06  6974        0          0          0
M00001504D:G06  6420        0          0          0
M00001507A:H05  39168       0          0          0
M00001511A:H06  39412       0          0          0
M00001512A:A09  39186       0          0          0
M00001512D:G09  3956        0          0          0
M00001513A:B06  4568        0          0          0
M00001513C:E08  14364       0          0          0
M00001514C:D11  40044       0          0          0
M00001517A:B07  4313        0          0          0
M00001518C:B11  8952        0          0          0
M00001528A:C04  7337        1          2          2
M00001528A:F09  18957       0          0          0
M00001528B:H04  8358        0          0          0
M00001531A:D01  38085       0          0          0
M00001532B:A06  3990        0          0          0
M00001533A:C11  2428        0          0          0
M00001534A:C04  16921       0          0          0
M00001534A:D09  5097        0          0          0
M00001534A:F09  5321        4          7          6
M00001534C:A01  4119        0          0          0
M00001535A:B01  7665        0          2          4
M00001535A:C06  20212       0          0          0
M00001535A:F10  39423       0          0          0
M00001536A:B07  2696        0          0          0
M00001536A:C08  39392       0          0          0
M00001537A:F12  39420       0          0          0
M00001537B:G07  3389        0          0          0
M00001540A:D06  8286        0          0          0
M00001541A:D02  3765        0          0          0
M00001541A:F07  22085       0          0          0
M00001541A:H03  39174       0          0          0
M00001542A:A09  22113       0          0          0
M00001542A:E06  39453       0          0          0
M00001544A:E03  12170       0          0          0
M00001544A:G02  19829       0          0          0
M00001544B:B07  6974        0          0          0
M00001545A:C03  19255       0          0          0
M00001545A:D08  13864       0          0          0
M00001546A:G11  1267        0          0          0
M00001548A:E10  5892        0          1          0
M00001548A:H09  1058        1          3          0
M00001549A:B02  4015        0          1          0
M00001549A:D08  10944       1          0          0
M00001549B:F06  4193        0          0          0
M00001549C:E06  16347       0          0          0
M00001550A:A03  7239        0          1          0
M00001550A:G01  5175        1          0          0
M00001551A:B10  6268        0          0          1
M00001551A:F05  39180       0          0          0
M00001551A:G06  22390       0          0          1
M00001551C:G09  3266        0          0          0
M00001552A:B12  307         6          11         4
M00001552A:D11  39458       0          0          0
M00001552B:D04  5708        0          0          0
M00001553A:H06  8298        0          0          0
M00001553B:F12  4573        0          0          0
M00001553D:D10  22814       0          0          0
M00001555A:B02  39539       0          0          0
M00001555A:C01  39195       0          0          0
M00001555D:G10  4561        0          0          0
M00001556A:C09  9244        0          1          0
M00001556A:F11  1577        0          0          2
M00001556A:H01  15855       1          1          0
M00001556B:C08  4386        3          0          1
M00001556B:G02  11294       0          0          0
M00001557A:D02  7065        0          0          0
M00001557A:D02  7065        0          0          0
M00001557A:F01  9635        0          0          0
M00001557A:F03  39490       0          0          0
M00001557B:H10  5192        0          0          0
M00001557D:D09  8761        0          0          0
M00001558B:H11  7514        0          0          0
M00001560D:F10  6558        0          0          0
M00001561A:C05  39486       0          0          0
M00001563B:F06  102         2          1          2
M00001564A:B12  5053        0          0          0
M00001571C:H06  5749        0          0          0
M00001578B:E04  23001       0          0          0
M00001579D:C03  6539        0          0          0
M00001583D:A10  6293        0          0          0
M00001586C:C05  4623        0          0          0
M00001587A:B11  39380       0          0          0
M00001594B:H04  260         1          0          0
M00001597C:H02  4837        1          0          0
M00001597D:C05  10470       0          0          0
M00001598A:G03  16999       4          2          6
M00001601A:D08  22794       0          0          0
M00001604A:B10  1399        6          3          3
M00001604A:F05  39391       0          0          0
M00001607A:E11  11465       0          0          0
M00001608A:B03  7802        0          0          0
M00001608B:E03  22155       0          0          0
M00001614C:F10  13157       0          0          0
M00001617C:E02  17004       0          0          0
M00001619C:F12  40314       0          0          0
M00001621C:C08  40044       0          0          0
M00001623D:F10  13913       0          0          0
M00001624A:B06  3277        0          0          0
M00001624C:F01  4309        0          0          0
M00001630B:H09  5214        0          1          2
M00001644C:B07  39171       0          0          0
M00001645A:C12  19267       0          0          0
M00001648C:A01  4665        0          0          0
M00001657D:C03  23201       0          0          0
M00001657D:F08  76760       0          0          0
M00001662C:A09  23218       0          0          0
M00001663A:E04  35702       0          0          0
M00001669B:F02  6468        0          0          0
M00001670C:H02  14367       0          0          0
M00001673C:H02  7015        0          0          0
M00001675A:C09  8773        0          0          0
M00001676B:F05  11460       2          0          0
M00001677C:E10  14627       0          0          0
M00001677D:A07  7570        0          0          0
M00001678D:F12  4416        1          2          0
M00001679A:A06  6660        0          0          0
M00001679A:F10  26875       0          0          0
M00001679B:F01  6298        0          0          0
M00001679C:F01  78091       0          0          0
M00001679D:D03  10751       0          0          0
M00001679D:D03  10751       0          0          0
M00001680D:F08  10539       0          1          0
M00001682C:B12  17055       0          0          0
M00001686A:E06  4622        0          0          0
M00001688C:F09  5382        0          0          0
M00001693C:G01  4393        0          0          0
M00001716D:H05  67252       0          0          0
M00003741D:C09  40108       0          0          0
M00003747D:C05  11476       0          0          0
M00003759B:B09  697         0          0          0
M00003762C:B08  17076       0          0          0
M00003763A:F06  3108        0          0          0
M00003774C:A03  67907       0          0          0
M00003796C:D05  5619        0          1          0
M00003826B:A06  11350       0          0          0
M00003833A:E05  21877       0          0          0
M00003837D:A01  7899        0          0          0
M00003839A:D08  7798        0          0          0
M00003844C:B11  6539        0          0          0
M00003846B:D06  6874        0          0          0
M00003851B:D10  13595       0          0          0
M00003853A:D04  5619        0          1          0
M00003853A:F12  10515       0          0          1
M00003856B:C02  4622        0          0          0
M00003857A:G10  3389        0          0          0
M00003857A:H03  4718        0          0          0
M00003871C:E02  4573        0          0          0
M00003875B:F04  12977       0          0          0
M00003875B:F04  12977       0          0          0
M00003875C:G07  8479        1          0          0
M00003876D:E12  7798        0          0          0
M00003879B:C11  5345        4          8          3
M00003879B:D10  31587       0          0          0
M00003879D:A02  14507       0          0          0
M00003885C:A02  13576       0          0          0
M00003885C:A02  13576       0          0          0
M00003906C:E10  9285        0          0          0
M00003907D:A09  39809       0          0          0
M00003907D:H04  16317       0          0          0
M00003909D:C03  8672        0          0          0
M00003912B:D01  12532       0          0          0
M00003914C:F05  3900        0          1          0
M00003922A:E06  23255       0          0          0
M00003958A:H02  18957       0          0          0
M00003958A:H02  18957       0          0          0
M00003958C:G10  40455       0          0          0
M00003958C:G10  40455       0          0          0
M00003968B:F06  24488       0          0          0
M00003970C:B09  40122       0          0          0
M00003974D:E07  23210       0          A          0
M00003974D:H02  23358       0          A          0
M00003975A:G11  12439       0          0          0
M00003978B:G05  5693        0          0          0
M00003981A:E10  3430        0          0          0
M00003982C:C02  2433        2          4          0
M00003983A:A05  9105        0          0          0
M00004028D:A06  6124        0          0          0
M00004028D:C05  40073       0          1          0
M00004031A:A12  9061        0          0          0
M00004031A:A12  9061        0          0          0
M00004035C:A07  37285       0          0          0
M00004035D:B06  17036       0          0          0
M00004059A:D06  5417        0          0          0
M00004068B:A01  3706        0          0          0
M00004072B:B05  17036       0          0          0
M00004081C:D10  15069       0          0          0
M00004081C:D12  14391       0          0          0
M00004086D:G06  9285        0          0          0
M00004087D:A01  6880        0          0          0
M00004093D:B12  5325        0          0          0
M00004093D:B12  5325        0          0          0
M00004105C:A04  7221        0          0          0
M00004108A:E06  4937        0          A          0
M00004111D:A08  6874        0          0          0
M00004114C:F11  13183       0          0          0
M00004138B:H02  13272       0          0          0
M00004146C:C11  5257        0          A          1
M00004151D:B08  16977       0          0          0
M00004157C:A09  6455        0          0          0
M00004169C:C12  5319        0          0          0
M00004171D:B03  4908        0          0          0
M00004172C:D08  11494       0          0          0
M00004183C:D07  16392       0          0          0
M00004185C:C03  11443       2          0          0
M00004197D:H01  8210        0          0          0
M00004203B:C12  14311       0          0          0
M00004212B:C07  2379        0          0          0
M00004214C:H05  11451       0          0          0
M00004223A:G10  16918       0          0          0
M00004223B:D09  7899        0          0          0
M00004223D:E04  12971       0          0          0
M00004229B:F08  6455        0          0          0
Table 7 All Differential Data for Libs 12-14



PCT/US98/27610 



Clone Name        Cluster ID   Clones in Lib12   Clones in Lib13   Clones in Lib14
M00004230B:C07    7212         0                 0                 1
M00004269D:D06    4905         0                 0                 0
M00004275C:C11    16914        0                 0                 A
M00004283B:A04    14286        0                 0                 0
M00004285B:E08    56020        0                 0                 0
M00004295D:F12    16921        0                 0                 0
M00004296C:H07    13046        0                 0                 0
M00004307C:A06    9457         1                 0                 0
M00004312A:G03    26295        0                 0                 0
M00004318C:D10    21847        0                 0                 0
M00004372A:A03    2030         0                 0                 0
M00004377C:F05    2102         0                 0                 0

1. A library of polynucleotides, the library comprising the sequence information of
at least one of SEQ ID NOS: 1-844.

2. The library of claim 1, wherein the library is provided on a nucleic acid array.

3. The library of claim 1, wherein the library is provided in a computer-readable
format.

4. The library of claim 1, wherein the library comprises a differentially expressed
polynucleotide comprising a sequence selected from the group consisting of SEQ ID NOS:9,
39, 42, 52, 62, 74, 119, 172, 317, and 379.

5. The library of claim 1, wherein the library comprises a polynucleotide
differentially expressed in a human breast cancer cell, where the polynucleotide comprises a
sequence selected from the group consisting of SEQ ID NOS: 4, 9, 39, 42, 52, 62, 65, 66, 68,
74, 81, 114, 123, 144, 130, 157, 162, 172, 178, 183, 202, 214, 219, 223, 258, 298, 317, 338,
379, 384, 386, and 388.

6. The library of claim 1, wherein the library comprises a polynucleotide
differentially expressed in a human colon cancer cell, where the polynucleotide comprises a
sequence selected from the group consisting of SEQ ID NOS: 1, 39, 52, 97, 119, 134, 172,
176, 241, 288, 317, 357, 362, and 374.

7. The library of claim 1, wherein the library comprises a polynucleotide
differentially expressed in a human lung cancer cell, where the polynucleotide comprises a
sequence selected from the group consisting of SEQ ID NOS: 9, 34, 42, 62, 74, 106, 119,
135, 154, 160, 260, 308, 323, 349, 361, 369, 371, 379, 395, 381, and 400.

8. An isolated polynucleotide comprising a nucleotide sequence having at least 90%
sequence identity to an identifying sequence of SEQ ID NOS: 1-844 or a degenerate variant
thereof.

9. An isolated polynucleotide according to claim 8, wherein the polynucleotide
comprises a sequence encoding a polypeptide of a protein family selected from the group
consisting of: 4 transmembrane segments integral membrane proteins, 7 transmembrane
receptors, ATPases associated with various cellular activities (AAA), eukaryotic aspartyl
proteases, GATA family of transcription factors, G-protein alpha subunit, phorbol
esters/diacylglycerol binding proteins, protein kinase, protein phosphatase 2C, protein
tyrosine phosphatase, trypsin, wnt family of developmental signaling proteins, and
WW/rsp5/WWP domain containing proteins.

10. The polynucleotide of claim 9, wherein the polynucleotide comprises a
sequence of one of SEQ ID NOS: 24, 41, 101, 157, 291, 305, 315, 341, 63, 116, 134, 136,
151, 384, 404, 308, 213, 367, 188, 251, 202, 315, 367, 397, 256, 382, 169, 23, 291, 324, 330,
341, 353, 188, 379, and 395.

11. The polynucleotide of claim 8, wherein the polynucleotide comprises a
sequence encoding a polypeptide having a functional domain selected from the group
consisting of: Ank repeat, basic region plus leucine zipper transcription factors,
bromodomain, EF-hand, SH3 domain, WD domain/G-beta repeats, zinc finger (C2H2 type),
zinc finger (CCHC class), and zinc-binding metalloprotease domain.

12. The polynucleotide of claim 11, wherein the polynucleotide comprises a
sequence of one of SEQ ID NOS: 116, 251, 374, 97, 136, 242, 379, 306, 386, 18, 335, 61,
306, 386, 322, 306, and 395.

13. A recombinant host cell containing the polynucleotide of claim 8.

14. An isolated polypeptide encoded by the polynucleotide of claim 8.

15. An antibody that specifically binds a polypeptide of claim 14.

16. A vector comprising the polynucleotide of claim 8.

17. A polynucleotide comprising the nucleotide sequence of an insert contained in a
clone deposited as ATCC accession number xx, xx, xx, xx, xx, xx, xx, xx, or xx.

18. A method of detecting differentially expressed genes correlated with a cancerous
state of a mammalian cell, the method comprising the step of:

detecting at least one differentially expressed gene product in a test sample derived
from a cell suspected of being cancerous, where the gene product is encoded by a gene
corresponding to a sequence of at least one of SEQ ID NOS:4, 9, 39, 42, 52, 62, 65, 66, 68,
74, 81, 114, 123, 144, 130, 157, 162, 172, 178, 183, 202, 214, 219, 223, 258, 298, 317, 338,
379, 384, 386, 388, 1, 39, 52, 97, 119, 134, 172, 176, 241, 288, 317, 357, 362, 374, 9, 34, 42,
62, 74, 106, 119, 135, 154, 160, 260, 308, 323, 349, 361, 369, 371, 379, 395, 381, and 400;

wherein detection of the differentially expressed gene product is correlated with a
cancerous state of the cell from which the test sample was derived.

19. The method of claim 18, wherein said detecting step is by hybridization of the
test sample to a reference array, wherein the reference array comprises an identifying
sequence of at least one of SEQ ID NOS:1-844.

20. The method of claim 18, wherein the cell is a breast tissue derived cell, and the
differentially expressed gene product is encoded by a gene corresponding to a sequence of at
least one of SEQ ID NOS: 4, 9, 39, 42, 52, 62, 65, 66, 68, 74, 81, 114, 123, 144, 130, 157,
162, 172, 178, 183, 202, 214, 219, 223, 258, 298, 317, 338, 379, 384, 386, and 388.

21. The method of claim 18, wherein the cell is a colon tissue derived cell, and the
differentially expressed gene product is encoded by a gene corresponding to a sequence of at
least one of SEQ ID NOS: 1, 39, 52, 97, 119, 134, 172, 176, 241, 288, 317, 357, 362, and
374.

22. The method of claim 18, wherein the cell is a lung tissue derived cell, and the
differentially expressed gene product is encoded by a gene corresponding to a sequence of at
least one of SEQ ID NOS: 9, 34, 42, 62, 74, 106, 119, 135, 154, 160, 260, 308, 323, 349,
361, 369, 371, 379, 395, 381, and 400.

Ivan Labat
David Kita
<120> Novel Human Genes and Gene Expression Products I

<130> 2300-14B0P

<140> 09/ 

<141> 1998-12-21 

<150> 60/068,755 
<151> 1997-12-23 

<150> 60/080,664 
<151> 1998-04-03 

<150> 60/105,234 
<151> 1998-10-21 

<160> 844 

<170> FastSEQ for Windows Version 3.0 

<210> 1 

<211> 300 

<212> DNA 

<213> Homo sapiens 
<400> 1

tctcccctga gctgcaggcc tgcatatcca gtaggtctac tggacatctg tactggttgt 60 

tgnggaggaa cctctggctt gctcattaag tcctactgat tttcactatc ccctgaattt 120 

ccccacttat ttttgtcttt cactatcgca ggccttagaa gaggtctacc tgcctccagt 180 

cttacctagt ccagtctacc ccctggagtt agaatggcca tcctgaagtg aaaagtaatg 240

tcacattact cccttcagtg atttcttgta gaagtgccaa tccctgaatg ccaccaagat 300

<210> 2 
<211> 299 
<212> DNA 

<213> Homo sapiens 
<400> 2 

cccagctgct caggaggctg aggcaggaga attgcttgaa cccaagaggc ggaggttgtg 60 
gtgagccgag attgcacctt tgtactccag cctgggcaac gagcaaaaaa ctctgtctca 120 
aacaaaaaga agaaaaaaaa aaaaaaaann nnnnnnnnnn nnnnnnnnnn nnnttnttct 180 
ggcgncnagt cccaaantcn taccttgtaa gacctttann tnncctgngg tntttntnna 240 
cncttanata nnnntntttn ctatcaanta tagggagant tttcntttng gggcaactt 299 

<210> 3 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 3 

atacgattcg aacnnggaca agacgagtat ggaataatat cccactnnnt ttacaatact 60
ganattatgc ngngatagng cttgttccat tcnaccagcg aatnatgcat tnacncnaca 120
cnngagttac tatccaaaca cacgttttca cgntacctga ngctggtnga naattatgcg 180
accatgaggc tttccangat ntttctannt ancagacngn gnacaatgnt gaanaagcng 240
tacacaccgc nctngncnnc cnncactgan cangtnacnn ngctcactgn ngcctctttc 300

<210> 4
<211> 287
<212> DNA

<213> Homo sapiens
<400> 4

aaancngcac gangccacgt ncgnnnngnt nntactnnnc natngccncn tcantggcng 60
ncagctagac gcctaacagc cgangancca nccntnntgt ganccngtcn tgacngnnag 120
cntgccggtc ttgctcnttt tgtctaccnn gagganannn ntntgggaca tcccagactg 180
agtgaggaga tctgcngctg cnnctgtact tggttacanc ncacacgang actntncctt 240
ggactanana cactagccta anattcngca ctacctantc ctctggc 287

<210> 5
<211> 300
<212> DNA

<213> Homo sapiens
<400> 5

gtccttttga accaccccaa agaactcaac atggcaaagc aaatggtaaa agcttcccga 60
ctgttctact ttgggtccgc gcgaagccca ctcacgtgtg atctgtgttg cccctgggag 120
gcccggggcg accggaaaag ggctctctca agttctgaaa agagaatctg ccaccagatc 180
gaatttcgac ccctgagctt gttcggacgt atggtccaaa ttcagattaa ggtggtcacc 240 
caacccgaga tgtcaggaaa ggccttctgc agagaaaatg tccccccacc cgccatctgc 300 

<210> 6 

<211> 284 

<212> DNA 

<213> Homo sapiens 

<400> 6 

tntccccttt gacgccttan tgccctnncg ctacnngtcc nttaggcctt atcccatcgn 60 

ccntcgtttn gcattctgcc nnagantgac tttncnatca tgcntnatnn gtnnttacna 120 

ggggctnggg tgaattntta caccctgcna ntccatanca cantgccntg cnagctncac 180 

cctcntgaat aaatgcaata aantttcngt tgatcttata caccttatgc nccntantta 240 

atcagccctn tnttacnana tcnanttatg cnggtattaa aaca 284 

<210> 7 

<211> 277 

<212> DNA 

<213> Homo sapiens 

<400> 7 

gtgctgcaga caacacacct tcctgatgga ggtgtccggc tgatggagaa gtctgtgggc 60 

ttgtaaatca tctttgatgt taaccaggcc gacgctgtgg ccacattccg aaagattaac 120

cctgtcaaac cctannnnnn nnnnnnnnnn nnnggatttg atnagcctgt nccanacctc 180 

tgcagcctcn ancggtngtn ntaccatagt ggggatgacc ctctgatact ttgncctggt 240 

ngancatgnt gacanntgct tctacagctt nngggac 277 

<210> 8 

<211> 292 

<212> DNA 

<213> Homo sapiens 


<400> 8 

cttgggaggc tgagtcagga gaattgcttg agcccaggag atggaggttg cagtgagcca 60 

agatcatgcc actgcactcc agactgggca acagagggag actccgtctc aaaaactaaa 120 

aaaaaaaata catttagtat agcggggggt gggcgggaga aataatgtta tttcctatgc 180 

aaatgacgnn nnnnnnnnnn cccatggtaa atgtnaatat actgcgtctn ttttgggana 240

gccttttant aaangagtct tanatgaatc tctanntnat gantttaact tg 292 

<210> 9 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 9 

ccaggttagc tgctgaatca aagcttcaaa cagaagttaa agaaggaaaa gaaacttcaa 60 
gcaaattgga aaaagaaact tgtaagaaat cacaccctat tctatatgtg tcttctaaat 120 
ctactccaga gacccagtgc cctcaacagt aaagactttt ctttaataag agtacggtgc 180 
cacttgcctc aaaagttact atggtgctta agattgtctt gatctgacat atatcacctt 240 
ctgggttatt tactcattgt gccaggacct ggcattttca tgtgcctttg accaagtgtt 300 
<210> 10

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 10 

aggaggcgga gcttgcagtg agctgagatc gcgccactgc actccagcct gggcaacaga 60 

gtgagactct gtctcaaaaa aaaaaaaaaa nnnnnnnnnn nnnnnnnnaa nctcgtnttn 120

gnaaggaaan ggggnaangg accggtntta tncctatgtn gtntttgcag gcaaangaaa 180 

nggacccntt tttgtaaaaa aaagtctttt gnncaantaa acggggtntg ngggtncagg 240 

ccctggnggg gcncncantt gcctggnggc ttntgnnaaa tcggnaaagg gaggaaaggc 300 

<210> 11 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 11 

cgtctgtaat cccagctgct tgggaggctg aggcaggaga atcacttgaa ccctggaggt 60 

ggcggtttca gtgagcacag atcatgccac tgcactccag cctgggcaac aaaacgagac 120 

ttcgtctcaa aaaaaaaaan nnnnnnnnnn nnnnnnnnnn nncgggttct cccaaattnt 180

ttnnaggggn ccatggncaa ctgnttnacn tttgtttngg naaccccntg cccnaagncg 240 

cananaggct gtnnttnncc ttgttnccaa ggntgaggan caaaaagtac cctntgtttt 300 

<210> 12 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 12 

caaagatggt cgtattacta aaggtgaata accagcgcgg ggggcacgtg gagtcactgg 60 

aacatttgtg caatgctggt gggaatgtca acccgtgcgg ccctctggaa taagcctggc 120 

agctcctcca agagttaccg tgtgacccag caattccact cctagctcca cccacaggaa 180

ttgaaagcaa agacgcaaac agatgcctgt gcaccaaagt tcacggcagc atccttcgcc 240

atagtggcag catccgtcgt cacagcggca tcatccttca tcatagcggc agcatccgtc 300 

<210> 13 

<211> 278 

<212> DNA 

<213> Homo sapiens 

<400> 13 

cctgcagcca ctaatgcatt gtgtatgata acaaaaactc tggtatgaca cattttctgt 60 

gatcattgtt aattagtgac atagtaacat ctgtagcagc tggttagtaa acctcatgtg 120 

ggggaggtgt gggaggtttt nncgnnannn nnngcnnnnn annccccggn nngnnngaag 180 

ctgnnnnttn naannngcnn nnnannatga naannnccnn ngactggnnn nangaggcct 240 

ancccntgnt ttananaaac nnncnncagn ntctctca 278

<210> 14 
<211> 300 
<212> DNA

<213> Homo sapiens 
<400> 14 

gtgtcttcat cttacccagt ggaacctaag aaattaaatt ctccagaaga aactgctttt 60 

cagacaccaa aatctagcca gatgcctcgg ccttcagtgc caccattagt taaaacatca 120 

ctgttttctt caaaattatc tacacctgat gttgtgagcc catttgggac cccatttggc 180 

tctagtgtaa tgaatcggat ggctggaatt tttgatgtaa acacctgcta tgggtcaccg 240 

caaagtcctc agctaataag aagggggcca agattgtgga catcagcttc tgatcagcaa 300 

<210> 15 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 15 

gttatattaa attattcttt gtttttcttt ttcttttaat aaagcctgca agttactaaa 60 
ttgtagtttc ataaattctg tagtaaagta tcatcttggc agtgtgccaa aggtgaaaat 120 
gatgctttct ctaacagaga aattcttagt gactccagtc gtagaaaaac gtctttacaa 180 
cctgaataag attgaagaat tgtgaacata ccatggccta ttggatgaat catttgccgt 240
aggctaaatc agactgtagg gtttgcgatg gatttatgga gtatgtgggt atagaaatca 300

<210> 16 
<211> 276 
<212> DNA 

<213> Homo sapiens 
<400> 16 

gtttcattta agaagaatga gctagataaa tgtgctcttc tggttacccc accctgacag 60 
agtgcatttt tacacggcta gcaggggttg agactgcagc ctggcctnnn nnnnnnnnnn 120 
ngnnnnnngc nnacttnact tcccngaanc actataattg gnanacnttn ctaannggtn 180 
atctngccga cctgnnagat anactcnnga taaaanccnn tgcagaaagc gcccttccat 240

gtcangcncn tnaganacnn ncntaccncc tangna 276 

<210> 17 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 17 

ggtgcccatc accacaccca gctaactttt gtatttttag tagagacggg gtttcaccat 60 
gttggccagg ctggtcttga actcctgacc tcgtgatccg cccgccttgg ccccgcaaag 120 
tgctgggatt acaagcatga gcccagcgcc tggctgtatc tttcatttta cccaagtcac 180 
tttacccaag taagtaatta ggggaaagcc tgagtcttgt accacctgtt catttgggga 240 
actgtgggaa acggagccaa cggacctaag tgccctttga cagtgagttt cataccattt 300 

<210> 18 
<211> 273 
<212> DNA 

<213> Homo sapiens 
<400> 18

ctcagctgag gcaattaaac tggaaaagaa atagattgaa aagatactac agaagaagca 60 

gtacagaagt tgggggactg aaggagaggg agccactgca ggtgctagct gcttaagggg 120 

ataccagtcc ttttacagat ataatagata cagcttctga ggtggagggt gataggagtg 180 

tgtagagaaa ttgcagttca gaactggagc atgcagttag gcaagaggca tcccatgtga 240

agatgtcaag caagtactgg aaaatgctga act 273 

<210> 19 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 19 

gggtcctggt gggagttcca tccagcagtg agtgcatttt ttccccagag cagttaaggg 60 

tcttattaaa agccaccact ttgctgaggc ctgtacaggc cttgggggtt tggggaagag 120

aaataaggca ggcacttgtc ccttcaggga gggacttgtc cctcactggg aggtttgggg 180 

ttgaccttgg ctccagcaga gatacccagc ctggcgtgga aggggcaggt ctgagcttac 240 

gcttgactgc agggcaagct gcaggcctct tctgccttcc cctgcattca ccaaggacag 300 

<210> 20 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 20 

atggcatgca ctgacctctt cttggagccc agaactttat agagttgcct accagggtta 60 

ctgtaatgga atttatgatc ttaagaaatt actagttgta ttatttatcc tatgattcat 120 

tcattcaata agcttttact gcataaactt tacatccagc actgtagtta agtacccaaa 180 

attgaataga aataatggct tttgaaaatc gcacaaagca ggccaggcac ggtggctcac 240 

gcctgtaatc ccagcatttt gggaggccga ggcaggcgga tcacgaggtc aagagatcca 300 

<210> 21 

<211> 293 

<212> DNA 

<213> Homo sapiens 

<400> 21 

cgtctgtaat cccagctgct tgggaggctg aggcaggaga atcacttgaa ccctggaggt 60 

ggcggttgca gtgagcacag atcatgccac tgcactccag cctgggcaac aaaacgagac 120

ttcgtctcaa aaaaaaaaan nnnnnnnnnn nnnnccttng gncgggttnt cccaaattnt 180 

tttgaggngn ccatggncaa ctgcttnanc tttgttttgg caaccccntg cccnaagtcg 240 

catataggct gtncttcacc ttgtttccaa ggctgnggaa canaaagtaa cct 293 

<210> 22 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 22 

ctggtctcga acacctgacc tcaggtgatc cattcgtctt ggcctctcga agtgctggga 60 
ttccaggcgt gagccactgc ggccagcaca tttccacttt tagatcctac tccataccac 120 
aggtttcatt taagaagaaa gagctagata aatgtgctct tctggttacc ccaccctgac 180
agagtgcatt tttacacggc tagcaggggt tgagactgca gcctggcctg ccagccattg 240
gaggtgttta aggaagggca gataatgtga ctctttgcgg ggtgccatct gcttacccat 300 



<210> 23 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 23 

gaaccaaaga cgtgtatgga gtgttctctt gtccttatcg acttgctctg ctcccagctt 60 
tccaagcgac cggatctgag tgatgcttct agaacatttg ggtgttgggg ggttcccaat 120 
agtagaaagg gtccccattc ctgctcagca ccgcacctct ctaccccccc acagacacac 180 
atgcagacac acacatgcag acaacacgca gacacacaca tgcaggcact cacatgcagg 240 
cccatgcaca cacacgtgca cacacatgca gagacatgca gacacgcagg cacacatgca 300 

<210> 24 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 24 

cctcccacaa cacgtgggaa ttcaagatga gatttggttg gggacacagc caacccatat 60

cacccatgcc tggatgccct tctcatgctt gggttctgtc atctgcacca ggccttctgc 120 
tgcccgtctg tcttacccac caggactctg actctccacg ctgggccacc tctcttctcc 180 
aacactgcta tggattgaat gtttatgtta tccccaaatt tgcatgttgc aatcccaatc 240
tccaatgcca tagtattagg aggtgggggc ctttgggagg tgatttggtc atgaaggtgg 300 

<210> 25 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 25 

ggaaaatgaa atctgactat ctgctagttg ccaaaaccca gaaacattcc tgtgtaatgg 60
ttagttggga aagaaggcag cacttgaaaa aatttaccag gttcctcact gggagatgtg 120
ggaaggggcg tgggacgcac gcggtcactc cctctcagcc ccccacattt ctagaacaca 180
ctgtagctgt gcctctacag actcccgctg cctggcctcc acagatcctg ctcagattca 240
ccagtaggca aagcttggcc ctattagctt tttctctcca tggctctgtg ggaatgtgcg 300

<210> 26
<211> 300
<212> DNA

<213> Homo sapiens
<400> 26

ctgcagtgag attctctgca atgactggcc tcagcaaggg ggcagcttag gaccctgaca 60
tcccaggtca ctaagccaca taggataagt aatgggtgga cagaagcggg aaaggagaag 120
ggcagggcac atgtttaaaa cttgaacttt ctgaggctaa gactggaaaa ggaatggttt 180
cagctgatat atttggatac cagttgacta tttttaggaa aaaaacacaa atggctttta 240
aacatcacag tgtgatacag tctaactcag aattagagac aggcaaaaca gaactccatc 300

<210> 27 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 27 

gtactgcttc tgtggctctt cacagacctc acggatgtga ccggagatga gtgccgatga 60 

ccacgtttta aaggagaaag agagctcctg gtggggccct cggggtggtc tcaggtccca 120 

tttgcagtct gcaacagtga cgcgcagccc ggtccggagc gtggtgagct ttgtttgcct 180 

tctgggtcag ctttcgctgt gtctcctgtg tgtgttagaa tccagagccc agaggaagtg 240 

caagcgggtc ctccgccaac ggggagagcc tcttcgcggc gctgttggcg acagcacgct 300 

<210> 28 

<211> 298 

<212> DNA 

<213> Homo sapiens 

<400> 28 

aangnaannn nggggngttg antcnacctn ngaaccgtgt anaaacccat ggaaacagct 60 

antaganntt gggcagganc agagngaggc caagntacgg gggaggcnag gagcngagan 120 

tggggnnnnn nnangnnaan tnnngaaggg gngngannga gggggggana naagggggga 180 

ngagggcgaa ngncaggann nagaaaannn ggggacgana nggngaacag ggnnnaaacg 240 

gaannnngga gnnnnnanag atgncgggca gngncngngn aggnganann ngagacgg 298 

<210> 29 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 29 

cctcagcccc acaccagctc tatttcaggg gtgagagtca gagagcactg caatatgtgc 60 

ttcatgggat ttcgattcga agatcctaga ccagggagac actgtgagcc agggatacaa 120 

caaaatacta ggtaagtcac tgcagaccga cctccctgca gtttgggaaa gaagctgggt 180 

ttgtggagaa tcagagcatc ttgacatgac tgctgaccta aagatccctg gcattggcca 240

gggatcctgt ggaacctctt ctagttcagg ggtgtgagca ttagactgcc agttgtctag 300 

<210> 30 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 30 

gtttgtttcc ccgagatgtg aacttgctga aggaaaacag tgtaaagagg aaggccatac 60 

agagaactgt cagctcttca ggatgtgaag gcaagaggaa tgaagacaag gaagcagtga 120

gcatgttggt taactgccct gcctactaca gtgtgtctgc tcccaaggct gagctactga 180

acaaaatcaa agagatgcca nnnnnnnnnn nntgaggaag aggaacaggc anatgtcaat 240

gaaaagaagg ctgatctcat tggaagtctc acccacaagc tggagaccct ccaggaggcg 300

<210> 31 
<211> 300
<212> DNA 
<213> Homo sapiens 

<400> 31 

tttaaactga gctccaaatg acgttcaaac acccctctcg ggtagagttt tcatggtgga 60 

acggttgcgc ccaccaaaca gaagcttatg tttttggcac agaaggcctg ggccattttc 120 

atggacacct ggctggacct cggtggaagt gaactccgta ggttgttgcg ttcactgcag 180 

cacctcacat gataccgtcc cctctcatgg aacggagcct cccccatgca gcccccactc 240 

aaatggagtt ttaaaggctg ggttcaggtt acgggggcgt ttctcaccgt ctgaatgcgg 300 

<210> 32 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 32 

gtgaaacaga aagtggagat gctttccttg acctgaagaa gcctcctgcc tccaaatgcc 60 
cccatcgcta tacaaaagaa gaactcttgg atataaaaga actcccccat tccaaacaga 120 
ggccttcatg cctttctgaa aaatatgaca gtgatggtgt ctgggaccct gagaagtggc 180 
atgcctctct ctacccagct tcagggcgga gctcaccagt ggaaagtctg aagaaagagt 240 
tggatacaga ccggccttcc ctggtgcgca ggatagtaga tccacgagag cgtgtgaaag 300

<210> 33 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 33 

gtctgattga agctgttcag gtttatcatg caaatcctcg cctctggcta cggctggctg 60 

aatgctgcat tgctgccaat aaggggactt ctgaacaaga aactaaaggc cttcccagca 120 

aaaaaggaat tgtacagtct attgttggtc aaggctatca tcgtaaaata gttttggcat 180 

cacagtctat acagaatact gtttataatg atgggcagtc ttcggccatt cctgtagcca 240

gtatggagtt tgcagccata tgtctcagaa atgccttgtt gctgctacct gaagaacagc 300 

<210> 34 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 34 

tgacagagct gttcagcgta caccagatcg atgagctggc caagtgcaca tcagacactg 60 

tgttcctgga gaagaccagt aagatctcgg accttatcag cagcatcacg caggactacc 120 

acctggatga gcaggatgct gagggccgcc tggtacgcgg catcattcgc attattaccc 180 

gaaagagccg tgctcgccca cagacctcgg agggtcgttc aactcgggct gctgccccaa 240 

ccgctgctgc ccctgacagt ggccatgaga ccatggtggg ctcaggtctc agccaggatg 300 

<210> 35 

<211> 300 

<212> DNA 

<213> Homo sapiens 
<400> 35

cttttttaag caaagcagtt tctagttaat gtagcatctt ggactttggg gcgtcattct 60
taagcttgtt gtgcccggta accatggtcc tcttgctctg attaaccctt ccttcaatgg 120
gcttcttcac ccagacacca aggtatgaga tggccctgcc aagtgtcggc ctctcctgtt 180
aaacaaaaac attctaaagc cattgttctt gcttcatgga caagaggcag ccggagagag 240
tgccagggtg ccctggtctg agctggcatc cccatgtctt ctgtgtccga gggcagcatg 300



<210> 36 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 36 

gctggccaaa gccaaatctc ctaagtccac cgcccaggag ggaaccctga agcctgaagg 60 
agttacggag gccaaacatc cagctgcagt tcgcctccaa gaaggggtcc atggccctag 120 
tcgagtccat gtgggctctg gggaccatga ctattgtgtc cggagcagga cccccccaaa 180 
aaagatgcct gccctagtca ttccagaggt gggctcccga tggaatgtca agcgccatca 240
ggacatcacc atcaaacctg tcttgtcctt gggcccagct gcccttcgcc ccatgcatag 300 

<210> 37 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 37 

gtccaaggac aacttcgaga catttctttt tgccaccgta tctaacaggg agcaggaaga 60 
tctctgccga ggaattgtcc agctctgctt caatgagcaa agccaacagc tgctagcaga 120 
ggtccagccc tctgactctt tcctcatggt agagacaact gcatactttg aggcctacag 180 
gcacgtcctg gaaggactcc aggaggtcca ggaggaagat gttcccttcc agaggaatat 240 
cgtggagtgt aactctcatg tgaaggagcc aaggtacttg ctaatggggg gcagatatga 300 

<210> 38 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 38 

catccaggga gaacctcggg gctgggacac ctcctggccc tcaccctggg tcatgtttac 60 
agtcctcagt gccccacacc ggtggccccc tgaggacacc tccaccctga ccttgatttt 120 
cccaaacgct gcctcttggt gacagactca gcccaaaacc ccttccttct gtctctggag 180 
acccttgagc ttggggaaat atggaggggt gtgtgtctgc aatcaaggcc tctgcagctc 240 
acggctggcc cggtgggctg ggacttccgt atgaattnta aatacttagg gttcattttt 300 

<210> 39 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 39 

gggaaggagc gggcgtgagg ccagctgagg catggtgacc cctgggaagg agcgggcgtg 60 
aggccagctg aggcatggcg acccctggga aggagcgggc gtgaggccag ctgaggcatg 120 
gtgacccctg ggaaggagcg ggcgtgaggc cagctgaggc atggtgaccc ctgggtacgg 180
gggacttggg ggccgcacct tggtttgccc agggcccctc ctgcaccacg ggccacatgc 240
ggaggacggc gtgggatagg ctccctgggt ccacagcttc tgcccgtgta tggggaaccc 300

<210> 40 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 40 

ccaaaagctt gtggcaaatt tgaaatttct gccattaggg accttacaac tggctatgat 60 

gatagccaac ctgataaaaa agctgttctt cccactagta aaagcagcca aatgatcacc 120 

ttcacctttg ctaatggagg cgtggccacc atgcgcacca gtgggacaga gcccaaaatc 180 

aagtactatg cagagctgtg tgccccacct gggaacagtg atcctgagca gctgaagaag 240

gaactgaatg aactggtcag tgctattgaa gaacattttt tccagccaca gaagtacaat 300 

<210> 41 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 41 

aaaaggtccc ccttctggga aagaccgagt gaagaaaggt ggatcctaca tgtgccatag 60 

gtcttattgt tacaggtatc gctgtgctgc tcggagccag aacacacctg atagctctgc 120 

ttcgaatctg ggattccgct gtgcagccga ccgcctgccc actatggact gacaaccaag 180 

gaaagtcttc cccagtccaa ggagcagtcg tgtctgacct acattgggct tttctcagaa 240 

ctttgaacga tcccatgcaa agaattccca ccctgaggtg ggttacatac ctgcccaatg 300 

<210> 42 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 42 

ttctaagtca ggagtacagt acaaaggaca tgtggagatc cccaatttgt ctgatgaaaa 60 

cagcgtggat gaagtggaga ttagtgtgag ccttgccaaa gatgagcctg acacaaatct 120 

cgtggcctta atgaaggaag aaggggtgaa acttctaaga gaagcaatgg gaatttacat 180 

cagcaccctc aaaacagagt tcacccaggg catgatctta cctacaatga atggagagtc 240 

agtagaccca gtggggcagc cagcactgaa aactgaggag cgcaaggcta agcctgctcc 300 

<210> 43 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 43 

gccaccgaag cttcaggatg acatcttaga ctctcttggt caggggatca atgagttaaa 60 

gactgcagaa caaatcaacg agcatgtttc aggccccttt gtgcagttct ttgtcaagat 120 

tgtgggccat tatgcttcct atatcaagcg ggaagcaaat gggcaaggcc acttccaaga 180 

aagatccttc tgtaaggctc tgacctccaa gaccaaccgc cgatttgtga agaagtttgt 240 

gaagacacag ctcttctcac ttttcatcca ggaagccgag aagagcaaga atcctcctgc 300 
<210> 44
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 44 

ggcttataca acatagtggg gaacgcatgg gaatggactt cagactggtg gactgttcat 60 
cattctgttg aagaaacgct taacccaaaa ggtccccctt ctgggaaaga ccgagtgaag 120 
aaaggtggat cctacatgtg ccataggtct tattgttaca ggtatcgctg tgctgctcgg 180 
agccagaaca cacctgatag ctctgcttcg aatctgggat tccgctgtgc agccgaccgg 240
ctgcccacta tggactgaca accaaggaaa gtcttcccca gtccaaggag cagccgtgtc 300 

<210> 45 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 45 

gtggaagaaa attttttgct gcttctggtt cccagaaaag ggagccattt taacagacac 60
atctgtcaaa agaaatgact tgtcgattat ttctggctaa tttttcttta tagcagagtt 120
tctcacacct ggcgagctgt ggcatgcttt taaacagagt tcatttccag taccctccat 180
cagtgcaccc tgctttaaga aaatgaactt atgcaaatag acatccacag cgtcggtaaa 240
ttaaggggtg atcaccaagt ttcataatat tttcccttta taaaaggatt tgttggccag 300

<210> 46
<211> 300
<212> DNA

<213> Homo sapiens
<400> 46

gtggaagaaa attttttgct gcttctggtt cccagaaaag ggagccattt tangngacac 60
atctgtcaaa agaaatgact tgtcgattat ttctggctaa tttttcttta tagcagagtt 120
tctcacacct ggcgagctgt ggcatgcttt taaacagagt tcatttccag taccctccat 180
cagtgcaccc tgctttaaga aaatgaactt atgcaaatag acatccacag cgtcggtaaa 240
ttaaggggtg atcaccaagt ttcataatat tttcccttta taaaaggatt tgttggccag 300

<210> 47
<211> 300
<212> DNA

<213> Homo sapiens
<400> 47

acacagataa ttttaataca atgtgaaaaa gtgtatgggt gtgtagaaga ggggttctta 60
gagtttctgg agagaatgat tctgagctcg gttttgacaa aagaggagct gctgaggcta 120
aaagtggatg aaaagggcct tataattaaa agaaacaaga caggactcag aggtgtgaaa 180
caaatattat gcatggtgaa ttacaatgag ttgggggtat tctgtagccc taaagtacaa 240
ggtataaaga gacagaaaat gatcctggaa tatagacaga ggatacttca tctctcatga 300

<210> 48
<211> 300
<212> DNA

<213> Homo sapiens
<400> 48 

gatggaacat gagtggaagt gggcagtctt tttctttccc tatcagctga gtgaatgaag 60 

atttagaggg cagcagagtc atgacatgga tgacgttggg tctctggatg gctaaatgga 120 

agacccgccc cccaacgcca ctctaccccc ctgctttgaa ctatgctttg agaaatgagc 180 

ttatgagacc actgagactt gggggctgtt tgttcagcag ttcacctaca cttattagga 240 

aaggttgact tcttgtaact acgcctttcc ttaaatcatc ttttgtataa ttctcagaag 300 



<210> 49 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 49 

ccctccccgg cttcccccgg agtgggtcac cacactgttt tttatcatca tgggaatcat 60
ttcattgact gtcacatgtg gtttgctggt ggcttcccac tggcgaagag aagctacaaa 120
atatgctcga tggatagcat tcactggaac cactatgaga agattatagg aaaaacacca 180
agactagagg actctgggtt ccttttatgc aaagtcaact cttctgggtc acagttaccc 240
agcaacaaaa ataaagagag gaccaggacg atgccagcac cccgtttatc ctgagtgaac 300

<210> 50
<211> 300
<212> DNA

<213> Homo sapiens
<400> 50

ctcctgtctc agcctcctgg gtagctggga ctacaggtgc atgccaccat gcctggctaa 60
cttttgtatt tttagtacag acagggtttc accacattgg tcaggctggt ctcgaactcc 120
taacctcagg tgatccacct gccttggcct cccaaagtgc tgagattaca ggcgtgagcc 180
accgcgcctg gcctgattgg ttttttaaca tgatttttct ctaagcttaa ataccacaag 240
gccaaagaga aatggtcata atttaaacca ttattatatt ggtgaggtat ccctagctat 300

<210> 51
<211> 300
<212> DNA
<213> Homo sapiens

<400> 51

ggaggctaga ctcaagctgt ctggagagtg tgaaacaaaa gtgtgtgaag agttgtaact 60
gtgtgactga gcttgatggc caagttgaaa atcttcattt ggatctgtgc tgccttgctg 120
gtaaccagga agaccttagt aaggactctc taggtcctac caaatcaagc aaaattgaag 180
gagctggtac cagtatctca gagcctccgt ctcctatcag tccgtatgct tcagaaagct 240
gtggaacgct acctcttcct ttgagacctt gtggagaagg gtctgaaatg gtaggcaaag 300

<210> 52
<211> 300
<212> DNA
<213> Homo sapiens

<400> 52

atatggtata gttggaaata ggttattgtg agttatttgt agtcatgtct ttaatggccc 60 

ttgcatggtg tctaacttct gcaataaatg atctgccagt cctagtgtct ggctttatgc 120 

aatttgtttt cctttgtgga tgaagtggga gtaagacttg ttgctgtgag gattagatga 180 

agtggctagg atatggacac actttacttg aattggaaaa caagccatgt atccctaatc 240

tgcaaaatgt ggcatgtcac acgtgtaatc tctgaggttt agtttttgct caagattgca 300 

<210> 53 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 53 

aagaagctct gcttggtact actattatga acaacattgt tatttggaat ttaaaaactg 60

gtcaactcct gaaaaagatg cacattgatg attcttacca agcttcagtc tgtcacaaag 120 

cctattctga aatggggctt ctctttattg tcctgagtca tccctgtgcc aaagagagtg 180 

agtcgttgcg aagccctgtg tttcagctca ttgtgattaa ccctaagacg actctcagcg 240

tgggtgtgat gctgtactgt cttcctccag ggcaggctgg caggttcctg gaaggtgacg 300 

<210> 54 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 54 

ccaagatgcc aatttccatg aagtcttgat ttatatatat gtacacatgt tatgcacata 60 

catgtttgtt ttctaacagt tattttttaa gcttttgaga taattttaga cttacagaag 120 

agttgtaaaa gtagtagagt tcttgtatac tctgcaccca ccttgccctt atgttaacat 180 

cttacgtaac aatagaacat ttgtcaaaat taagaaatta accttgatat aatactaact 240

aaagtagaaa gtttaaaaag tagagatttt agtcttttca ctaatgtcct tttactgttc 300 

<210> 55 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 55 

gggagggacc cttgggggca ggttgtgggt agccagttgc agtctgtggc ctccctcaga 60 

ggtttggagt cgggcgtggc atgctgctgt tggcctcttt ccgagggagt gccatccact 120

ccctgtccca ccgctgtccg cggtgaggac agtgagggca gtgctacgtg gtggggaggt 180 

gtgtgagaag ccacggaagg gcttcacagg gcagatgcca aggccagtgg gccccggaca 240 

gagtcaggct ccctgggcgg ccttgtgtct tggtggccct gatcatcctg ccaatgcaaa 300 

<210> 56 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 56 

ctttgctctc tccattccaa gttgttctct gttctagaaa gcagatgtag tagacatcta 60 
ctgtttttgc ctaaacagaa tccctttttc ctttttttgt taaaagtact catccctaat 120 
attacattgt tctggaagga ctgaaaataa cagaactcag caccatgatc ggaccgggac 180
aatcagatta tttcattcct cagcaaacgg agatcgatcc gaaaagtgga aatatgagct 240
cttctttggt gttggcatat ggaccctgag agaaagaact ttaatttttt ctcttggact 300



<210> 57 
<211> 276 
<212> DNA 

<213> Homo sapiens 
<400> 57 

cctccctgga tgtgcagaca tggaggagga cagaaggccc agctcagtgg cccccgctcc 60 
ccacccccca cgcccgaaca gcaggggcag agccagnnnn nnntcgaagt gtgtccnngt 120
tgtcttttga nccttgttnt ggngccttgc ctanatgtat ntnntntnnn tntnntnatt 180
tnnnnnntnn ntnnnttnct nttnntaaat tgnttnnaan ttntnntann ttnttnnatt 240
nnnannnnnn ntantgtnnt gnattgntat nnatca 276 

<210> 58 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 58 

ctgtaagtct ctttcttgcc catcaccaca tccctagtac tgggtatcag tctggccact 60 

tggctttctg gtttgcccca atgtggtcta ttcttgatgc agctaccaaa gtaatgtttt 120 

aaaaccatta taccaagtta ctatccttgt caaaaccccc agtaactgcc aatctcactt 180

agaataaaat ccggactcct gtgaagcaca gcataaactg gccactgcct atgcagcaac 240

ctcatcttta ccgtttcctg ccttgctcac tcccttccag cgccgttatt cttcctgatg 300 

<210> 59 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 59 

gacccaggta gaccagctca agagttcatg ttctttgtca tcctngtgtg agctctctgt 60 

aagtctcttt cttgcccatc accacatccc tagtactggg tatcagtctg gccacttggc 120 

tttctggttt gccccaatgt ggtctattct tgatgcagct accaaagtaa tgttttaaaa 180

ccattatacc aagttactat ccttgtcaaa acccccagta actgccaatc tcacttagaa 240

taaaatccgg actcctgtga agcacagcat aaactggcca ctgcctatgc agcaacctca 300

<210> 60 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 60 

gggtcctggt gggagttcca tccagcagtg agtgcatttt ttccccagag cagggtaagg 60 
gtcttattaa aagccaccac tttgctgagg cctgtacagg ccttgggggt ttggggaaga 120
gaaataaggc aggcacttgt cccttcaggg agggacttgt ccctcactgg gaggtttggg 180 
gttgaccttg gctccagcag agatacccag cctggcgtgg aaggggcagg tctgagctta 240 
cgcttgactg cagggcaagc tgcaggcctc ttctgccttc ccctgcattc accaaggaca 300 

<210> 61

<211> 292 

<212> DNA 

<213> Homo sapiens 



<400> 61 

caaggcccga ggtgccatcc cctctgggaa gcagaagcct ggnggcaccc agagtgggta 60 

ctgtngnggt aaagngntca ccctctcaca gcaccaccag cggcgagaca gaccccacca 120 

ccatcttccc ctgcaaggag tgnggcaaag tcttcttcaa gatcaaaagc cgaaatgcac 180 

acatgaaaac tcacaggcag caggaggaac aacagaggcn aaaggctcag aaggcggctt 240 

tngcagctga gatggcagcc acgattgaga ggactacggg gcccgtgggg gc 292 



<210> 62 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 62 

agcaaatcaa gatcttcagg tacagttgga ccaggcactc cagcaagcct tggatcccaa 60 

tagtaaaggc aactctttgt ttgcagaggt ggaagatcga agggcagcaa tggaacgtca 120 

gcttatcagt atgaaagtca agtatcagtc actaaagaag caaaatgtat ttaacagaga 180 

acagatgcac agaatgaagt tacaaattgc cacgttgcta cagatgaaag ggtctcaaac 240 

tgaatttgag cagcaggaac ggttgcttgc catgttggag cataataatg gtgaaataaa 300 

<210> 63 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 63 

caggcctgga cttcgccccc aggcctagga ccgcggaggg tggaaccctg ctactgcccc 60 

aacagggact ccaatcaatc ggagttctcc ccttgccgga gctgcccttc acctttgggg 120 

cccgagacag tcataaggga tggacttagt tttcttgcag ggaaaaaggt ggacagccgt 180 

gtttcttaag gatgctgagg gcatggggcc aggaccaggg gagaggcaca gctccttcct 240

gagcagcctc tcaccactgc cacaaggctc cctaatgctg gtctctgctc cactccccgg 300 

<210> 64 

<211> 294 

<212> DNA 

<213> Homo sapiens 

<400> 64 

gctgcatctg caatgaggat gccaccctac gctgcgctgg ctgcgatggg gacctcttct 60 

gtgcccgctg cttccggtgg gtgcaggtgg aatgttctgt gcgagagctc aagggctgcc 120 

tggatccctg acttgtatcc ctttgttcca cagagagggc catgatgcct ttgagcttaa 180 

agagcaccag acatctgcct actctcctcc acgtgcaggc caagagcact gaagacaccc 240 

tggtcctccc ggaagggcag tcccacaggc agcggcaccc atttctgggc cccg 294 



<210> 65 
<211> 300 
<212> DNA

<213> Homo sapiens 



<400> 65 

aattgatgag ccttattaac tatcttttca ttatgagaca aaggttctga ttatgcctac 60 

tggttgaaat tttttaatct agtcaagaag gaaaatttga tgaggaagga aggaatggat 120 

atcttcagaa gggcttcgcc taagctggaa catggataga ttccattcta acataaagat 180 

ctttaagttc aaatatagat gagttgactg gtagatttgg tggtagttgc tttctcggga 240

tataagaagc aaaatcaact gctacaagta aagaggggat ggggaaggtg ttgcacattt 300



<210> 66 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 66 

agcagatttg tgataaactt gctgtagaag aaaccaaagg ggaacttctg ttgcaactat 60 

gtcgtttgga agatgctgca gatgtttata gaggattgca agagagaaat cctgaaaact 120 

gggcctatta caaaggcttg gaaaaagcac tcaagccagc taatatgtta gaacggctaa 180 

aaatttatga ggaagcctgg actaaatatc ccaggggact ggtgccaaga aggctgccgt 240 

taaacttttt atctggtgag aagtttaaag aatgtttgga taagttccta aggatgaatt 300

<210> 67 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 67 

tgttcttgta gtgtttgttg ctattgttag aaagattatt agtgatatgt ggggtgtctt 60 

anctaaacaa cagacacatg taagaaaaca ccagtttgat catggagagc tggtttacca 120 

tgcattgcaa ttgttagcat atacagccct tggtatttta attatgagac taaaactctt 180 

cttgacacca cacatgtgtg ttatggcatc actgatctgc tcaagacagc tatttggatg 240 

gctcttttgc aaagtacatc ctggtgctat tgagtttgct atattagcag caatgtcaat 300 



<210> 68 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 68 

agacaaagaa aaggtggcaa tcatagaaga gttagtagta ggttatgaaa cctctctaaa 60 

aagctgccgg ttatttaacc ccaatgatga tggaaaggag gaaccaccaa ccacattact 120 

ttgggtccag tactacttgg cacaacatta tgacaaaatt ggtcagccat ctattgcttt 180 

ggagtacata aatactgcta ttgaaagtac acctacatta atagaactct ttctcgtgaa 240 

agctaaaatc tataagcatg ctggaaatat taaagaagct gcaaggtgga tggatgaggc 300 



<210> 69 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 69

aattcnacac gaggtggccc ataagtttta ccttttaaac atccggctgc ctgtgaatga 60 

gaagaataaa atcaatgtgg gaattgggga gataaaggat atccggttgg tggggatcca 120 

ccacaatgga ggcttcacca aggcgtggtt tgccatgaag acctttctta cgcccagcat 180 

cttcatcatt atggtgtggt attggaggag gatcaccatg atgtcccgac ccccagtgct 240 

tctggaaaaa gtcatctttg cccttgggat ttccatgacc tttatcaata tcccagtagg 300 

<210> 70 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 70 

cccaaggcaa gctgttaaca aaatcaacct gggccaatca tcaaagggtt ggacctaagg 60 
ttgctatact caatagaaca agcattttaa ataaatttct cgtaagttgt tgctttcttt 120 
atgtggtggg tgtggcttta aagagcacaa aaccacaaca aatcaaagag tagctcgggc 180 
ttgtcttttg ctttatggct gagggtttga aggatgattc atggacttgt gaatgccagc 240 
cccagtcccg gcttaggtct atctgccaat accaccaggg ccaacaaatt cacgcaacaa 300

<210> 71
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 71 

ggaaatgcaa gtcaaaacag ctttgtaggt ctcagagttt gcttttaaga agtagtacaa 60 
gaaggaatag ttatatcaat acaccagtgg ctgaaattat catgaaacca aatgttggac 120 
aaggcagcac aagtgtgcaa acagctatgg aaagtgaact cggagagtct agtgccacaa 180 
tcaataaaag actctgcaaa agtacaatag aactttcaga aaattcttta cttccagctt 240
cttctatgtt gactggcaca caaagcttgc tgcaacctca tttagagagg gttgccatcg 300

<210> 72 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 72 

ggattctttc actgagcaca aagagttgtt ggggctttag catctgactg attttgttac 60 
ggggttgatt ctgaccatag gaagtatgca atgtgaatca ctatttacag agaaacctac 120 
aacagatgct tgatgttgta gaaactggga catatagata ccaagcaaaa ttataagaaa 180 
cctataaggt gttcaatacg cttgtgtttc caaaattcac tgtacatgat cagtttggtg 240 
ttcttgtacc acagttttta actgaaggaa ccagttgtaa cagtctcaat tttaactaaa 300 

<210> 73 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 73 

ataacacaca tcacagtatg ctctcagaaa tttctttatt tgaaccctat accaatatct 60 
gttgatcaat gaccattttt gctcagcatg gagaaacagt gccctgcatg aagggtagtg 120
agaataaaaa ggatcttacc acctttatca tgagggtggc tttgctctct ccattccaag 180
ttgttctctg ttctagaaag cagatgtagt agacatctac tgtttttgcc taaacagaat 240 
ccctttttcc tttttttgtt aaaagtactc atccctaata ttacattgtt ctggaaggac 300 

<210> 74 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 74 

cagagtcaac atggagcatc tcactgtgaa atgatccatg gattgaagga tatggtaaaa 60

tgtttatagg ttactttgaa agtaaaatat actatgtctt ggttttgagg atattggata 120 

caaaactctc ttcctttagg gctactgaga cttgattcct gatcatcaga aatttcacca 180 

gaaacaactt gcttccaata tacccaattc tatatgaaga attcatggag agtgtactgg 240 

cactgnnnnn nnnnnnngan ncntgctgct ncgaanntnt nntattnact gannntgaat 300 



<210> 75 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 75 

caagagagag tgatagaatt ggcagtgaaa tatacgaacc accctcctgc cctctgggtt 60 

cacaatacgt gtacacttga ctgtgaagtg gctgtgagag tgggtggaga gttcttcttt 120 

gaccctcagc ctgcggatgc ctctagaaac ctcgtgttga ttgcaggagg agtcggaatt 180 

aaccctctgc tttccatcct gcggcacgca gcagatctcc tcagagagca ggcaaacaaa 240 

agaaatggat atgagatagg aacaataaaa ctattctaca gtgcaaaaaa taccagcgaa 300

<210> 76 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 76 

gctagacgaa gtggtgaagc ccaaggactt atttttgagc tcgctgtaag actgagaaat 60 

cacgtactcc ttcctgaaac cactaagagg aaaaatgtct gtgacactgc atacagatgt 120 

aggtgatatt aaaattgaag tcttctgtga gaggacaccc agaacatgtg agatggagtc 180 

tcgctgtgtc ccccaggctg gagtacaatg gcgcgatctc ggctcactgc aacctccgcc 240 

tactgggttc aagcaagtct tctgcctcag cctcccgaga actgcaagag gaggcaactg 300 

<210> 77 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 77 

agagactttt gtttgtgttt aattagggct atgagagatt tcaggtgaga agttaaacct 60 

gagacagaga gcaagtaagc tgtccctttt aactgttttt ctttggtctt tagtcaccca 120 

gttgcacact ggcattttct tgctgcaagc ttttttaaat ttctgaactc aaggcagtgg 180

cagaagatgt cagtcacctc tgataactgg aaaaatgggt ctcttgggcc ctggcactgg 240 

ttctccatgg cctcagccac agggtcccct tggaccccct ctcttccctc cagatcccag 300

<210> 78 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 78 

caggagcaat caattcctgt cgaagtgaat accatgcagc ttttaacagt atgatgatgg 60 

aacgcatgac cacagatatc aatgcactga agcggcagta ctctcgaatt aaaaagaagc 120 

aacagcagca ggttcatcag gtgtacatca gggcagacaa agggccagtg accagcattc 180 

tcccgtctca ggtaaacagt tctccagtta taaaccacct tcttttagga aagaagatga 240 

aaatgactaa cagagctgcc aagaatgctg tcatccacat ccctggtcac acaggaggga 300 



<210> 79 

<211> 278 

<212> DNA 

<213> Homo sapiens 



<400> 79 

gtgctgcaga ggaagacagc ctgtcaggat actgacgagg aggaggaaga ggaagatgat 60 

gatcaggctg aatacgacgc catgttgctg gagcacgctg gagaggccat ccctgccctg 120

gcagccgcgg ctgggggaga ctcctttgcc ccattctttg ccggtttcct gccattattg 180 

gtgtgcaaga caaaacaggg ctgcacagtg gcagagaagt cctttgcagt ggggaccttg 240 

gcagagacta ttcagggcct gggtgctgct cagcccag 278 



<210> 80 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 80 

ggaacttctg agtaattggt atcatttcct agtgactcgg ctcttgtact ccaatcccac 60 

agtaaaaccc attgatctgc actactatgc ccagtccagc ctggacctgt ttctgggagg 120 

tgagagcagc ccagaacccc tggacaacat cttgttggca gcctttgagt ttgacatcca 180 

tcaagtaatc aaagagtgca gcatcgccct gagcaactgg tggtttgtgg cccacctgac 240 

agacctgctg gaccactgca agctcctcca gtcacacaac ctctatttcg gttccaacat 300 



<210> 81 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 81 

acctgtaccg cctggccact ggctgtcacc ggcgtgatga gctgccggtg ttngaacgca 60 

acctatgctg gactctcccg gcagactgcc tggatatggt cgccatgcag gaagccgccc 120 

agcacctcct cggcacacac gacttcagcg ccttccagtc cgctggcagc ccggtgccga 180 

gccccgtgcg aacgctgcgc cgggtctccg tttccccagg ccaagccagc cccttggtca 240 

cccccgagga gagcaggaag ctgcggttct ggaacctgga gtttgagagc cagtctttcc 300 



<210> 82 
<211> 300
<212> DNA 
<213> Homo sapiens 



<400> 82 

cccagctgga cctggtggcc ctttcctagt gcctctgctg ggggaggaga acctgggtcc 60 

acgtggaggc taggaggtct caggtgctgc cctggcagca ccagagtgtg ggccgggccc 120 

gagtgtctgc ccctcggccc tcagggtggg gcacttagca cccagaaggg accaaaagca 180 

gggcatggcg gtgcagagga gtttgggagg tgtaaacagc cccatgcacg tggaggagga 240

gctggctttc agccccagac cccacgctag cactttccac gctgcttgcc cgctgttgat 300



<210> 83 

<211> 272 

<212> DNA 

<213> Homo sapiens 



<400> 83 

tctagatatt gcccaatcgc tgcccacagt gcacatacct ttccaccagt cacatgtgag 60 

agggcagatt ttccaaatgc tcatcaccac ttggcactgt gtggactata attttggcca 120 

gttaggaaat ggcatctcat tgttttcatc ttaatttgcg tcagcctgat tactcattga 180 

aacttgtgag gttgagaaac ttttcttaag cttattggcc attcaagttt cctcctttat 240 

gaaatggttg ttcatgtcat ttgctcattt tt 272 

<210> 84 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 84 

cccactgccc ccggtcaaca aacccacttt tatgacagtt ttcttccgca gcttggctct 60 

taaattttac tggcaggtgt atggttgttg gagggttcct agtgagttgg gggacctggc 120 

aatagagctg cttggttgga ggaagtgaag ctggcttagt accagcagct gatctcttcc 180 

acgtgctgct gctttttttg ccactctgat actaaaccag agaaagctgc aggtggataa 240 

agaagctgtg gctgtttttt gcttttgggt ggcaatgaga aagagtcaca gtgtgggtta 300 



<210> 85 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 85 

ctgggagcca ccaacatagc agattaccat gtgaagttgc cactgctgca tctcctgaaa 60 

cctggctgat gggagaggtc tcattttgtg tctgagaatg tccaggttgt ctgcagacca 120 

cagcactgat ttcccattag cagttattat ttcctggcca tttcttcctg aaggttttgt 180 

ggttaaactc cctgtcctca atattttatc agcagtaggg ctgtcattct tctggttatc 240 

aacctctaca ttatgaagta aggttcaacc cttctgcttt tctcaggccc ccaaaacggt 300 



<210> 86 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 86

agaacattgg tgtgtgagtg ttttttgatg gtgcaggacc cggaggtgct ttccttgcca 60 

agaatagaaa catccagaat gctcctcccc atcccccaat cccagacagc aattatgtca 120 

gccctgtaag gcattgcctg ctcttgaccc tttggcccat ctttttattt ttaaaaaatt 180 

cccatgtcac agatgccctg tctatgcaga gggtggcgtg ggatgggtga ccactaagtt 240 

taggctggtg aaggtggtga gcccttctga ggccctgata gaactttcca ggagttcatg 300 

<210> 87 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 87 

ctccaaggaa aatccacctc gcagcttgta aatctacagc ctgattacat caaccccaga 60 

gccgtgcagc tgggctccct tctcgtccgc ggcctcacca ctctggtttt agtcaacagc 120 

gcatgtggct tcccctggaa gacgagtgat ttcatgccct ggaatgtatt tgacgggaag 180 

ctttttcatc agaagtactt gcaatctgaa aagggttatg ctgtggaggt tcttttagaa 240 

caaaatagat ctcggctcac caaattccac aacctgaagg cagtcgtctg caaggcctgc 300 



<210> 88 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 88 

ctgaaacaaa agatgtattt caattaaaag acttggagaa gattgctccc aaagagaaag 60
gcattactgc tatgtcagta aaagaagtcc ttcaaagctt agttgatgat ggtatggttg 120
actgtgagag gatcggaact tctaattatt attgggcttt tccaagtaaa gctcttcatg 180
caaggaaaca taagttggag gttctggaat ctcagttgtc tgagggaagt caaaagcatg 240
caagcctaca gaaaagcatt gagaaagcta aaattggccg atgtgaaacg gaagagcgaa 300

<210> 89
<211> 300
<212> DNA

<213> Homo sapiens
<400> 89

ggggacatgt gtccctcagc tcagcagagg ctgtggtaca acatggtcct tggtgaagac 60
ctgcacccct ggaacctccc accatcgtca caactgtagt ctcatttgca gtggagaaaa 120
gaacccgatg tcccacagcc agatatacac ccagctccat gccagccctt catgtttacc 180
ttttgctttg ttaattacat gtcagactcc tagagggcct ccagactaat aggaagcatt 240
tctgtaacca acctgccacc cactgattca gaaatggaaa tcacattcca caatctatgg 300

<210> 90
<211> 300
<212> DNA
<213> Homo sapiens

<400> 90

ctcatacaga aagtcagatc aacaaagagt ccaagaaaaa tgcgacccag ctagaccatt 60

tgatcccagg cttagcacac gattgcatgg catccccttt agccacttca accactgcag 120

acatccagga agctggactc tctcctcagt ccctccagac ttctggccac cacagaatga 180 

aaaccccatt ttcaactgag ctatctttgc tccagcctga tactccagac tgtgctggag 240 

atagtcatac cccactggct ttttccttca ccgaggactt ggaaagttct tgtttgctag 300 



<210> 91 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 91 

aatgcaaagg gctgcagttc tcattcaggc tactttcagg atgcacagaa catatattac 60 

atttcagact tggaaacatg cttcaattct aattcagcaa cattatcgaa catatagagc 120 

tgcaaaattg caaagagaaa attatatcag acaatggcat tctgctgtgg ttattcaggc 180 

tgcatataaa ggaatgaaag caagacaact tttaagggaa aaacacaaag cttctattgt 240 

aatacaaggc acctacagaa tgtataggca gtattgtttc taccaaaagc ttcagtgggc 300 



<210> 92 

<211> 300 

<212> DNA 

<213> Homo sapiens



<400> 92 

aagatatgca gagatattcc aggatctttt agctttggtg cggtctcctg gagacagtgt 60 

tattcgccaa cagtgtgttg aatatgtcac atccattttg cagtctctct gtgatcagga 120 

cattgcactt atcttaccaa gctcttctga aggttctatt tctgaactgg agcagctctc 180 

caattctcta ccaaataaag aattgatgac ctcaatctgt gactgtctgt tggctacgct 240 

agctaactct gagagcagtt acaactgttt actgacatgt gtcagaacaa tgatgtttct 300

<210> 93 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 93 

cgattcgcca gttctccatt ctgagagtca atcacgttcc tgataggttg tcattgattt 60 

ttttcttcgt tggttttaac cttctaaaca tctccaggcc actttcttag cctttttcta 120 

ggtactaaaa agaggtccta cccacacctg cctcacactt ctcctttcca aggctgcctg 180 

agtttggagg ggcttgggtg tgtgtgaaca agggccctgc attgtctagg cctgcagttc 240 

ccaggcttgg gttcactttc accatgcatt ggcaaaacta gaaaagtaag cttgtgacaa 300 



<210> 94 
<211> 300 
<212> DNA 

<213> Homo sapiens 



<400> 94 

tttgtgcctg agcacccaca atttcaggat ttagactgtg tggcacctca gctttcctct 60 

ggatgtaacc actccttggt gagagaggga actcctcacc aatcccattt gacaaaggct 120 

aggcaatctt cattctgctt ggctttagtc attcttgtca ttgggctgca gaagaaaaac 180 

aactttgctg ggtgatccca ctgccttgat ttcacctcgg agcgaggctg ggccatgtcc 240

aagtcttatg aggtcaccct gactagaaaa aattgaactc acctacaaat agtctgaaag 300

<210> 95 

<211> 300

<212> DNA 

<213> Homo sapiens 

<400> 95 

gtgagtccga gcatcagtgg cttctggagc agaccagcca cgtggaagag aagccttaca 60 

gagatgggtc ggcagagccc tgctgatggc tgggccttgt gggcagccac tctgtgtgag 120 

cagggtgttg ggcccataca cttcaaagac cagagccctg cactgggaga gtgctcctgg 180 

cccaggctgg gaatcacctt tcgaggccct tcagactctg gcggggcttg ctgtggcctc 240

cctccagcta gtggtgtggc tgagcagact ccagggccag ggccagttcc cttctcccct 300 

<210> 96 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 96 

acaactccag acataattaa agactggccc aggaggaaga gggcggtggg ctgtggcgcc 60 

ggctcctctt ccgggagggg cgaggtcggt gcagaccttc ctgggagcct gtcactgctt 120 

gagacagagg gcaaggacca cggccttgaa ctcagcatcc acaggacgcc catcttggag 180 

gattttgagc tcgagggagt gtgccagctc ccagaccagt cgcctcccag gaacagcatg 240 

cctaaggccg aggaagcctc ttcctgggga cagtttgggt tgagttccag gaagagagtc 300 

<210> 97 

<211> 286 

<212> DNA 

<213> Homo sapiens 

<400> 97 

gtccagggcc cangttttaa tttnttttta aaaagcttta ggtcttgccg ggacggtggt 60 

tcacncnnnn nnnnnnnnnn nnnnnnnagg cctaggcggg tggatcacaa ggtcagcagt 120 

tcaagaccag cctgaccagc atggtgagac cctgtctcta ctggaaatac aaaaaaattg 180 

gctgggcgag gtggcaggca cctgtggtcc cagctacctg ggaggctgag gcgggagagt 240 

ctcttgaaac tggaaggcag aggttgcggt gagccgagat tgcgcc 286 

<210> 98 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 98 



ttattctgtt tttgtaaaac agtttcggga 60

ttgttattat attataattt tgtaaaccca 120

tcgactcatg gatgatacct ctattgctgc 180

ttgttacaga gatgttgggc agagctatgc 240

catggtaaaa agttaaccct ttctattttt 300



<210> 99 
<211> 300
<212> DNA 
<213> Homo sapiens 

<400> 99 

agcctcgcct gggccggcct gtggctccca ttttcctttc agcgggacaa aggggacttg 60 

ttaccaggcc attttctgga tggcctgtga gatctctgcc cctccaagac cctccaagtc 120

tgagcctgac ccacagctgg gacactgaat tcagccctgg gaaccatggg ggcttctatc 180 

tggcaccagg ctgcagcctc cccaatccca gcccactttg ctgtgtctct ggcgggctgt 240 

cctccttggt gggagctgtc ctgcacactg taggatgctt aaaggtatcc ctggcctcca 300 

<210> 100 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 100 

tccaaccctg gcgatgtcac cagcatggtg gctcaggtta gagctctctg aggacccagc 60 

atagagcact ggtgccaggg accaaactga gaccccacca ccgtcatcaa cacttacata 120 

ccataaaggt cttcagagtg ccttggccct agacctccct tcattctttg tagagatgga 180 

atctaagaat gaaacatctc cactcagtcc tgcaaatatg gaagttcttg agataccttt 240 

ttttggtaga tacttgtgct ggtattctga gagtcacttt actctgatgg tttgcaagat 300 

<210> 101 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 101 

gtgtttcttc tacctcccct gcacaacatt gtttatatgc cccctaaaat gtaacttctt 60 

tagattctgt tgttacgtgc aacactgtat atctctccat agcacttaat cagagtttgt 120 

aattaggcat ctttttgtgt gattatttgg taaatgtcca tatcccctac tagcctataa 180 

gctccatgac ttctaggtac cctgtctgac tacgtgtatc actggttcta ccgcctaaca 240 

ttgcctagca cattcattgc ttcacaggca tctgaatatg ggtttataaa atacattgct 300 

<210> 102 
<211> 270 
<212> DNA 

<213> Homo sapiens 
<400> 102 

cctggccctg ctgcccctcc tgaatctcgt atgatggtca cagtccggtg gccgtggggg 60 
tgctctgcct tccctggtcc ccactgccca tatctgtgga ctgccccttc caaagacccc 120 
tggaggaggt gtnnnnnnnn nnnttnntgn ncccactacc ntgcactgaa ctggccntgt 180 
tacancaann actgnncccn nttgttatna cacctntnac aaacacctgc tgctgtacat 240 
gncnctactt taaggactnn anacctgtgc 270 

<210> 103 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 103

gctggagcac gctggagagg ccatccctgc cctggcagcc gcggctgggg gagactcctt 60 

tgccccattc tttgccggtt tcctgccatt attggtgtgc aagacaaaac agggctgcac 120 

agtggcagag aagtcctttg cagtggggac cttggcagag actattcagg gcctgggtgc 180 

tgcctcagcc cagtttgtgt ctcggctgct ccctgtgctg ttgagcaccg cccaagaggc 240

agaccccgag gtgcgaagca atgccatctt cgggatgggc gtgctggcag agcatggggg 300 

<210> 104 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 104 

ctcgcgtctc ttcactgcac attgcaatgc atttgcgatt cccatttctc tgctaggagc 60 

cagcctgggt ggcgctgctc ccagagccgt gggtcccaga ccttgcgttc cttttgttcc 120 

tgtccgttta tcaggacacg ggccccacct gtcacgtgcc cgaggccacc caagcccagc 180 

ctgcggggcg ttcccactgc ctggatgccg gcttgagttc tgcgcacgca ggattcagtg 240 

tggggacggc ccctgccgga taggcctagc cctggcccag gtggtgagcg gtttgcagtg 300 

<210> 105 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 105 

gggcactgtg gggctctccc cgcctctcct gccttgtttg cccctcagcg tgccaggcag 60 

actgggggca ggacagccgg aagctgagac caaggctcct cacagaaggg cccaggaagt 120 

ccccgccctt gggacagcct cctccgtagc ccctgcacgg caccagttcc ccgagggacg 180

cagcaggccg cctcccgcag cggccgtggg tctgcacagc ccagcccagc ccaaggcccc 240 

caggagctgg gactctgcta cacccagtga aatgctgtgt cccttctccc ccgtgcccct 300 

<210> 106 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 106 

gctcaacgcc tatgtgaccc atctccatgc cgaatacaat cgacagaagg acatctacct 60 

agcacatcgt gtggcccaag cttgggaatt ggcccagttc atccaccaca catccaagaa 120

ggcagacgtg gttctgttgt gtggagacct caacatgcac ccagaagacc tgggctgctg 180 

cctgctgaag gagtggacag ggcttcatga tgcctatctt gaaactcggg acttcaaggg 240 

ctctgaggaa ggcaacacaa tggtacccaa gaactgctac gtcagccagc aggagctgaa 300 

<210> 107 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 107 

tgtgagtttc ctatctgttc cagactagta tcgccaatct ctcccagctc tcttctttcc 60 
tccctggcct ttgtcctgca ggaggtagca tcacctcttg gcattttgta catgctttta 120
aacaattgga ggagctgccc aggcagtttt atggcctcct ggttgtgtgc cttcacaccc 180
gcctacagcc ccacctcacc atcaagcgct gagccaatgc gggtgtggct ggccctgagt 240
tcctgagtca gctccttgcc agggccagag ctggtaacag cggggcagca gggtgggtag 300 

<210> 108 
<211> 300 
<212> DNA 

<213> Homo sapiens 



<400> 108 

aggttgctca cctgaaggag cacaggaggg ttttccaggc catgtggctc aggttcctca 60 

agcacaagct gcccctcagc ctctacaaga aggtgctgct gattgtgcat gacgccatcc 120 

tgccgcagct ggcgcagccc acgctcatga tcgacttcct cacccgcgcc tgcgacctcg 180 

ggggggccct cagcctcttg gccttgaacg ggctgttcat cttgattcac aaacacaacc 240 

tggagtaccc tgacttctac cggaagctct acggcctctt ggacccctct gtctttcacg 300



<210> 109 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 109 

cacaaggaga agaaagttaa ttaacattga aagatgagaa gacatcttgg aagaacttga 60 

attgggcctt ggaagaagaa cagccattca aatagataga attgtggtag caaaggcata 120 

gaggtaggaa agtatagatc tccagggaca gtagtcatgg ggttggggca ctgttggaat 180 

ttaaggttgg aaggatatat tggagcccct tgaatacggt aacaaggcac accttgggca 240

gtggagagtt atcagagtgt ttgaaaagga gggttattga gtaaataaat agactggtac 300 

<210> 110 
<211> 300 
<212> DNA 

<213> Homo sapiens 



<400> 110 

gacaccccag atgcagccac caccagcaga agcgatcagc tgaccccaca agggcacgtg 60 

gctgtggccg tgggctcagg tggcagctat ggagccgagg atgaggtgga ggaggagagt 120 

gacaaggccg cgctcctgca ggagcagcag cagcagcagc agccgggatt ctggaccttc 180 

agctactatc agagcttctt tgacgtggac acctcacagg tcctggaccg gatcaaaggc 240 

tcactgctgc cccggcctgg ccacaacttt gtgcggcacc atctgcggaa tcggccggat 300 



<210> 111 
<211> 271 
<212> DNA 

<213> Homo sapiens 



<400> 111 

cctggccctg ctgcccctcc tgaatctcgt atgatggtca cagtccggtg gccgtggggg 60 

tgctctgcct tccctggtcc ccactgccca tatctgtgga ctgccccttc caaagacccc 120 

tggggggggt ggggnnttcc ttctannccn ntacnctatg tgtttaatnn ncntantnct 180 

ttantantat ttnccantgn tnntnatatn nttnnanana nncntnctta nnnacattat 240 

tttanntang ngatnntacc ttnntgnaan g 271 

<210> 112
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 112

gttccctcac cttattcctc caagttcccc cttgggaacc tctgagatta acttgataag 60 

ctccttgggc aagctcttta tcctaagatt cctcagtgag ccttatagag ttgctgcgag 120 

aattacattt gttcatgatg tcaagtgtct ggtatgtagc taatgcttat tgaacacata 180 

gtaatttatt gaataattgt catgatcact ggatgagata tagccactgt ggaggtaggc 240 

acaccagggt tttagaggct tgggatcttg caacaggatt ttcctcttgc ctctccaaac 300 

<210> 113 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 113 

cccacatgta ccaggttgag tttgaagatg gatcccagat agcaatgaag agagaggaca 60 

tctacacttt agatgaagag ttacccaaga gagtgaaagc tcgattttcc acagcctctg 120 

acatgcgatt tgaagacacg ttttatggag cagacattat ccaaggggag agaaagagac 180 

aaagagtgct gagctccagg tttaagaatg aatatgtggc cgaccctgta taccgcactt 240 

ttttgaagag ctctttccag aagaagtgcc agaagagaca gtagtctgca tacatcgctg 300 



<210> 114 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 114 

acagttagtg taaaggatct gaatggcata gacttaactc ctgtgcaaga tactcctgtg 60 
gcttcaagaa aagaagatac atatgtacat tttaatgtgg acattgagct ccagaagcat 120 
gttgaaaaat taaccaaagg tgcagctatc ttctttgaat tcaaacacta caagcctaaa 180 
aaaaggttta ccagcaccaa gtgttttgct ttcatggaga tggatgaaat taaacctggg 240 
ccaattgtaa tagaactata cacgaaaccc actgacttta aaagaaagaa attgcaatta 300 

<210> 115 
<211> 288 
<212> DNA 

<213> Homo sapiens 
<400> 115 

gtgatctgcc tgccttggtc tcccaaagtg ctgggaatac aggcatgagc caccgcactc 60 
ggccaggagc tagttttatc agcatcctgc tccactgcct tcctctagtg cagcctggaa 120 
gacatggcag cgggtagctc ctggggctga gccagaagca tcactgcagt gaaagtctct 180 
gcttacctgt ctggctcagc ttgggcaagg gctgggccat atgtgctcag ggacgtgctt 240 
ctcttgtaag gcaggaggat anaanaggac cannaanggn gggagctg 288 

<210> 116 
<211> 300 
<212> DNA

<213> Homo sapiens
<400> 116

tcaattagta acatctgaaa aaacagcttt gtcctgggtg aaaaaggatg ccaaaattgc 60
ctggaaaaga gcagtgagag gagtccggga gatgtgtgat gcatgtgaag caacattgtt 120
taacattcac tgggtctgcc aaaaatgtgg atttgtggtc tgcttagatt gttacaaggc 180
aaaggaaagg aagagttcta gagataaaga actatatgct tggatgaagt gtgtgaaggg 240
acagcctcat gatcacaaac atttaatgcc aacccaaatt atacctggtt ctgttttgac 300

<210> 117

<211> 300

<212> DNA

<213> Homo sapiens

<400> 117

gcactttcca gaattctctc atatttgtgg gctgggatca agcctgcagc ttgaggaaag 60 

cacaaggaaa ggaaagaaga tctggtggaa agctcaggtg gcagcggact ctgactccac 120 

tgaggaactg cctcagaagc tgcgatcaca actttggctg aagcccctgc ctcactctag 180 

ggcacctgac ctggcctctt gcctaaacca caaggctaag ggctatagac aatggtttcc 240

ttaggaacag taaaccagtt tttctaggga tggcccttgg ctgggggatg acagtgtggg 300 

<210> 118 
<211> 300 
<212> DNA 

<213> Homo sapiens 



<400> 118 

agaacgttct caggttgacc agctgctgaa tatttcttta agggaggaag aacttagtaa 60 

gtcattgcag tgcatggata acaatcttct gcaagcccgt gcagcccttc agacagctta 120 

tgtggaagtt cagaggctac ttatgctcaa gcagcagata actatggaga tgagtgcact 180 

gaggacccat agaatacaga ttctacaggg attacaagaa acatatgaac cttctgagca 240 

cccaggtttg gcatagaaat ggtacccctt gttcaaaatg aacaagaagc cttagatttg 300



<210> 119 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 119 

gaacaaagaa ggaatgtctt cctcatgttt gggtctatag aagacgttaa agaaaacttc 60 

cagaaagtgg gtttgaggca tgagccacca cgcctggcca aaggatttaa tgaattaatg 120 

gatgtacagt gctggggctg gtattctagg gcctgcattg agactcacat tttgccatca 180 

aaagcctttt aagaggtgga ggttgcggtg agctgacatg gtgccactgc actccggcct 240

gagtgacaga gtgagactct gtctcacaaa aaaaataatg ccctttaaat aatgaataat 300 



<210> 120 

<211> 273 

<212> DNA 

<213> Homo sapiens 

<400> 120

cctcagcctt ctaaaaagct ggggctacac ccagctgaag aaattgtaac taaagataga 60 

ttgtttaaag caaagcaaga aacttctgaa gaaatggaac aaagtggaga agcctcagga 120 

aagcccaaca gagagtgtgc accccagatt ccttgtagta ctcctattgc tactgaaagg 180 

acagttgcac atttgaacac tctgaaggac cgtcacccag gtgatttgtg ggcccgcatg 240 

cacatctcat cccttggaat atgctgcagg aga 273

<210> 121 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 121 

agaacgttct caggttgacc agctgctgaa tatttcttta agggaggaag aacttagtaa 60 

gtcattgcag tgcatggata acaatcttct gcaagcccgt gcagcccttc agacagctta 120 

tgtggaagtt cagaggctac ttatgctcaa gcagcagata actatggaga tgagtgcact 180 

gaggacccat agaatacaga ttctacaggg attacaagaa acatatgaac cttctgagca 240 

cccaggtttg gcatagaaat ggtacccctt gttcaaaatg aacaagaagc cttagatttg 300 

<210> 122 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 122 

gttgcaagca gccttggaat agtaactctt ctcatttgtt tgggatctgg ccaccaagtt 60 

ccagaatgat acacggatca gtgcagaagt tcatcaggct ctcggacctt agggctgttg 120 

gagaaggctt cagcagcaga actgatggtg aaggctcgtg ttctccatcc tcaactttct 180 

ttgcttcgat catacacaag aatacatttg gaagggcaaa aaaatgaaca ctgtcgttca 240 

ttgcagccgt gttttgtgac acagatgcac agtctgctgt gaagaccttc tctcaagtgg 300 

<210> 123 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 123 

gtgatttcag cttccaaact ggtatacatt ccaaactgat agtacattgc catctccagg 60 
aagacttgac ggctttggga ttttgtttaa acttttataa taaggatcct aagactgttg 120 
cctttaaata gcaaagcagc ctacctggag gctaagtctg ggcagtgggc tggcccctgg 180 
tgtgagcatt agaccagcca cagtgcctga ttggtatagc cttatgtgct ttcctacaaa 240 
atggaattgg aggccgggcg cagtggctca cgcctgtaat cccagcactt tgggaggcca 300 

<210> 124 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 124 

catgctggcc agcatccctg cctgtgcaag ctctggatga gctgtgagcc cctgccaccc 60 
acacccccac tccctgccag cctggcctca gggcctctga tccatgtgca ctggagagga 120 
gatgactgac agggccactg gggcatttcc acgttaacag cagctgccac tggcaaaaga 180
agtgactcgc caatggaggc atctcagatg tgggcccagg agtctgggga gctactttga 240 
acagggctat ccattcattg tcccaccaaa ggctatggag cccacccacc atgtgctgga 300 

<210> 125 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 125 

ggtaaattgg ttgaattatt gtattgaagc ttgagctgta gctaaaagta atttaggttt 60 
cccctaagat gttattatgt tagggacata acacttttgg gaggttgttg tgggagatgg 120 
ttgatttagg ttttcaaaag ctagaaataa aatttacatg ccttagattt cataaaattc 180 
tgctctaatt gggtggaagg tgctgtatct aacttgtgtt cctcctaagg ttatgtccta 240 
ataactattc ttttaggagt atacttctac tttatagaag gttgcttttc tttttaattt 300 

<210> 126 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 126 

tgaagaggag atcggtgacc tgggctcctt atgtgcctga aagagtttga gtttcctgtt 60 
aactccaaat caacagtatt ttcaacaaga aatgtgcaat tgaaatcaag tgctgtttaa 120 
gtgcagctag gatttccaca ggaagacact tgcagtgaac agagttatgg agcagcaaaa 180 
acacagatct atttggaaaa agagaaaaca tatgcgttgt attttgcttc aattataaaa 240

taccatcctc tcaaaggtgg ttctaaatta caaaggactt tgatttctag gtagattctg 300 

<210> 127 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 127 

ggtgattccc atgctgaaca gtttgatctc ctgccagagt gtcgggccac aaactgggca 60 

gcacatcagg atcacctggg ggccttcaaa aatcaaaaat ccacccccag gccatgccct 120 

ggacccactg caccaggaca agaaatccac cccaggcctc tccccagacc cactgcacca 180 

ggacaagaaa tccaccccca ggccacgccc cagacccact gccctaggat gtgggggtgg 240

gaaccaggtg gtgctttgta aagacgtgca ggtggtaacc ccaggccccc acgctcggaa 300 

<210> 128 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 128 

tgagctggga gaaggggaga aagtttgtga agaggagatc ggtgacctgg gctccttatg 60 

tgcctgaaag agtttgagtt tcctgttaac tccaaatcaa cagtattttc aacaagaaat 120 

gtgcaattga aatcaagtgc tgtttaagtg cagctaggat ttccacagga agacacttgc 180 

agtgaacaga gttatggagc agcaaaaaca cagatctatt tggaaaaaga gaaaacatat 240 

gcgttgtatt ttgcttcaat tataaaatac catcctctca aaggtggttc taaattacaa 300 

<210> 129
<211> 285 
<212> DNA 

<213> Homo sapiens 
<400> 129 

ggaaagcaca aggaaaggaa agaagatctg gtggaaagct caggtggcag cggactctga 60 
ctccactgag gaactgcctc agaagctgcg atcacaactt tggctgaagc ccctgcctca 120 
ctctagggca cctgacctgg cctcttgcct aaaccacaag gctaagggct atagacaatg 180 
gtttccttag gaacagtaaa ccagtttttc tagggatggc ccttggctgg gggatnnnnn 240 
nnnnnnnnnn nnnnnnnnnn nnaggaagat accatttctt gacgg 285 

<210> 130 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 130 

ccggacgcag gccctcgggc aggagcatct ggcagagtgg ggggcgtggc aggcaccctc 60 

ctttgcaggg cgaggtgggg cctctgcagc catcctggac aggccggggt ggcggcagct 120 

ttgcccacgt ggaagcgggg tgggtctcac ttgcgtggtg gcccctggcc ccatcttgcc 180

tgctgcggcc tggggagcag gcgctgggtg gtggttctgc ctgcttgctg ctcgttcccc 240

gggcatgcgt gggcagcggg gggcatgcgt gggcagcagg gggccgtggg cagcgggggc 300 

<210> 131 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 131 

gatctctata ctagtgaaca gtgccagttc cacactttgg acttagaact gttctctagt 60 
tattgtaaca cagaatactg tcaatcccta atttacttaa tgttacttat tggaagtggg 120 
gctgatgaaa tacgcacagg agggaaatct actgtgttta ggcacaggca gccccagtgt 180 
ataaggagat catattccaa aaggttgtca gttggttgtt tgcaacctgg aatgtatttt 240
cctttagaga ccaggttatc catggtggtt aggcccctag agcagctgga aaagatgatc 300 

<210> 132 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 132 

ctcccatgga ggtggtggga atggcaccga gaagtttgat gacagttatc taatggacta 60 
gaggttggca aactttctgt aaatggccag gtagtaaata gttctgcttt tgaaggcata 120 
tggtctcttg cacctactcg aggctgaaag cagctataga caatacataa atgaatgagc 180 
gtgagtgtgt tccaataaga aaaaaacatg gctgtttgct tcggccccag ggttgtagct 240
taccagtcct gtaacagatc acagtttgct cttttggtca caaatacttg aacccctccc 300 

<210> 133 
<211> 269 
<212> DNA 
<213> Homo sapiens



<400> 133 

atgctatgcc aaagcctgct gccagctcca tagcctggac ctacagcact gcatggtgga 60 

gtccacagct gtggtgagct tcttggagga ggcagggtcc cgaatgcgca agttgtggct 120 

gacctacagc tcccagacga cagccatcct gggcgcactg ctgggcagct gctgccccca 180 

gctccaggtc ctggaggtga gcaccggcat caaccgtaat agcattcccc ttcagctgcc 240 

tgtccaggct ntgcaaaaag gctgccctc 269 

<210> 134 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 134 

gatggatgag actgttgctg agttcatcaa gaggaccatc ttgaaaatcc ccatgaatga 60 

actgacaaca atcctgaagg cctgggattt tttgtctgaa aatcaactgc agactgtaaa 120 

tttccgacag agaaaggaat ctgtagttca gcacttgatc catctgtgtg aggaaaagcg 180 

tgcaagtatc agtgatgctg ccctgttaga catcatttat atgcaatttc atcagcacca 240 

gaaagtttgg gatgtttttc agatgagtaa aggaccaggt gaagatgttg acctttttga 300 



<210> 135 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 135 

ggcgagcggg aacagctctt gaggagtgag actgcaggag atgtgggccg tgccaaagag 60 

atggatgaga ctgttgctga gttcatcaag aggaccatct tgaaaatccc catgaatgaa 120 

ctgacaacaa tcctgaaggc ctgggatttt ttgtctgaaa atcaactgca gactgtaaat 180

ttccgacaga gaaaggaatc tgtagttcag cacttgatcc atctgtgtga ggaaaagcgt 240

gcaagtatca gtgatgctgc cctgttagac atcatttata tgcaatttca tcagcaccag 300 

<210> 136 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 136 

gacttctaaa tatatcttgg atataatagg tgataagttc tgtcaattag taacatctga 60 

aaaaacagct ttgtcctggg tgaaaaagga tgccaaaatt gcctggaaaa gagcagtgag 120 

aggagtccgg gagatgtgtg atgcatgtga agcaacattg tttaacattc actgggtctg 180 

ccaaaaatgt ggatttgtgg tctgcttaga ttgttacaag gcaaaggaaa ggaagagttc 240 

tagagataaa gaactatatg cttggatgaa gtgtgtgaag ggacagcctc atgatcacaa 300 

<210> 137 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 137 
ttgacaaatt gctggaacac acttattgtg gtttacccgg ttttaattat gtcagagatt 60

gcatcatcct tatgcttgtt tacatctata atcttctatg aaatggtggt accaaggggc 120 

gcccaacagc ttttatcccc attcttagag catattcttt attataatga ttatccaaca 180 

tatttcttta attttaatac aaaaaataca tcatttaatt tttgttacat atgaacattc 240 

atttttaaat gctcagcctc aagtgcaggc atttttgagt ggcctgatta catattcctc 300 



<210> 138 
<211> 300 
<212> DNA 

<213> Homo sapiens 



<400> 138 

ggaaggggag ggttggtgag tcccagacct taaaaataca aggttaagag ggaccccaaa 60

gcaaaaaatt ccaacccttt tcctcccagt cattgaaaca ccaaaactat tataccggag 120 

ggtgtaatag ttttgctgcc cagttgtggt aggccagtag tggcctccca agatgcccat 180 

gtcctaatcc caggaacctg tcaaaattac cttgtatggc caaaggggct ttgcagatgt 240 

aatgaagtta aggatctttc gccaggaaga ttatcccagc ttgttcagga gggcttgatg 300 

<210> 139 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 139 

gacatcattt tcttattcta gtaagagaaa gtacacagat tcaactttag agaggacttt 60 

tttttttctg gagctaaatc aaggaaggat tatcacgtgg cctcccttga atataatttt 120

gaagctgtga acagtaccat cagtaacatt ttatggacag ctctgatggt ttttatacca 180 

cggcactctt cttacctttg ggggaagcta tctggagtta tgactgatgt gtaaagtggt 240 

ttactgttag aatcctggtg tgctaggatt ctgggagagt cactttcagg aagttacctg 300 

<210> 140 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 140 

gctgcccagg cagttttatg gcctcctggt tgtgtgcctt cacacccgcc tacagcccca 60 

cctcaccatc aagcgctgag ccaatgcggg tgtggctggc cctgagttcc tgagtcagct 120 

ccttgccagg gccagagctg gtaacagcgg ggcagcaggg tgggtagcct ctaccagcca 180 

gggcagtccc tgaggggcca gcaggggggc tgactgccta gtggctcaac ctcctgaacc 240 

cacccactcc cagcgatgct acccagaacc ccaacggcat gaatcctgca cagtgccggg 300

<210> 141 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 141 

cccaaactta tcgggggtgc cagaggcaga gtagacaagc cttagtggcc gccatttgtt 60 

gaatatctac tgtgcgccaa gcagtgcgtc acaactttat gaagtaggta ttattatcat 120 

ccccatttta caggtgaaga aactgagtct ctgagagacc aacttttcca aggtcacaca 180 
gaggtgggat ccagcccact tccgtctgac cccaagcccc tgctgttaac ccctgcccca 240
ttgtggggag gttccggccc actctggagt tctctggtct gcgtcagtcc tcaggagaag 300 

<210> 142 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 142 

gaaaggtggc gcgcttctca cggctgagtt gctgcgcctg cagacggaag ctccccacag 60 

gcagagctgc ttggatgtgt gagtcatgaa gccagagaag ccccgctcca tgagcagtga 120 

ctccccaggc cctgtgacct ccctcctgtc ttgcagctcc tcctggcacc agtccccagg 180 

gctctcctgt tggtagttcc tgcttttctt cttggaaatt cctcgtggac ctcgagatct 240 

ttaccctaaa atagttctgt tgaatttcac cctggcaatg taaattgata gcttatcttc 300

<210> 143 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 143 

cttggccttg cttctctgag aaaactttgg tcacacctcc agagccaggg tgggtgcctc 60
cctggaggag ggggctttcc tggttggtgg cacagcagga gtccaggctt tgtaccgtgg 120
acaccatggg ctatggcaac accttcctca ccatccttcc atgaggacct cgggagagag 180
tggacatgaa accctttgtg ctctgaagca ttcaacagaa gctttctggt tctgtgccta 240
tttctttggc acttgagcgt gtttgcaggt tcattacaca catgatgaaa gctctggccc 300

<210> 144
<211> 300
<212> DNA

<213> Homo sapiens
<400> 144

cctgactgag tgcctgacgg tggaccccct cagtgccagc gtctgaaggc agctgtaccc 60
taagcacctg tcacagtcca gccttctgct ggagcacttg ctcagctcct gggagcagat 120
tcccaagaag gtacagaagt ctttgcaaga aaccattcag tccctcaagc ttaccaacca 180
ggagctgctg aggaagggta gcagtaacaa ccaggatgtc gtcacctgtg acatggcctg 240
caagggcctg ttgcagcagg ttcagggtcc tcggctgccc tggacgcggc tcctcctgtt 300

<210> 145
<211> 300
<212> DNA

<213> Homo sapiens

<400> 145

gccagagcct agaggagaga tcaaagaccc tggccgaagt gaagcccatt ctgcaagcaa 60
ctgggttccc atggcatgtg gtggccttag aggaggtgtt cagcctgcca ccgtcggtgc 120
tttggtgctc tgcccaggag ctggtgggat ccgagggggc ctacaaggcg gccgtggaca 180
gcttcctcca gcagcagcat gtgctggggg ccgggggtgg tcctggnccg actcaagggg 240
annnnnnnnn nnncncaacc cccgctggac ccccngaanc tggcaagacc ngctgcccct 300

<210> 146
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 146 

tgactttgta cctggtccaa gctgttgggg aattgctgct gttgacccag gcaggagtct 60 

gactagagaa caaactaagg ttgctgcaac aaacaaggac ctcttccaag aagggctccc 120 

aggcctggcg cagtgactca tgcctgtgat cccagcactt gggaggccga ggcgggtgga 180 

tcatttgagg ccaggagttc gagaccagct tggccaacat gatgagaccc cgtctctatt 240 

aaaaatacaa aaattagcca ggcgtggtgg cgcctgtagt cccagctact caggaggttg 300 

<210> 147 
<211> 295 
<212> DNA 
<213> Homo sapiens 



<400> 147 

ggnaangcna nngnaggaga nagagaagna ncagtnnagn cccangaaac ccnntgaaac 60 

ccttagaagn cagaggagng aaaggangaa aaananggnn ggangagaac nnannnnggn 120 

caaannaagg anganngnta ggngngaaaa anaanaacaa anggggaaaa ngggaaaaaa 180

ggcganaaag gnaanannag nanaaggngg aananannnn annagaaagg ncaanaaaag 240 

aagnacaaag aaaaangana anaagnaann annanannga cagagacaag aagga 295 

<210> 148 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 148 

cgctgtgctt gagaccaacc tgacgggtac cttctacatg tgcaaagcag tttacagctc 60 

ctggatgaaa gagcatggag gatctatcgt caatatcatt gtccctacta aagctggatt 120 

tccattagct gtgcattctg gagctgcaag agcaggtgtt tacaacctca ccaaatcttt 180 

agctttggaa tgggcctgca gtggaatacg gatcaattgt gttgcccctg gagttattta 240 

ttcccagact gctgtggaga actatggttc ctggggacaa agcttctttg aagggtcttt 300 

<210> 149 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 149 

agtgtcagtt ttcctaatct cagtccaggt aggaattaag aaatatctca agtgttgatg 60 

ctatccaagc atgttggggt ggaagggaat tggtgcccag aaaatgggac tggagtgagg 120 

aatatctttt cttttgagag tacccccagt ttatttctac tgtgctttat tgctactgtt 180 

ctttattgtg aatgttgtaa cattttaaaa atgttttgcc atagcttttt aggacttggt 240 

gttaaaggag ccagtggtct ctctgggtgg gtactataat gagttattgt gacccacagc 300 



<210> 150 
<211> 300 
<212> DNA 
<213> Homo sapiens



<400> 150 



tgtagacttt atgtcagttc tgtgtagact ttatgtcagt ttttgtcatt atttgaaaat 60

ctattctgac aactttttaa ttcctttgat cttataagtt aaagctgtaa caactgaaat 120

tgcatggatc aagtaagcat agttttatcc agggagaaaa ataaaaggaa gccatagaat 180

tgctctggtc aaaaccaagc acaccatagc cttaactgaa tatttaggaa atctgcctaa 240

tctgcttata tttggtgttt gttttttgac tgttgggctt tgggaagatg ttatttatga 300



<210> 151 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 151 

gcgggcccgg ccagcggaag cccctgcgcc cgcgccatgt caaagaaaaa aggactgagt 60 
gcagaagaaa agagaactcg catgatggaa atattttctg aaacaaaaga tgtatttcaa 120 
ttaaaagact tggagaagat tgctcccaaa gagaaaggca ttactgctat gtcagtaaaa 180 
gaagtccttc aaagcttagt tgatgatggt atggttgact gtgagaggat cggaacttct 240 
aattattatt gggcttttcc aagtaaagct cttcatgcaa ggaaacataa gttggaggtt 300 

<210> 152 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 152 

gatattcaca cagtatgtat tatattaacc atatcacact taagttatta aattcagact 60 
atttgtaact tattgttata gggcctgccg tatggcttag gatatttgag taatcatata 120

tttaaagtaa aaactttggg ctgggcacag tggctcacac ctgtaatccc agcacttggg 180 
gaagctgagg tgggcagatc agttgaggtc aggagttcta gaccagcctg gtcaacatgg 240 
cgaaacccca tctctactaa aaatacaaaa attagctggg cgtggtggca cacacctgta 300 



<210> 153 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 153 

cagagaccag ccttctccag aggctgtcac tgcaggagcc gtgggcctgg gaagacttgg 60 
aagcggcctc tctcaactgg tttctgtctc cgtggagctg gaactgcctg cacttgcctt 120 
cagagggagg cacagtccac ccagatccac ctttccagca agacccccag tggctgccca 180 
gcctgggagc acctctttgc ttttcacacc aaaccaaaac tggcgagagc ccctcctagc 240 
caccagtgat ccccaagcat ccagtacaga accaggcatc gagctagctc cctgcacggc 300 

<210> 154 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 154

cttgacctct gtactttaaa gaaatcacta accaaatttt caaagtttcc ttttaaatgc 60 

gtttagctag aaatctatgt atttatccct ttcctatttt gcattcttct cccactattt 120 

ttaaaaactc atttacagta gaaaccattc ttctttctcc caacagtatc ctttgccaag 180 

accatgagaa cagtatggga gcatgttgtt ggtcagggtt tcagaatacg cgtgatgtca 240 

ctgagaatgt ttgctcacag tcaataattg tctttgtgga tgtgataatt ttggagatac 300 

<210> 155 

<211> 81 

<212> DNA 

<213> Homo sapiens 

<400> 155 

gatcattgtt aattagtgac atagtaacat ctgtagcagc tggttagtaa acctcatgtg 60 
ggggaggtgt gggaggtttt a 81 

<210> 156 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 156 

ggcagcacaa gtgtgcaaac agctatggaa agtgaactcg gagagtctag tgccacaatc 60 

aataaaagac tctgcaaaag tacaatagaa ctttcagaaa attctttact tccagcttct 120 

tctatgttga ctggcacaca aagcttgctg caacctcatt tagagagggt tgccatcgat 180 

gctctacagt tatgttgttt gttacttccc ccaccaaatc gtagaaagct tcaactttta 240 

atgcgtatga tttcccgaat gagtcaaaat gttgatatgc ccaaacttca tgatgcaatg 300

<210> 157 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 157 

ctggtgagga gtctttgcga gagcgaggag cagcggttac tggaacaggt gcatggcgaa 60 

gaggagcggg cccaccagag catcctgaca cagcgggtgc actgggccga ggcgctgcag 120 

aaacttgaca ccatccgcac tggcctggtg ggcatgctta ctcacctgga tgacctccag 180 

ctgattcaga aggagcaaga gattttcgag aggaccgaag aagcagaggg cattttggat 240

ccccaggagt cggaaatgtt aaactttaat gagaagtgca ctcggagccc actactgacc 300 

<210> 158 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 158 

cgacagctct ccaatactca ggttaatgct gaaaaatcat ccaagacagt tattgcaaga 60 

gtttaatttt tgaaaactgg ctactgctct gtgtttacag acgtgtgcag ttgtaggcat 120 

gtagctacag gacattttta agggcccagg atcgtttttt cccagggcaa gcagaagaga 180 

aaatgttgta tatgtctttt acccggcaca ttccccttgc ctaaatacaa gggctggagt 240 

ctgcacggga cctattagag tattttccac aatgatgatg atttcagcag ggatgacgtc 300 
<210> 159
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 159 

agtacccaga gttgcgagga gttttttaac tgatttagcc aggtggcaat catgagtgaa 60 
tggatgaaga aaggcccctt agaatggcaa gattacattt acaaagaggt ccgagtgaca 120 
gccagtgaga agaatgagta taaaggatgg gttttaacta cagacccagt ctctgccaat 180 
attgtccttg tgaacttcct tgaagatggc agcatgtctg tgaccggaat tatgggacat 240

gctgtgcaga ctgttgaaac tatgaatgaa ggggaccata gagtgaggga gaagctgatg 300 

<210> 160 
<211> 294 
<212> DNA 

<213> Homo sapiens 
<400> 160 

ctttgagcta ggataaaaat tgggtaaagg acatttgctt acctgcaaat gaatcactgt 60 
ggaaatgtga tcttcccata tcatcaagaa acttgttttc tggatgaata ctgggagaat 120

aaaatgagaa ctctggagtg agctaaattg atcccaatta agtttttctg cttagcagac 180 
agaaggtata attttttgac accctttccc acctggtgcc tatgctaggc ttgtnctgat 240 
aacatccctc actnactnga tnntcacatn gnncttncnc tgangtccca tttt 294 

<210> 161 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 161 

cttcctcaaa gcatggttgc tgagtaccca gagttgcgag gagtttttta actgatttag 60 

ccaggtggca atcatgagtg aatggatgaa gaaaggcccc ttagaatggc aagattacat 120

ttacaaagag gtccgagtga cagccagtga gaagaatgag tataaaggat gggttttaac 180 

tacagaccca gtctctgcca atattgtcct tgtgaacttc cttgaagatg gcagcatgtc 240

tgtgaccgga attatgggac atgctgtgca gactgttgaa actatgaatg aaggggacca 300 

<210> 162 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 162 

gccccgtgtg gggagacgga cagcaccctc ctcatctggc aggtgccctt gatgctatag 60 
cgcctcccct ctcccctcag agggcacagc tgcaggcctg accaaggcca cgcccggctc 120 
tcgtgctcta ggacctgcac gggacttgtg gatgggcctg gactctccag aaactacttg 180 
ggccagagca aaggaaaacc tcttgtttta aaaaaatttt tttcagagtg ttttggggag 240 
gagttttagg gcttggggag agggaggaca catctggagg aaatggcctt ctttttaaaa 300 

<210> 163 
<211> 300 
<212> DNA 
<213> Homo sapiens



<400> 163 

gaccggctgg gcctacaaaa agatcgagct ggaggatctc aggtttcctc tggtctgtgg 60 

ggagggcaaa aaggctcggg tgatggccac cattggggtg acccgaggct tgggagacca 120

cagccttaag gtctgcagtt ccaccctgcc catcaagccc tttctctcct gcttccctga 180 

ggtacgagtg tatgacctga cacaatatga gcactgccca gatgatgtgc tagtcctggg 240 

aacagatggc ctgtgggatg tcactactga ctgtgaggta tctgccactg tggacagggt 300 



<210> 164 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 164 

aaaatttata ngtaatgaca aatgacttat cagtgttcat catctgaaag ctaagtggtt 60 

cgttcaatca ctttttcaaa gttgatagta gattgcatgg tttcatgttt cctcatattg 120 

gtttattaat tctatttaat caaggaaaat aacttcagat tccataaagt ttcagtttat 180 

ttttagttta ctactaggtg agatagcaca ttacatactt ttactatcaa atattatttt 240 

agcagcttcc catagtacca aatgatttga ttccctactc tcatttttta aagcatataa 300 



<210> 165 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 165 

ctggactctg agtcgtcttg gtcccaggag ccagtagtga aggcaacagt ctgcccacct 60 

gtggacacca gatcctggga gctcctggtt agcaagtgag atctctggga tgtcagtgag 120

gctggttgaa gaccagaggt aaactgcaga ggtcaccacc cccaccatgt cccaggtgat 180 

gtccagccca ctgctggcag gaggccatgc tgtcagcttg gcgccttgtg atgagcccag 240 

gaggaccctg cacccagcac ccagccccag cctgccaccc cagtgttctt actacaccac 300 



<210> 166 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 166 

cttctgttga ttggtttgtt taaagtacct aagtactacc ctttgactcc ctaccaaaag 60 

ttcttttgtt ttttaaacaa cttttatttg tgacttactt tcttgagaag tgttcttaat 120 

gaattgcata aaatagtggt agcagcttat ttcttaagta ctttattatt tgtgctttac 180 

catttcaggt tcttatcttt aacccttatt tactcagttt tccatctgaa tgatcctatc 240 

tctaaattaa ggatttaata aatgctgcaa attgtccact ttgcaaattg tccaaaagct 300 



<210> 167 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 167 
gcgagatgaa gctacactgt gaggtggagg tgatcagccg gcacttgccc gccttggggc 60

ttaggaaccg gggcaagggc gtccgagccg tgttgagcct ctgtcagcag acttccagga 120 

gtcagccgcc ggtccgagcc ttcctgctca tctccaccct gaaggacaag cgcgggaccc 180 

gctatgagct aagggagaac attgagcaat tcttcaccaa atttgtagat gaggggaaag 240 

ccactgttcg gttaaaggag cctcctgtgg atatctgtct aagtaaggat tccatatggc 300 

<210> 168 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 168 

gtctgggcag cctacgcttt ccggataaaa atggcagaat gaaagaatta tgagtggaac 60 

tagagaatag gaaagacatg aaccaacgcc caaaatgaga aagaaggaca tataaagaaa 120 

aagacaaata caagtgaaaa aaatatacta atggattaac gtccctgtcg agtgacattt 180 

tctgactatg gaaatgatat tagacaaaaa gcaacttcaa gtgggtttct tatttgagtt 240 

caaaatgggt cataacgcag catagataac ttgaaacatg aacagcgcat ttggcccagg 300

<210> 169 
<211> 296 
<212> DNA 

<213> Homo sapiens 



<400> 169 

gagatctctg ggatgtcagt gaggctggtt gaagaccaga ggtaaactgc ggaggtcacc 60 

accctcacca tgtcccaggt gatgtccagc ccactgctgg caggaggcca tgctgtcagc 120 

ttggcgcctt gtgatgagcc caggaggacc ctgcacccag cacccagccc cagcctgcca 180 

ccccagtgtt cttactacac cacggaaggc tggggagccc aagccctgat ggccccgtgc 240 

cctncattgg gncccctggc tanttcancn agncccncag gtngagncca aagcca 296 

<210> 170 
<211> 300 
<212> DNA 

<213> Homo sapiens 



<400> 170 

gggtgttgga gcagattgta gttgatccac agcaaagagc atcaccaaag ccattccagg 60 

aggaactaga tccaccactt cctctgctgg gcatgctcca aaaatggttg tggcttccag 120 

agaggactcc aaaagaaagc acaaaaacta gacagtggga gggcataccc aaaagccctg 180 

agtttctgaa aaaatattga aagtttctat ggtgaaatag gaagttaatg tgcttaggaa 240 

gaaaaaagtg gtaatgattc aaggaaacat aatcacacac ggttttagtt ttaatggaca 300 

<210> 171 
<211> 300 
<212> DNA 

<213> Homo sapiens 



<400> 171 

atggaggcac cagcaggtag tggcccctgt aagcagggcc agagtcggga caaagagcag 60 

gagtgaagca gccaagagac agaggaccag gctggagcca gtgggcacgc aggagcctgc 120 

ctgggaaaag ccggggggca aggctggcat gggaatgaac acctgctggt gacacctctc 180 
tgagcttcag ttcccttaac tagaaaaata gaacaggccc ggtgcggtgg ctcatacctg 240
taatcccagc actttgggag gctgaggcgg gtggatcatg aggtcaggag atcaagacca 300 

<210> 172 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 172 

ggcggaggag cagaagctca agctggagcg gctcatgaag aacccggaca aagcagttcc 60 
aattccagag aaaatgagtg aatgggcacc tcgacctccc ccagaatttg tccgagatgt 120 
catgggttca agtgctgggg ccggcagtgg agagttccac gtgtacagac atctgcgccg 180 
gagagaatat cagcgacagg actacatgga tgccatggct gagaagcaaa aattggatgc 240 
agagtttcag aaaagactgg aaaagaataa aattgctgca gaggagcaga ccgcaaagcg 300 

<210> 173 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 173 

gtctttccca ttcacttctc tagaaagctg ccaagacaga ggcagaaaga aatggatgat 60 

agttctgtca agcacacttc tgttctctta gaacttagaa gtgtttctaa gagaacagaa 120 

gtaataagag aaacagttac gtgtggaatt caacatcttt ggttggaacg cattggcttt 180 

ttttttcttg ttttgataga aatggaatta agcaaaagta gtttttgtct tttctgttgt 240

cttcaaattt caggccatct atttttaatt taatcccgtt caagtacttg attgttatac 300 

<210> 174 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 174 

attatttcca aagcagccta cagtagaaaa tagtcattat ggcagcagct tctgatgttt 60 
ttgtttggta ggttttctga tttcaatata tagaatcata ttcatagagt atcttctttt 120 
aacgaattgc acaaagtacc catttaaaat ttacatgcac agttcattgc cacctttctt 180 
aggcctatgc atagttaata aggttataat ctactcaaca tggaaaatgg agcctatttg 240 
caaacacaca agtaattaaa gtaccaattc tctcttagtt tcttttttta tagttggttt 300 



<210> 175 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 175 

tgganactct ttantatgga aggtgaattt cctgtcaaca tagtccagga caaagcagtt 60 
ccaattccag agaaaatgag tgaatgggca cctcgacctc ccccagaatt tgtccgagat 120 
gtcatgggtt caagtgctgg ggccggcagt ggagagttcc acgtgtacag acatctgcgc 180 
cggagagaat atcagcgaca ggactacatg gatgccatgg ctgagaagca aaaattggat 240 
gcagagtttc agaaaagact ggaaaagaat aaaattgctg cagaggagca gaccgcaaag 300 
<210> 176

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 176 

tataaacttt attttattct cttctggggt agagttacat gacaagaaat tgaattaatt 60 

caataaaatt ttagttcggg ttgcttaggt ttttactgct cccattcttg cttttactaa 120 

tttatccaag attagatgtg attactattt aataataatt tagtcctcac acttacaaac 180 

cacttacaat accagcatgc ttctatcact gtaattctat tcaattctca ggcccatgag 240 

gcatgccagc cagacgacca gacagcattt atagagaggg cactcaatac cagccacaaa 300 

<210> 177 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 177 

gactggagaa gtcagaagta gaaaagcaga ttgctaggag agacaggatg acagattttg 60 

gtcagaaaat gggatattgg agtttaaagt atcaaataca gaatagttcc agatgttcag 120 

agatccagca tgggattagg tactgaaatg gattagaact aaaagtcact agaatttaga 180 

aattgagaac catgagagtg gatgcaatga cttgttgctt gattgaaaaa taaattaata 240 

ataataaagg accatgagac tagcctgtta tagggggtat ctccatgann nttgtttttc 300



<210> 178 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 178 

tcctggtgtc aaacactata aacctttgac cagctgagct gtgactgctg tcacatatct 60 

gagtcctgtg tgcacagtaa tatcctgggt caggtaaaat ccaggtcttc aagttttaag 120 

gattttttga agaattcggg cttctttaag acgatccatg cccaaatcca caagcttgtt 180 

gacagtggat tacagtttgt gtggcaaagt ccaagttgtt acactgtgct ttaaaaaaaa 240 

tcttatctgc atgtattgtt aacttagaga ccatgagatc tatttatcag gaccaggaag 300 

<210> 179 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 179 

ctcatgcctg taatcccagc actttgggaa gcagaggtgg caggatcatt ccagcccagg 60
agttcaagac cagcctgggc aacacagtga gtgagaccct gtctctattt aagaaaaaat 120
aattaagaaa ttttattaaa aaagaagaat caggaaacca agtccaaccc aactaaacct 180
caaatgaacc agcccctaac acagatgagg ggatttggga ctgataagct ctgtgctgtg 240
tccatggccc gtcatttatc aaggctgcag ctttgtaaat gtggctattt ttatgttgtg 300



<210> 180 
<211> 300 
<212> DNA 
<213> Homo sapiens



<400> 180 

gtgatctgcc tgccttggtc tcccaaagtg ctgggaatac aggcatgagc caccgcactc 60 

ggccaggagc tagttttatc agcatcctgc tccactgcct tcctctagtg cagcctggaa 120 

gacatggcag cgggtagctc ctggggctga gccagaagca tcactgcagt gaaagtctct 180 

gcttacctgt ctggctcagc ttgggcaagg gctgggccat atgtgctcag ggacgtgctt 240 

ctcttgtaag gcaggaggat agaagaggac caagaaggga gggagctgcc ctgtggtgca 300 



<210> 181 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 181 

cccatgccgg gatcttccca cacccgtcct cacagatcca gccccagccc cttgcttccc 60 
aggccatctc tcagcagcac ctgcaggatg cgggcacccg ggagtggagc cctcagaacg 120
catccatgtc ggagtctctc tccatcccag cttccctgaa cgacgcggct ttggctcaga 180 
tgaacagtga ggtgcagctc ctgactgaaa aggccctgat ggagcttggg ggtgggaagc 240

cgcttccgca cccccgggcg tggttcgtct ccttggatgg caggtccaac gctcacgtta 300 

<210> 182 
<211> 300 
<212> DNA 

<213> Homo sapiens 



<400> 182 

tttgcagtgt tgtcagaaac aaataataaa gccccaaaag attaactagt tgaaaaaact 60 

ggcaaaatct gtatacgtgg aaatttacca ggacagagac tgaagaataa agaaaatgag 120 

tttcattgcc agatcatgaa atccaaagaa actttaaaga agatgagttg tgtaaatgga 180 

actgaaggga gggaagagct gccttcgcct gggacaaaga aaacatgtgt atacacatgg 240

gtcaagcagt gctggtctgt ggctgcctgt ccagaggaat ggaaatatcc cttgtcttta 300 

<210> 183 
<211> 300 
<212> DNA
<213> Homo sapiens 



<400> 183 

cggacccatc ggagcgtaac ctggatctcc gcaggcctgg cggaggccgg ccacctggag 60 

gggcattgct tggttcgcgt ggtagcagag gagcttgaga atgttcgcat cttaccacat 120 

acagttcttt acatggctga ttcagaaact ttcattagtc tggaagagtg tcgtggccat 180 

aagagagcaa ggaaaagaac tagtatggaa acagcacttg cccttgagaa gctattcccc 240 

aaacaatgcc aagtccttgg gattgtgacc ccaggaattg tagtgactcc aatgggatca 300 

<210> 184 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 184 
ctgttttgca gatgaggaaa ctgaggtaca gaattcttag ggaacttacc caaaatggct 60

tttctgcact ctgccctttg gtattgtccc atgtgaattg tttaaaactt atgtgtatag 120 

tggcatgagt aggtgatttc agaaacagaa ctcacttttg ttgtttggtc ttaaaattag 180 

gaacttttct tcatctgggc ttcatttccc tgcaccttcc cagctttcta gtcatgcaag 240 

ccacatgtct ccacgtgagg ggttcattgg aaagcagcca cagagccacc ccctggctgg 300 

<210> 185 
<211> 260 
<212> DNA 

<213> Homo sapiens 
<400> 185 

attatagaga ttaatctcct ttgctcgaag tctatttaaa tattagtcac atctaaaaca 60 
tacttttaca gcaacatcta gactggtgtt tgaccaaaca actgggcatc atagctgaca 120 
cataaaatta accatcacaa ccatgttcta ggcactgttc ctcactgcct gagaagacac 180 
cgttatgttt attagggttt ttgagtttta tccacagctt ttggttatct gcaaccatgt 240 
ctcccacctt taacatagtt 260 

<210> 186 
<211> 300 
<212> DNA 

<213> Homo sapiens 

<400> 186 

gataaactct tcagtgacga atattagaaa aagttagtta tacatttgag gaaaactata 60
aaagtaccaa taatgagtag gaaatcactt ctgcagtatt tttggagcat tttccttaag 120
catgacataa aagccaaagg tcacaaggga aaaaactgat agatttgtct gtgatattga 180
gagatgtatg cacatataca tacaacagtc atagtaagac accgttagac aaaaggtgat 240
gtatgaaaaa gaggcaaaac aacaagaaga aaagattgaa aaaatgagag ctgaagacgg 300

<210> 187
<211> 300
<212> DNA

<213> Homo sapiens

<400> 187

aaaaagtaaa gcttttcatg agcacaaatc ccttgcattg tttgatgtta ctgatattcg 60
taaaatgaat attttttgtt ttgttttgtt ttattttttt gagacaagtc ttgctttgtt 120
gcccaggctg gagtgcaatg gcatgatctt ggctcactgc aacccctgcc ttgcgagttc 180
aagtgattct tctgcctcag cctcctgagt agctgggatt acaggcgctc accaccacac 240
ccagctaatt tctgtatttt tagtagacac agggttttac catgttggcc angctggtct 300

<210> 188
<211> 300
<212> DNA

<213> Homo sapiens
<400> 188

gagcattcct cctttgttaa cgaagcaaca tttacacaag atggacatta cattattagt 60
gcatgctctg atggcactgt aaagatctgg aatatgaaga ccacagaatg ttcaaatacc 120
tttaaatccc tgggcagcac cgcagggaca gatattaccg tcaacagtgt gattctactt 180
cctaaaaacc ctgagcactt tgtggtgtgc aacagatcaa acacggtggt catcatgaac 240
atgcaggggc agattgtcag aagcttcagt tctggtaaaa gagaaggtgg ggactttgtt 300

<210> 189 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 189 

ctaatatcca gaatctacaa agaactcaac aagaaaaaaa ccaaccccac aagcgggcaa 60

aggacatgaa cagacatttc ccaaaagaag acatacaagc aacctaaaat aatctaaaat 120

aatttttaaa aagaaaaaat gcttgacaga gttttgatag tacttagtaa aaagttatat 180

ctagtggctt tttgtttgtt tgtttttgtt ttgtttttaa gaaatagtct ctgtttccca 240

agctggagta cagtggcgca atcttggctc actgcaacct cgaactcctg ggctcaagcg 300 

<210> 190 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 190 

aaccactatg gaggcatgat tggtggccac tacactgcct gtgcacgcct gcccaatgat 60 

cgtagcagtc agcgcagtga cgtgggctgg cgcttgtttg atgacagcac agtgacaacg 120 

gtagacgaga gccaggttgt gacgcgttat gcctatgtac tcttctaccg ccggcggaac 180 

tctcctgtgg agaggccccc cagggcaggt cactctgagc accacccaga cctaggccct 240 

gcagctgagg ctgctgccag ccagggacta ggccctggcc aggcccccga ggtggcccca 300 

<210> 191 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 191 

gcggcgctga cccggccggc cccacacccg ctcttcctct tctttgccgc ggactccctt 60 

tcctgcctcc aagacctggt gtctcccact gtgagcccag ctgtcccaca ggcagtcccc 120 

atggacctag actcaccttc cccttgcctc tatgaacctc tgctgggccc agcccctgtc 180 

ccagctcccg acctgcactt cctgctggac tcaggcctcc agctccctgc ccagcgagcg 240 

gcctcagcca ccgcctcccc tttcttccgg gccctgctgt caggcagctt tgcagaagcc 300 

<210> 192 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 192 

gacagaccgt tgagaggacg tggaggcccg agagggggta tgcgcggcag aggcagaggt 60 

ggccctggga acagagtttt tgacgctttt gaccagagag gaaagcgaga atttgaaaga 120 

tatggtggga atgacaaaat agcagtcaga actgaagaca acatgggtgg atgtggagtt 180 

cgaacctggg gatcgggtaa agataccagt gatgtggagc caactgcacc gatggaggaa 240 

cccacagtgg tggaggagtc ccagggcacc ccggaagagg agtctccagc caaagttcct 300 
<210> 193

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 193 

ctcaagaaag gagaagtttt tttgtatgaa attggaggaa atattgggga acgctgcctt 60 

gatgatgaca cttacatgaa ggatttatat cagcttaacc caaatgctga gtgggttata 120 

aagtcaaagc cattgtagaa gacttaacaa gctgcagata accatgtgga cttctgtcat 180

aattcttgct gagtcaagag tgtaaataaa agaaatggca ggactcatat tattcagttg 240 

tacccaagta tttaaaaatg actctcttaa gccttaaaaa gtcatagatt tgtgctgctg 300 



<210> 194 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 194 

cagaagctta gtcatatttc aaaatgatca aatatcaaga aaaattctga gctgcataac 60 

ttgtataaag taattttcag tgattttttt catggttatg ataaaagaac tggattagca 120 

gaaactttta ccctgaatca agatttaatt tttctttgag ctcatcttaa ggatatcgga 180 

acatagggag caaacgatgg tgtggctgcc tcagtgcttg atttttaacg gttttgaaga 240

gaatagttac atttcttctc ctagtaagaa ctaataaata cattaacaga aatgaattcc 300 

<210> 195 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 195 

ctctactaaa aatacaaaaa ttagctgggc gtggtggcac acacctgtaa tcccagttac 60 

ttgggaggct gaggcacaag aatcgcttga acccgggagg cggaggttgc agttagccaa 120 

gatcgccctg ctgcactcca gcctgggcaa cagagggaga ctctgtctcc aaaaacaaaa 180 

acaaaaactg ttagtgaagg ttccctggga cttttgatat tttaaaaatt gatcttatga 240 

ctaagtagat aaattcattg ccataatgag gctagctccc agataaacag cgtattttct 300 



<210> 196 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 196 

tggatactga caatggtggc aggcatttca agccttttaa attagtactt tttgtcgtct 60 

tgcttattaa aattttgtta attttagcaa agaccaattg ttgtgataaa ctggtgtttt 120 

ttggatgctt caagcacacg ttaaccaatt ttttaattcc ccttttggtt cctcccattg 180 

ttctaaaata ggactttcat attattaaaa cctcaaaaga tgatccaccc aggatgaaca 240

aagatcacca aggggaaaga aaacattttt tatctttaca gaaaacatgt taagattata 300 



<210> 197 
<211> 300 
<212> DNA 
<213> Homo sapiens



<400> 197 

atccagatgg gatacctcta aacacgaaaa gaaagaagat tccattagtg aatttttaag 60 

tttggctaga tcaaaagccg agccacctaa acaacagtcc agccccttag taaacaaaga 120 

ggaagagcat gcaccagaat catccgcaaa tcagacagtc aacaaagatg tggacgcaca 180 

ggctgaagga gaagggagcc gcccatccat ggacttattc agggccatct ttgccagttc 240 

ctcagatgaa aagtcctcat cctccgagga tgagcaaggt gacagtgaag atgatcaggc 300 

<210> 198 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 198 

gcaacatttg tctacaactc tactgtaaaa ttggaaatgc ttttccacag aaaaacctct 60 

caaaatgctg aatgcaaaag ttgggatcac agaaacattg tgcctatttt tggtctgctg 120 

gaaactgtat ttttacaagg taatccctgt tttcaatata gttcctgtct tgccactggc 180 

ggttttcttg tagcattttt ctagttctga gattgctact acccaaagta ttcatttctt 240

tcttactggg gtgtcctctg tcttcacagc ctgcttctgg attgtaggtt ttttcctttc 300 

<210> 199 
<211> 300 
<212> DNA 

<213> Homo sapiens 



<400> 199 

gcaacatttg tctacaactc tactgtaaaa ttggaaatgc ttttccacag aaaaacctct 60 

caaaatgctg aatgcaaaag ttgggatcac agaaacattg tgcctatttt tggtctgctg 120

gaaactgtat ttttacaagg taatccctgt tttcaatata gttcctgtct tgccactggc 180 

ggttttcttg tagcattttt ctagttctga gattgctact acccaaagta ttcatttctt 240 

tcttactggg gtgtcctctg tcttcacagc ctgcttctgg attgtaggtt ttttcctttc 300 

<210> 200 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 200 

agtagaaaaa tacaaagact gtgatccgca agttgtggaa gaaatacgcc aagcaaataa 60 

agtagccaaa gaagctgcta acagatggac tgataacata ttcgcaataa aatcttgggc 120 

caaaagaaaa tttgggtttg aagaaaataa aattgataga acttttggaa ttccagaaga 180 

ctttgactac atagactaaa atattccatg gtggtgaagg atgtacaagc ttgtgaatat 240 

gtaaatttta aactattatc taactaagtg tactgaattg tcgtttgccc tgtaactgtg 300

<210> 201 
<211> 300 
<212> DNA 

<213> Homo sapiens 



<400> 201 
ttctactttg ggtccgcgcg aagcccactc acgtgtgatc tgtgttgccc ctctcggtgg 60

tcccaggcga tccagccatg ccccctgccc ctctgcccag atgcttcagg ggcccggctt 120 

ttcaggcttg ccctcaccag cggccgtcag ccgacactca gggatgtagc taacaccact 180 

ccgccagtgc tttcagtagg aagagctgag gctgcctggg aggcccgggg cgaccggaaa 240 

agggctctct caagttctga aaagagaatc tgccaccaga tcgaatttcg acccctgagc 300 

<210> 202 
<211> 281 
<212> DNA 
<213> Homo sapiens 

<400> 202 

ggccatggga cagttgcaac agcagttaaa tggactgtca gtcagtgaag gtcatgattc 60 

tgaagatatt ttgagcaaaa gtaacctgaa cccagatgcc aaggagttta ttccaggaga 120 

gaagtactga gccgagaaag ctttgaggaa gacttgtctg tccccacatc tggggatagt 180 

aatgcccaaa atggtggagc tgaagagggg gatggggcgg gcgaggggtg cacagcggga 240 

aggggagtgg tggtctcacg atactgtgac tctgagtaac t 281

<210> 203 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 203 

gccctcagcc acccccatcc ctgccccttc tgagactcac agcacccctt tccttcctct 60 

cctcccacct cctccctcag cccctcattc tccttgggaa tctgcagagg gctctgggac 120 

tcactgccgg atgtgaaatc caggcgtcag ctgtttccta ggcaagggca ggaaagtggt 180 

ctccagccct tgctccactc atgcctgggg gcctggggct gagtggtatc cctacctggc 240 

ctccccctgg cctctgggcc tccagcgctg ggtttgtcga gtgagagaga gagaggagct 300 

<210> 204 
<211> 269 
<212> DNA 
<213> Homo sapiens 

<400> 204 

gcggactctc aggacgaaaa gagccaaacc tttttgggaa aatcagagga agtaactgga 60 

aagcaagaag atcatggtat aaaggagaaa ggggtcccag tcagcgggca ggaggcgaaa 120 

gagccagaga gttgggatgg gggcaggctg ggggcattgg gaagagcgag gagcagggaa 180 

gaggagaatg agcatcatgg gccttcaatg cccgctctga tagcccctga ggactctcct 240

cactgtgacc tgtttcagga gcctcatat 269 

<210> 205 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 205 

ttctactttg ggtccgcgcg aagcccactc acgtgtgatc tgtgttgccc ctctcggtgg 60 

tcccaggcga tccagccatg ccccctgccc ctctgcccag atgcttcagg ggcccggctt 120 

ttcaggcttg ccctcaccag cggccgtcag ccgacactca gggatgtagc taacaccact 180 
ccgccagtgc tttcagtagg aagagctgag gctgcctggg aggcccgggg cgaccggaaa 240
agggctctct caagttctga aaagagaatc tgccaccaga tcgaatttcg acccctgagc 300 

<210> 206 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 206 

gggattacag gcatgaccca ccgcgcccag cctgtaattt cttatacttg gtattttgta 60 
cttggattat gcttctgata cgctataatt atttatgtac atgttatttt tcttcaatag 120 
actgtgaact cttcgaatgt aggactccta gagctagata ctcaattatt ttttattaaa 180 
ttgaatgact tgaaactaca gatcctttat ttaaacttcc caaatttctg ctttatctag 240 
gcaactcttt aaattctttg atctcatgta gattccaaag gctgaaataa ttgagatttt 300 

<210> 207 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 207 

tcctgaagct cggggggctg caggtcctga ggaccctggt gcaggagaag ggcacggagg 60 

tgctcgccgt gcgcgtggtc acactgctct acgacctggt cacggagaag atgttcgccg 120 

aggaggaggc tgagctgacc caggagatgt ccccagagaa gctgcagcag tatcgccagg 180 

tacacctcct gccaggcctg tgggaacagg gctggtgcga gatcacggcc cacctcctgg 240

cgctgcccga gcatgatgcc cgtgagaagg tgctgcagac actgggcgtc ctcctgacca 300 

<210> 208 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 208 

attccaaagg tttcaaagaa cttggtcata aatatgataa tgagaagaca aagtatttat 60 
attaaaacag tttagtagcc ttcagttttg tgaaaatagt tttcagcaca gaaactgact 120 
tctttagaca aagttttaac caatgatggt gtttgcttct aggatataca ctttaaaaga 180 
actcactgtc ccagtggtgg tcattgatgg cctttagtaa attggagctg cttaatcata 240 
ttgatatcta atttctttta accacaatga attgtcctta attaccaaca gtgaagcact 300

<210> 209 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 209 

gagacagcag cccccaggga atgaagctga tgccagagtc agacccgagg aggaagagga 60 

gccactgatg gagatgcggc tccgggatgc gcctcagcac ttctatgcag cactgctgca 120 

gctgggcctc aagtacctct ttatccttgg tattcagatt ctggcctgtg ccttggcagc 180 

ctccatcctt cgcaggcatc tcatggtctg gaaagtgttt gcccctaagt tcatatttga 240 

ggctgtgggc ttcattgtga gcagcgtggg acttctcctg ggcatagctt tggtgatgag 300 
<210> 210

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 210 

gtaacgtgac acgtatttta cttcttttag taggcggaca cactttctta aagtggtaat 60 

acgtcatggc cctgctataa ggtagtagtt ctagaagact gtttatctaa taattcagac 120 

taaagctatt tatattgctg tgacaccacg tggaaaactt ttataattcc atcttatttc 180 

tgatgtatat gttttatttt ctctgccttc ataagaacta aaaaccaaag ttatttacgt 240 

gaaaacaaga tttttgtttg agttcattta cttgagatat gtttaaaaaa tccaccttct 300 

<210> 211 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 211 

gtccgtcagc tggtagcttt cattcgtaaa agagataaaa gagtgcaggc gcatcgaaaa 60 

cttgtggaag aacagaatgc agagaaggcg aggaaagccg aagagatgag gcggcagcag 120 

aagctaaagc aggccaaact ggtggagcag tacagagaac agagctggat gactatggcc 180

aatttggaga aagagctcca ggagatggag gcacggtacg agaaggagtt tggagatgga 240 

tcggatgaaa atgaaatgga agaacatgaa ctcaaagatg aggaggatgg taaagacagt 300 



<210> 212 
<211> 300 
<212> DNA 

<213> Homo sapiens 



<400> 212 

gcctgctgct tcatgccgcc ggcgtcctgc tccacgtctc tgtgctgctg ggccctgcac 60 

tgtcggccct gctgcgagcc cacacgcccc tccacatggc tgccctcctc ctgcttccct 120 

ggctcatgtt gctcacaggc agagtgtctc tggcacagtt tgccttggcc ttcgtgacgg 180 

acacgtgcgt ggcgggtgcg ctgctgtgcg gggctgggct gctcttccat gggatgctgc 240 

tgctgcgggg ccagaccaca tgggagtggg ctcggggcca gcactcctat gacctgggtc 300 

<210> 213 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 213 

ggtatggttg gagtgtagga atgaatattc atgaaatgtt tcttattgct tttccttccc 60 

taattcatac aatgaatgta tttggaatac ttacatatta taaaataaac tatacctctt 120 

caagaggtat cctgttctgt aagatcagat gtttttattg caggtcaata taatactgcc 180 

agagacagaa aataccccct tatcagtccc ttagtgcctc tttctgtttg tggcatggtg 240 

agaaaaccca tgctgaaaag attgtacttt gtgatcccaa tcagagggag gagctaatct 300 



<210> 214 
<211> 300 
<212> DNA 
<213> Homo sapiens



<400> 214 

ggaaagggcc ctaaaagaga tgaacaatac ccgtatcatg tggaatgaat tagaaaccct 60 

tgtcagagcc catatcaaca actcagagaa acatcaaaga gtcttggaat gtctgatggc 120 

atgcaggagc aaacccccag aagaggaaga acgaaagaaa cgaggaagaa agagggaaga 180 

caaagaggac aagtcagaga aagcagtgaa agattatgaa caggaaaagt cttggcaaga 240 

ctcagagaga ttaaaaggaa tcttagaacg tggaaaagaa gaattggctg aagctgagat 300 

<210> 215 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 215 

atacttttta aacctttttt ggcagctcag atggtgtaaa ttttaaaatt ttgtataggt 60 

atttcataac aaaaatatgt atttcttttt tgttatttta tcttgaaaac ggtacatatt 120 

ttagtatttg tgcagaaaaa caagtcctaa agtatttgtt tttatttgta ccatccactt 180 

gtgccttact gtatcctgtg tcatgtccaa tcagttgtaa acaatggcat ctttgaacag 240 

tgtgatgaga ataggaatgt ggtgttttaa agcagtgttg cattttaatc agtaatctac 300 

<210> 216 
<211> 300 
<212> DNA 

<213> Homo sapiens 



<400> 216 

gcagatattt actgaaggaa tctaggttgt attttcagtg gacaatggga ataaagcatt 60 

tctaaagcac cgactggaga ggaaggcaac agagacaagg agagaagccg agagacatgt 120 

ctgcgtgctg ccacgcatct gagcgattgc tctgtgaaga gttgtacact gaacattttc 180 

aggggaggct gtttacccag gcaatgtcct caaacaagcc tgtgccgggg agtcctggaa 240 

tctgtgccag gactgtgttt ttagcccttc acctctcagc tttagcagga catgaaccag 300 

<210> 217 
<211> 300 
<212> DNA 

<213> Homo sapiens 



<400> 217 

cccccatctt cactggttat tccacttatt taaaatgtcc agaataagca aatctccata 60 

tagaggaagt agattagtgg ttgcttcggg atgggaggaa tgggaagatt gaggtctttc 120 

ttttgcagtg ataaaaatgt cctaaaattg actgtagcga tggtcacaca actctgaata 180 

tgcttaagac cattgaatta cacactttac gttggtgaat tgtatggtat gtaaattata 240 

gttcaataac atagttacaa aagataatca aaagcatgaa agcactgttg atgtggtttg 300 



<210> 218 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 218 
acggcctggt ggagcagctg tacgacctca ccctggagta cctgcacagc caggcacact 60

gcatcggctt cccggagctg gtgctgcctg tggtcctgca gctgaagtcg ttcctccggg 120 

agtgcaaggt ggccaactac tgccggcagg tgcagcagct gcttgggaag gttcaggaga 180 

actcggcata catctgcagc cgccgccaga gggtttcctt cggcgtctct gagcagcagg 240 

cagtggaagc ctgggagaag ctgacccggg aagaggggac acccttgacc ttgtactaca 300 

<210> 219 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 219 



agagaacctg gagatgtagc tactccacca 60

ccagcaaagg aaatggagaa ggtagaggag 120

ctgacacagg attatgaaga atggaaaaga 180

aaggctacag cagagtgatt tcagcttcca 240

ttgccatctc caggaagact tgacggcttt 300



<210> 220 
<211> 260 
<212> DNA 
<213> Homo sapiens 

<400> 220 

ggtaagtcag gtgattgaat cccggaaagg ttcattgtct tcaagctcac aatactattt 60 

tgggacaaac agttgtctag tgtttggact catgaaccct gattcttgag ggtggtattt 120 

tactgctttt gtgatttggt ttcaacatat atagtctttt ctccggagtt accttaggtc 180 

agtggccagt gtttcagccc ctggaaaggg catgggctgc cactgaggtt ggtcacaggc 240 

ctctcagctc atggtgggag 260 

<210> 221 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 221 

gggttccatc ccttccaccc aggaaatgga ggcacgactt gcagcgttgc agggcagagt 60 

tctaccttct caaacccccc agccggcaca tcacacaccg gacaccagga cccaagccca 120 

gcagacacag gatctgctaa cgcagctggc agctgaggtg gctatcgatg aaagctggaa 180

aggaggaggc ccagtgaccc tccaggacta tcgcctccca gacagtgatg acgacgagga 240 

tgaggagaca gccatccaaa gagtcctgca gcagctcact gaagaagctg ccctgatgag 300 

<210> 222 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 222 

gcggtgaccc acgtgtcctg catgattgcc ctactgctgt ggagacctcg tgctgaccat 60

ctggcagtgt tcttcgtatt ctctggcctg tggggcgtgg cagatgccgt ctggcagaca 120 

caaaacaatg ctctctacgg cgttctgttt gagaagagca aggaagctgc cttcgccaat 180 
taccgcctgt gggaggccct gggcttcgtc attgccttcg ggtacagcac gtttttgtgc 240
gtgcacgtca agctctacat tctgctgggg gtcctgagcc tgaccatggt ggccgtatgg 300 

<210> 223 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 223 

gccccctctg gatcctgagc tccctgctct agacagtgat ggtgattcag atgatggcga 60 

agatggtcga ggtgatgaga aacggaaaaa taaaggcact tcggacagct cctctggcaa 120 

tgtatctgaa gggggaaagc cctcctgaca gccaggagga ctctttccag ggaagacaga 180 

aatcaaaaga caaagctgcc actccaagaa aagatggtcc caaacgttct gtactgtcca 240 

agtcagttcc tgggtacaag ccaaaggtca ttccaaatgc tatatgtgga atttgtctga 300 

<210> 224 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 224 

ctgcggccgc aggagctgtg gcggttttcc taatcctgcg aatatgggta gtgcttcgtt 60 

ccatggacgt tacgccccgg gagtctctca gtatcttggt agtggctgag tccggtgggc 120 

ataccactga gatcctgagg ctgcttggga gcttgaccaa tgcctactca cctagacatt 180 

atgtcattgc tgacactgat gaaatgagtg ccaataaaat aaattctttt gaactatgat 240 

cgagctgata gagaccctag taacatgtat accaaatact acattcaccg aattccaaga 300 

<210> 225 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 225 

gccccgctcc atgagcagtg actccccagc tcctcctggc accagtcccc agggctctcc 60 

tgttggtagt tcctgctttt cttcttggaa attcctcgtg gacctcgaga tctttaccct 120 

aaaatagttc tgttgaattt caccctggca atgtaaattg atagcttatc ttcacagatg 180 

ccagacaatg gacaactcac catcagtcct ctgctcacct gagacaaatg catgtctgat 240 

tgcttcctct gccctattgt ttatgtgaaa atgcagattc actgagccag actaaggcat 300 



<210> 226 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 226 

tatataacaa cttttgcttt caaagttggg tgggactaga acacacaatg gaaggatgga 60 

gtcaggagac ctggattctt gtgcccgctc tggcttttac agtctgccta actctatgca 120 

gtcacttcct gccagcctgt ttccttacct acaagaggga gagacactcc ctggccagcc 180 

tagttctcag ggtgaacgaa aggtcattat cactgcatcc tctagtcatt tgcttcttcg 240 

ctaattaaca catcttgagc acctgcgatg ttccaggaac aggagatggc agcgtgcaag 300 
<210> 227

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 227 

ttgctgaaat gggcacttct gctgtgatgc tagtgttgat tttgctctca gatgaacaca 60 

atgtctcata ctaaccaaga agcaagaaaa gccccatgca ttcatttttc acttggagtg 120 

acaatgggag aggtcaggaa tcaagttcac tttcaagatc taagggagtc cactatctgt 180 

gcaattgtat ttggcttttt tttgcactgt ttcaatgctg gtaattgaaa ccattttaat 240 

atatttggtt gtattcactt tatatgtcct tccaaaaatg ttgttgtgta cataccatgc 300 



<210> 228 
<211> 300 
<212> DNA 

<213> Homo sapiens 



<400> 228 

gctatgtatt gtgtcctacc atgaattcac tccatgctag ccacattggc ctgtatggct 60 

attccttgga cacacctagg atgttcttgc ctcttagctt gcctaccttt ctctcatcat 120 

ttgggcctca gcgaggatat catctcctca gagaagcctt ctgtgaccat gctatctaaa 180

atactccagc acttcagtca ccctttatcc cattactctg ctttttcaga aacattggtg 240 

ctccctgaaa catatttgtt tacttgctta gtgtcttttc tcccgcacta ccatgtaagc 300 



<210> 229 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 229 

gattttcgaa actcttcagc tacttgccct tttttatctg aaaccatcat accttctgaa 60 

agaaaaaagc atatcttcat tgacataaca gaagtgagat ggcccagtct tgatacagat 120 

ggtaccatga tatatatgga gagtggcatt gtgaagataa catctttaga tggtcatgca 180 

tacctctgcc tgcccagatc tcagcatgaa tttacagtac attttttgtg taaagttagc 240

cagaagtcag actcatctgc agtgttgtca gaaacaaata ataaagcccc aaaagataaa 300 



<210> 230 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 230 

acttcttgtt tgcctttttt ataaggaaat gttggagagt tacatcattg ctaatgtaga 60 

aatgttaagt ggaaaaatat acagtttggt aaaataaact agattctaca tttatttgtg 120 

ggtttttttc ccctcctttc tttccacagc acttttgata tcaagcaagt ggcttccttt 180

ttgagatatt aaaaaaaaaa agaaaaggaa aaaagtaaat gannnnnnnn nnnnnaaccc 240 

tttctnattn gnattngttt nagnattgng aagttgngtt aaanagtnct agntagaaat 300



<210> 231 
<211> 300 
<212> DNA 
<213> Homo sapiens



<400> 231 

tgattctttt tgttnttttt tttgatattg acaaaagctt anncnttncn attaaaaang 60 

ccactaatta gactttttan ntaaaaaang tagggggttt taaaactact ttcctactac 120 

caaaaaatca naaagtatct agctttctaa atngggaaag caagcaatgt tataaaaacn 180 

ctgaaggaat ctctttcttc gggacctttt gttaaactcg gttnaagctg taaaccttat 240

ttaaaataaa atttaccaca naacaggaaa tanaacctgg ggaanactcn aaatacncct 300 

<210> 232 
<211> 300 
<212> DNA 

<213> Homo sapiens 



<400> 232 

ggaagccaag gcctggagct gcaggtcccc cggcatctct ctctgtcccg gcagcccagg 60 

atggcctggt gcccccacct gctgcagcag gagccccaag gagtgctagc tgagggtggt 120

tgctggggtg gtcctcatgg acagtgaggt gtgcaagggt gcactgaggg tggtgggagg 180 

ggatcacctg ggttccaggc catccttgct gagcatcttt gagcctgcct tccggtggga 240

gcagaaaagg ccagaccctg ctgagttaga ggctgctggg atccactgtt tccacacagc 300 

<210> 233 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 233 

gaggaagagg cctgctccac ttgtctggga acctgggcag gaggcacaga ggaagccaag 60 

gcctggagct gcaggtcccc cggcatctct ctctgtcccg gcagcccagg atggcctggt 120 

gcccccacct gctgcagcag gagccccaag gagtgctagc tgagggtggt tgctggggtg 180 

gtcctcatgg acagtgaggt gtgcaagggt gcactgaggg tggtgggagg ggatcacctg 240

ggttccaggc catccttgct gagcatcttt gagcctgcct tccggtggga gcagaaaagg 300 



<210> 234 
<211> 300 
<212> DNA 

<213> Homo sapiens 



<400> 234 

ggaacataat tagcttactg atttgatggt tctgtgtagt tcctgaaact cttggctctt 60

gtttgccttt ctttaactct ggctccttct ccttcttctg tttgtgtatc tgtttaattc 120

attgagtgag gaggacaggc agaactgtgt ctgccaagga ccggatgtac ttctttcctt 180 

gctcttggtt ttttgctcac ttttatatgt aaggtattag tacaaaccta aaggagagaa 240 

agtagaggat cagatcattg ggacttgttc tggtttcaag aaagaattaa caaattgccg 300 

<210> 235 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 235 
gttggctcaa gggccaccag aagcatttct ttattattat tattttttaa cctggacatg 60

cattaaaggg tctattagct ttctttccgt ctgtctcaac agctgagatg gggccgccaa 120 

ggagtgcctt ccttttgctc cctcctagct gggagtgacg ggtgggagtg tgtgtgccca 180 

ggtgggggtg tctcctggct gggaaggagg gaaagggagg gagagttttg cgggggttgg 240 

cagtggagag caggctggag aggagatggc taatagctgt ttaatggaaa cctgctgggc 300 



<210> 236 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 236 

gaatcatcga aggttgagac cgtgtctagt tacatagtta taaataccca tctatgtact 60 

gatgccttct aaatgtctat ctccagtatg gtcttttcct ttaagctcta gatccattga 120 

caccctcacc atctctaaaa ggcatttcaa actgaacaca tctgatacag aacttttcat 180 

ttccttccca actttgccca cgccagcctg ctcctccttc acgctttcca cttagtatat 240

gatcccacta ttcactcagt ctctgaagct taaaacctag gattcatcct tgactactgt 300 



<210> 237 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 237 

caggacatgg agcagtacct gtccactggc tacctgcaga ttgcagagcg gcgagagccc 60 

ataggcagca tgtcatccat ggaagtgaac gtggacatgc tggagcagat ggacctgatg 120 

gacatatcgg accaggaggc cctggacgtc ttcctgaact ctggaggaga agagaacact 180 

gtgctgtccc ccgccttagg gcctgaatcc agtacctgtc agaatgagat taccctccag 240

gttccaaatc cctcagaatt aagagccaag ccaccttctt cttcctccac ctgcaccgac 300 



<210> 238 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 238 

cactggctac ctgcagattg cagagcggcg agagcccata ggcagcatgt catccatgga 60 

agtgaacgtg gacatgctgg agcagatgga cctgatggac atatcggacc aggaggccct 120 

ggacgtcttc ctgaactctg gaggagaaga gaacactgtg ctgtcccccg ccttagggcc 180 

tgaatccagt acctgtcaga atgagattac cctccaggtt ccaaatccct cagaattaag 240 

agccaagcca ccttcttctt cctccacctg caccgactcg gccacccggg acatcagtga 300 

<210> 239 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 239 

atttcctcca gtcctgggcc ccatccttga gggccttccc agccagccag caggagaggc 60 

aagaactggg ggaacacagg aacctagggg aggaggggag cgctgggcat cctcaggctg 120 

gcggccaagg cctgcccctg gaggcactag aggagggcat ctgtctgtgg gagcccagag 180 
gctgcaggga ggaggaggag ggaggtatct ggtgtgagcg ttgcccctgc gacatttggg 240

accacacagg tgggcttcct tattccctga caaagcctct gtttccagct cttccgccct 300 

<210> 240 
<211> 274 
<212> DNA 

<213> Homo sapiens 
<400> 240 

catgagtgat attttggtct gggtttcctc ttaagatttt agtttgtctg aattaaggaa 60 

aaatgttttt aatatacatt cttattttgt cccacccctc cagaaataag ctggaaatct 120 

taactttttg gggggtcttt tttggtgttt taatgggccc agaactgtgg tttaaatttt 180

tatgtatgta ttttcttttt tgtggagtat aaatttaaaa actggatttg ggacctaaaa 240

tactcctcag gttgatgtat tcatgaaagt ttta 274 

<210> 241 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 241 

ctgttgcctg ccaagctcag ggcccattta tcatgcatct tcccatcctt gtctccccca 60 

actgtccctt acctgagtca caatttcgcc aaagccaaag ggattgtcct aagccaatgt 120 

tgatttatca ctcttcctgc tcaaaagccc ccaagatcac ctatcaatca cctacttgag 180 

tgcaagcttt gactctgtca cctgacattc aagtccccct ctgcccccat gccagtctta 240 

tcccctcccc tacatatgcc ctatgcctca gtttgccttc cctccacttt aaaaagcctc 300 

<210> 242 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 242 

ccgctggcta tgtggacgct ggggcagagc caggccggag tcgaatgatc agccaggaag 60 

agtttgccag gcagctacag ctctctgatc ctcagacggt ggctggtgcc tttggctact 120

tccagcagga taccaagggt ttggtggact tccgagatgt ggcccttgca ctagcagctc 180

tggatggggg caggagcctg gaagagctaa ctcgtctggc ctttgaggta atggggggtg 240

gcggtggtgg ggggtgctta gtggctatgc tcaccccgct ccaggaggcc tattttggta 300 

<210> 243 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 243 

caagatctgg aggaatgcag agaggaactt gatacagatg aatatgaaga aaccaaaaag 60 

gaaactctgg agcaactaag tgaatttaat gattcactaa agaaaattat gtctggaaat 120 

atgactttgg tagatgaact aagtggaatg cagctggcta ttcaggcagc tatcagccag 180 

gcctttaaaa ccccagaggt catcagattg tttgcaaaga aacaaccagg tcagcttcgg 240 

acaaggttag cagagatgga tagagatctg atggtaggaa agctggaaag agacctgtac 300 
<210> 244
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 244 

agtaaatttt ttatgcatat tttattgcaa taaaaaatga aaacagtttc aatctaggag 60

gattttggat gcatctatgc cttgagaaat gaatggtttg atgtaaatgc atggtagcaa 120 

gaataaataa ttatgttaat tcatataata tgttatatat agttttaaag aaaattctat 180 

cactgtcttc ctatgggtag ggctataatg tccagttctt tcagggatta agagggtagg 240 

gtctgaagtt aatccttgtt tgtcgtaatg ttattaattt attcaaccaa gacttaattg 300 

<210> 245 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 245 

tagacataga aaacatacag taagaatatg gtattataat cttacgggac cactgtcaaa 60 

tacgcggtct gtctttgaaa agttgtaatg cggcgcatga ctataaatac ctagctggtt 120 

agcatttaca ttccttgcca gggagtttga aatttatact atagaaataa ctttaggttt 180

taggtagagt taaagaggta aagcacatgt tgccacaacc caggaaagta tttttaagaa 240 

agattggatt ttcctacctt tagagatcta aaaaaaattt aatataaaaa atcattttgt 300 

<210> 246 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 246 

tggaatattt gctgtgaagg gagaaaggga gagaaaactc ttctgaggat catttgtctt 60 

ggtagtatag taaaaccaac cagctgaacc tttcaggcta caagagaacc cgggtcggta 120 

atgtcttttt aagaataatt tttaattgct tataacaagc atattttgtg gcatttgaac 180 

tatatttact gctccaatat ccgttatttt ccaaaggatt ttgtatcttt ttgaaaatgt 240 

ttacatcatc agatgatcca cagaattcac tttatgtgag atctcccgag agtttccatc 300 

<210> 247 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 247 

gtgttgctca gtgagcagac ccgactccag aaggacatca gtgaatgggc aaataggttt 60 

gaagactgtc agaaagaaga ggagacaaaa caacaacaac ttcaagtgct tcagaatgag 120

attgaagaaa acaagctcaa actagtccaa caagaaatga tgtttcagag actccagaaa 180 

gagagagaaa gtgaagaaag caaattagaa accagtaaag tgacactgaa ggagcaacag 240

caccagctgg aaaaggaatt aacagaccag aaaagcaaac tggaccaagt gctctcaaag 300 

<210> 248 
<211> 300 
<212> DNA 
<213> Homo sapiens



<400> 248 

gagaggatca cttgagctta ggagttcaaa tccagcctga gccaacataa caagactttg 60 

tctctaaaca aaacagttat tgtttaaaga atctgaaatc ttcatcttta attcaggtag 120 

caatgaatcg agcccaagtt tgtttgatat ccagttccaa gtctggagag aggcatcttt 180 

atcttattaa agtatcgaga gacaaaatat cagacagcaa tgaccaagag tcagcaaatt 240 

gtgatgcaaa agggctatca aagggaggct ttttacagag aactaaggaa gagaaggagg 300 



<210> 249 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 249 

ctagcctggg caatatagta cgaccctgtc tttactaaaa atgcaaaaat taaccacgta 60 

tggtggctca cacctgtagt cctggctact gaggaggctg atgcaggaga atcatttgaa 120

cccaggaggt caaggctgca gtgagctatg attgcaccac tgcaatccag cctggacaac 180

acagtgagac cctgcctcac aaaaattata ttctgatttt ctgagtccat gaacacattg 240

tccaaatgga tttttctagc tcctccaagt tacagatagt tccacgcaca cacagaactc 300



<210> 250 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 250 

aggaaggtgg aggggcagga acaggacgga caggccccgg gctctggcac atcctgggga 60 

acaagggacc acaaggacgg gggcagtctc cagacttccc ctgggcgctt gaccccaggc 120

cttgcagggg agagagccag ggcctccctc aggtctttgt tcatgctgtt ttccctgccg 180 

tggacaccct ttcccgctct ccgattctct aaatcctgcc ccatctccca gatcttgttc 240 

atgtccaagc ttttccagga agtcttagca gctcccacac cgcagagctc gagatgtctc 300 



<210> 251 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 251 

gaaggcagaa gtgtaaatga acatacagaa gaaggagaaa gcctgctgtg tttggcttgt 60 

tcagcagggt attatgaatt agcacaagta ttgcttgcta tgcatgctaa tgttgaagat 120 

cgagggaata aaggagacat aactcccctg atggcagctt ccagtggagg ttacttagat 180 

attgtgaaat tattacttct tcatgatgct gatgtcaact cccagtctgc aacaggaaac 240 

actgcgctaa cttatgcatg tgctggagga tttgttgaca ttgttaaagt gctccttaat 300 



<210> 252 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 252 
gcacttctct ctcactggaa agagaactgt tctcctttct ctttcttctg cctattaagc 60

ctctgctcct aaactcctca tgtgtgtctg tgtcctaaat tttcctggca tggcaggaca 120 

aaccccgggt atttaccaca gacaacaaaa ccgcttcact atgatgtatg catgctgcaa 180 

aggaagagac agaatcttgc tctatcaccc agctggagtg cagtggcacc attgcagctt 240 

actgcagcct caaactcctg gctcaaggga tccttcagct tcagcctcct ggttaactag 300 



<210> 253 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 253 

gtctgatgca ggagaattgc taaaacccag gagggagagg ttacattgag ccgagattgc 60 

gccactgcac tctagcctgg gcgacagagc aagactccgt ctcgaaagaa agaaagagaa 120 

aggaaattcc ccagggaagt acctcggctt atttcataaa caggtactga aggaagcaga 180 

ggcatgtgga ggacttcccc acctcgtgca gctatttggg ccgtggcatc tgaaatttct 240 

tatttcagag tcaccccttt gatgaccttg gcagtgaact gcagtcatct gtttaggcct 300 



<210> 254 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 254 

atgttacaga catgaaatat gaacagaatg ctaaaagaac ataaaagaat aagagctcct 60 

taaagattat aaataaatgg tgatgttaaa gtaatagcac cattggacga agctagggaa 120 

tcaacacttg acagaaagat acatattttt tttatacaaa ctacatatat ttgagcaatc 180 

aagtagtaga catagagaat tttcttttta tggaagtact ctaataagta aagggctgat 240 

agaattatat cagcattttc tagctcctgg ggaattatgc attgggcatc catggctgct 300 

<210> 255 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 255 

gctgcctgtg gcatagccac tgctgtacgt ttttggttgt tnttaagaaa ctcgatgaag 60 

aggggtgtca ttctgggctc ggggtggttg ccaatttttc accagaaagg gagccacccc 120 

ttgcaaccac ttctgtctcc gttagccccc cctctgccct cctccaagcc aaagcgtggc 180 

ctggcttttg tcttcccatt tagttttcct cttttaccct tccttttgtg cttaatttat 240 

taaaatagtt gctgtataat ttattttcat aaactataaa aaaatactaa atggttaaaa 300 

<210> 256 
<211> 300 
<212> DNA 

<213> Homo sapiens 



<400> 256 

acagtctcgg gtttcatatt ttgctgtttt tgatggacat ggaggaattc gagcctcaaa 60 

atttgctgca cagaatttgc atcaaaactt aatcagaaaa tttcctaaag gagatgtaat 120 

cagtgtagag aaaaccgtga agagatgcct tttggacact ttcaagcata ctgatgaaga 180 
gttccttaaa caagcttcca gccagaagcc tgcctggaaa gatgggtcca ctgccacgtg 240
tgttctggct gtagacaaca ttctttatat tgccaacctc ggagatagtc gggcaatctt 300

<210> 257 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 257 

atagaactag gcactgattt gtttatattt atcctgctcg agacacatga tgtttcatgt 60 

atctgtggct ttttatagtt taaaataatt tctggaaaag tcatagtcat tatctcttta 120 

accgctccct ctcttccatt ctctttgttc tctcttcctc gaactcctgt tagtcatttg 180 

atcctccata tctctgaata tttttgtatt tcttttatta tttatttctt gtctctgcta 240 

cattttacat tgagtaaaag tgggatgtga cagtgggaaa tcattagtga cttagaaatt 300 

<210> 258 

<211> 285 

<212> DNA 

<213> Homo sapiens 

<400> 258 

tactctatta tattgtgcat gctcctgatt tagctgctct tggcatcatt ggtcgcagtg 60 

gaaccttgaa atgcatctgg ctagatttat gctcaaatca ttctcagtta gccttttagt 120 

gcctcttcaa aggttttttt ttgtatgttt tctattctta ataaaagctt aggattaatt 180 

agaaagaatc tgatatggtt atgtttcccc ttgtgtacgc tgacctcatt catacgtttt 240 

tcatagtcca gtggtctaaa cgctttcaag agcccagctc cttgg 285 

<210> 259 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 259 

gccttctctg gcctcaccaa ttaggtcaaa tgttccttag aatgtgttgt ggggcatggt 60 

ctctccctgt gaggacctgt ccagctggac ctccgccttc ctgcgactgt attggtgtct 120 

ttccctctca agcctatgag ctctgcaagg gcagggaccc tgtatgattt tgcctatcgt 180 

atgtcctcca gcccccagca cagcgcctgg tgtccagtga gagctcagca aatactttgt 240 

gagttaagga caggcggctg ggtagatgga tcgtctgcct agacagggca gttattcgct 300 

<210> 260 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 260 

gaaaagggag ccgcgcagcg cctacgggag tccggcggca gcagccggta ccggcaacca 60 

cgggcagctc tcagggaatc tccgtcgtga ggccagaggc tccagtcccc gcgagtccag 120 

atgcctgtcc agcctccaag caaagacaca gaagagatgg aagcagaggg tgattctgct 180 

gctgagatga atggggagga ggaagagagt gaggaggagc ggagcggcag ccagacagag 240 

tcagaagagg agagctccga gatggatgat gaggactatg agcgacgccg cagcgagtgt 300

<210> 261
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 261 

tttgctttca gtggttggct ttcactgaaa gaaagtgtaa aaaaagtcag aatttatagc 60 

tttcactatg tccaagacta ggactgggtt ataaagattt tcttttgtga aggaaaataa 120

aagaaaattt gccactactg catttacttt actattgtaa acttaagatt cattccttag 180 

tctttggaat tttgatgtct caaaaccaga tgagtggaag tgctgaattt gcaaaataaa 240 

gctaagaatg cttaactctg cactttaagt tctactctga ccaaattgaa gatgagcaga 300 

<210> 262 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 262 

ttttttaaga gataaggtct tgctatgtta tctaggctgg cctaaacttc tgggctgaag 60 

tgatcctcct gtgtagctgg gactacaagc atgtgccacc aatgcctggc ttctcacact 120 

gttttgtaac atagatatgt gaagatgtgt attatagaat tgtttgtaat actgtagtgt 180

tgtaggcaat gtgactgtct atagggaagt ggacaggtta tttgtggtaa atactcatgg 240

aaaacggtca agcagttaaa agcaatcaat tatggtcacc cagcaatgca gataaatctt 300



<210> 263 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 263 

agaacaggga gaagagagga agagggagct gcaggtgcca gaagagaaca gggcggactc 60 

tcaggacgaa aagagtcaaa cctttttggg aaaatcagag gaagtaactg gaaagcaaga 120 

agatcatggt ataaaggaga aaggggtccc agtcagcggg caggaggcga aagagccaga 180 

gagttgggat gggggcaggc tgggggcagt gggaagagcg aggagcaggg aagaggagaa 240 

tgagcatcat gggccttcaa tgcccgctct gatagcccct gaggactctc ctcactgtga 300 



<210> 264 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 264 

ttaaaggtag ttttagaagg aagtacaaat tggctttcat cttgcaaaca atcgtttttt 60 

acttcattat cttaatttgc tttgtcactc ataaaaagga aaccatacct gagttgtaga 120 

caatgaggaa acacttgagg cttctgctgt gtgttctttt gttattgttg ttattgttgt 180 

tactcagtaa cttgaatatt gtttaatgtg ttgtaagacg tagagtttat ctcaagctgt 240 

taaaaatggt aatgtacaaa tgtgaataga cacttatcta tataatatgg gtaagttttg 300 

<210> 265 
<211> 300 
<212> DNA 
<213> Homo sapiens
<400> 265 

caggaaagtc ttcctagagg taatttttaa gctgattgtt ttagaattag tagaagcttg 60 

ccagatggaa aagtccaggc aaagtgtaac atgaatggga aaggccacag tctagaaatg 120 

gcagagtgtg ttcctagttt gtttgtttgt ttgtttgtac ctgccttgtt ccaggaagga 180 

tttaatgtgg tttatattcc agtcctttaa tgctggaagg gctgagatga gactgaaaga 240

tgggcaggaa gtatatcatc acaagctttg tgtttgatgt taatgtgtat gatttttata 300 

<210> 266 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 266 

tgtgccacca cacccagctc attattatta ttattattat tattattttg agacgaagtt 60 

tcactcttat cccccaggct ggagtgcaat ggtgcgatac tggctcactg caacctctgc 120 

ctcctgggtt caagcggttc tcctgccttg gcaggcacct gtagtgtcag ctactcgaag 180 

gctgaggtgg gagaatcgct tgaacctggg gggcggagat tgcaatggtg tggtctcggc 240 

tcactgcact cgagcctggc gacagagcaa gactctgtct caaaaaaaaa aaaaaaaaan 300 

<210> 267 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 267 

atataactct ggaggtcagg acataggaga tattgattca ggacttgcca gagtatggtc 60 

ttggggtgtg ccctgatatt acaaacaggg atcttagtgg ctaggtgatg aggccatggc 120

aaatgtagat ggaccaagat caatttgcct ttctagatga ggttttctag gtgaaatgtt 180 

tttgaaacta ttttgtagcc tagtataatt tataaaagta gagagaaact ataaatataa 240 

atttggaagg ggttagctaa aaggagaaaa cagcagaatc ttcatatata tagaaatgga 300

<210> 268 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 268 

cctacttatt ggatgttggc tctttggtgt catggagatg gctttactgt aggtttgtgt 60 

gtgttgcatt acttttcatt gggattgaac tgagaaataa caaacaagct ttaagtggga 120

aattaaaaaa aagaagtaac ctatgtagat ccaaacttaa aatgtgagaa attattgaaa 180 

tttcattttc tacaaacttg aaattagcct gctaattgta aagttgtttt aataatgctg 240 

acaaatgtca gttacgtttg caaaggagtg tatggttcta ggtatttgcc tactgttacc 300 

<210> 269 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 269 
cctacttatt ggatgttggc tctttggtgt catggagatg gctttactgt aggtttgttg 60

tgttgcatta cttttcattg ggattgaact gagaaataac aaacaagctt taagtgggaa 120 

attaaaaaaa agaagtaacc tatgtagatc caaacttaaa atgtgagaaa ttattgaaat 180 

ttcattttct acaaacttga aattagcctg ctaattgtaa agttgtttta ataatgctga 240 

caaatgtcag ttacgtttgc aaaggagtgt atggttctag gtatttgcct actgttaacc 300 

<210> 270 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 270 

cctacttatt ggatgttggc tctttggtgt catggagatg gctttactgt aggtttgttg 60 

tgttgcatta cttttcattg ggattgaact gagaaataac aaacaagctt taagtgggaa 120 

attaaaaaaa agaagtaacc tatgtagatc caaacttaaa atgtgagaaa ttattgaaat 180 

ttcattttct acaaacttga aattagcctg ctaattgtaa agttgtttta ataatgctga 240 

caaatgtcag ttacgtttgc aaaggagtgt atggttctag gtatttgcct actgttaacc 300 

<210> 271 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 271 

ccacatttaa gtgagatatg ggaaggagga gcagattgtt tttgaaggga ggaagagcag 60

ttacttaggg tcaaattaag ttgtaaaatc ccccccggga ttttgtatgt aagtcaaagt 120

gaattgtatt tggaagaaga actggggagc ccacctctgg tatttttttt atgtccctca 180

tatggacaaa taaacctctg gtattaaatg aattttcttt tgggggattc tatatattcg 240

ggatttcaac caccaaccta tctggttttt cccgctgaaa tgttgggtga tggaatcagg 300

<210> 272
<211> 300
<212> DNA
<213> Homo sapiens

<400> 272

gaacgcttcc attttatacc tgtgtctagt tagtttctgc ctatctatcc aagaagcttt 60

tatcaagggt ccaccatgtg ccagccactg aagtagatat aaatacaagg atgtgtaagg 120

tatggatgat ggtatacgaa ctgtcatctt actggatttg tccgctctgt taaagatacg 180

gttccgaaaa ctttttaaag ccctagagag ggctttaagg caatgtagca tcatatatag 240

aggcatcaac ctgttcatat ctttctattt aacagaactg tgcacctggg cacaagggtg 300

<210> 273
<211> 300
<212> DNA
<213> Homo sapiens

<400> 273

gaatggcgtg aacccgggag gcagatggtc ttaaagtggg gagacccggg ttacaggcct 60

gactgcatca ctaactcgct gtgtgtccct gggcaagtca gtgcagtgca gtagcctctc 120

cgtctccgac tgaggagcaa agccctcggc tcaagatcct cacctacttc acagggattt 180

gaaatagtgc agtcaacagg aaaagaaaag cgctatagaa atgctcgacg ctatcacttg 240

gggcccacgt ggaagtatca acgtataaat tggcccaggc agacagaagg atgcagggga 300

<210> 274 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 274 

ggaaccaggg gctgcagaac cagcccctcc ccaatgagga ccccctctgg acgcccctcc 60

ccatggagaa caccaggagc cacagacccc agaccacagg agcacacagg ggagggcacg 120

gggcggccgg ggcag£Tgtgt ctgctgcctc gtttatggga tttgctccgc gtctagcaca 180

ctgctgcctg cagtgctcct gtcccctgca gtggctactc tgggcctacg ggcctaatcc 240

tggttggcat gaaaatgtcc tgaggctact gtgacaaatt tccacaagct gagtggctta 300

<210> 275
<211> 300
<212> DNA

<213> Homo sapiens
<400> 275

ctttgggaag cagaggtggc aggatcattc cagcccagga gttcaagacc agcctgggca 60
acacagtgag tgagaccctg tctctattta agaaaaaata attaagaaat tttattaaaa 120
aagaagaatc aggaaaccaa gtccaaccca actaaacctc aaatgaacca gcccctaaca 180
cagatgaggg gatttgggac tgataagctc tgtgctgtgt ccatggcccg tcatttatca 240
aggctgcagc tttgtaaatg tggctatttt tatgttgtgt atagtttcta tcatttattt 300

<210> 276
<211> 300
<212> DNA
<213> Homo sapiens

<400> 276

tttgtatttt tagtagagac agggtttctt catgttggtc aggctggtct caaactccta 60

acctcgtgat ccgcctgcct cgacctccca aagtgctggg attacaggca tgagccacca 120

tgcccagcca aagatcattt ttttatatag acttcagccc tttgtaaata ttgtaactgg 180

ggagtataga gtagaaaaaa agtatagtta aaacatttgt tctacaaatt aacctttaaa 240

aatataatta ctgctaaaaa tagagtgctg ttacacttaa ggaaaattag tgccattttg 300

<210> 277
<211> 300
<212> DNA
<213> Homo sapiens

<400> 277

ctcacacagc atgtgtcaga tccatggggt aggagtcggc cagagacttg gtaacagaca 60

gattgctgga tcccacccct agactctctg attcagttag tttggggtaa ggcgcaagac 120

tgaatttttc acaagtttcc cagtggtgct gatacttctg gtccaggaac ttagtgggag 180

agaacgacta atctagacca tttcacttca cattctgagc ttcttgtaca ctgtcacact 240

gcatcctttt aacaatgcat tccctatcct attgcaatac tgacatctca tcaatatttt 300
<210> 278
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 278 

ctgacaactt gattgggttc tccttcaggt ttgaagcgcc ctcgagaagt gtctaaagga 60 

gacagttgat agccaaacaa cagttttgga ttcactgact gattatgaaa gaagcagtag 120 

actggtatca agaatcagtc agcaaggagg ccctcaccag acgccagtgc catgttcttg 180 

gacttctcag cctccatatt catgaactaa gtttttggaa tccttaggct tccacgtgtg 240 

gaaagcctga gctaacctac tggaggatga gccatcacct ggagcagatt caggccatcc 300 

<210> 279 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 279 

ggtaaaccta tttatataat agaaggatga ttataaacat ttaataaatt atatcaaata 60
gatattatat attaaatggg cagataatag aaatctgtcc aagcaaaact ctggataatt 120
tttatgttgc cttatttttt gttttctgtg aactccaaga aaaatgagat accagtttgg 180
aacagatgta atattgctga tttaacagtt tagggatact ccccaagttc aataattttg 240
ccaagataca aatttaaatg gaacctttta tgaagcttca tagtgtgtga agaacttacc 300

<210> 280
<211> 300
<212> DNA
<213> Homo sapiens

<400> 280

ataactgctt gcgaagatgt agtttctctt tggaaagcta cacacacgag attatacaca 60
tcaggcactg gaactatctg taatactgga acctctgcga agtgccaggt ataaagtttt 120
tcccactgcc aagcatccag agctttggga aatttggaaa tcagagagat cagggcattg 180
ttttgttcct ctgatgatga aagtgaaaag caagtactac tgaagtctgg aaatataaaa 240
gctgtgcttg gcctgacaaa gaggaggcta gttagtagca gtgggaccct ttctgatcaa 300

<210> 281
<211> 300
<212> DNA
<213> Homo sapiens

<400> 281

caccatcgaa tatttttatt tattttgaga gacagactct gtcacccagg ctagtcttaa 60
actgttggtg aatcttaagt gattctccca cctcagcctc ccaaagtgct gggattacag 120
gcatgagcca ctacccttgg ctgtgatcaa gtatttagtc tgttgttaaa tgtttactaa 180
atagtctgaa gtagagaaaa tagcacccaa tctaaaataa ggtgaggtct agtcacttat 240
ttaaatctac attttaagct atagtttact attagtttaa actttaagac aggtaatgtt 300

<210> 282
<211> 300
<212> DNA
<213> Homo sapiens



<400> 282 

gcaaccttcg cctcctgggt tcaagtgatt ctcctccctc agcatcccaa gtagctggga 60 

ctacaggcac gtgccaccac acccagctaa tttttgcatt tttagtagag gcagggtttc 120

atcatgttgg ccaggctggt ctcaaactcc tgatctcaag taatctgccc actttggcct 180 

cccaaagtgc tggcattaca ggaatggagc caccgcgccc agcctgattt ctttttttag 240

gtcttgtcag gaaagatatt gattcttttg attcgtgaac atggtttttg gtcgtcttta 300



<210> 283 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 283 

cccaggtagc tgagactacc cacaccttgg tcccagctac ttgggaggct gaggtgggaa 60 

aatcactttg cccaggaatt caaggccgca gtgagctatg attgcaccac tgcactccag 120 

gcaacagagt gagaccctgt cttaaaaaaa gaagggagaa agtgtcagat ggtgatgagg 180 

tctggggggg aaatagagaa tggggatcag gagtgtggat ggtggtattc cctcaccaag 240 

aggtgacatg tgagcaggga gctgggaggt gagggtgtga cccgtgtgga aatcagggaa 300 



<210> 284 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 284 

ggtgtcctcc ccagtgcgcc gcgatttttg tgtccaagcc ccagagtccc tctgagacca 60 

acccccagcc agcacagact tcctgccttc ccagctcgga agcgccctcg agaagtgtct 120

aaaggagaca gttgatagcc aaacaacagt tttggattca ctgactgatt atgaaagaag 180 

cagtagactg gtatcaagaa tcagtcaggt ttttggaatc cttaggcttc cacgtgtgga 240

aagcctgagc taacctactg gaggatgagc catcacctgg agcagattca ggccatccta 300 

<210> 285 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 285 

aattccgttg ctgtcgggcc gccatgtcat tctggagaga gacagagtaa aacaaagaag 60 

gtgatgggta aagcgcagtc gcctgctata tattgtctat ttttggtttt tcacttacct 120 

tttatattta tgtcttttat gtacaacagg attataagta gcttgagtcc agtgaatata 180 

ccatttcatt ttgctatcct tcactgcact tagcttagag gaaataatca cagcttatta 240 

ttgattaatt aattaattaa tagatgaatg gtgaacacat gactatcatc ccaagaaatg 300 



<210> 286 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 286 
agccaatgag gcttttgcct gccagcagtg gacccaagcc attcagcttt acagcaaggc 60

tgtgcagagg gcccctcaca atgccatgct ttatggaaac cgagcagcag cctacatgaa 120 

gcgcaagtgg gatggtgacc actatgatgc cctgagggac tgcctcaagg ccatctccct 180 

aaacccatgc cacctgaagg cacactttcg cctggcccgc tgcctctttg agctcaagta 240

tgtggctgaa gccctggagt gcctggacga cttcaaaggg aaatttccgg agcaggccca 300 

<210> 287 
<211> 300 
<212> DNA 

<213> Homo sapiens 



<400> 287 

gggtgacaga gtgaaactcg tatctccaaa caaacaaaca aaaagtcctt aaacatatgt 60 

gaacaaaaat tttgtgatgg aaggattcta gttaatgagt attgcatcaa gatttacatc 120 

tttcttacta aggaaaagag ttaataaaaa ttgttcttta ttttacaggc agttactgag 180 

gctcttccca gatctcagta aacagccact cagccttgaa aatggagtgt tgttgtttct 240 

aaacatatat ttatgtcatt tattaagtac agttcactta aataacataa gtagattttc 300 



<210> 288 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 288

accactaaca gcatctactt gactactgat actttgatca tggagttagg gcatgccact 60
tgatagaaat ttgaagagca attatatttt tcaaaaagag ttttgaataa tgttaagata 120
gattgcaaca tgactatcaa ttcttccctt cccatcaaag gagagagtcc gtttatccag 180
cctttgaatc ttgattattc aagtgacttg cttcacccaa tgtaacatta ataagcacaa 240
tacaagcaga ggcttgccaa gaacttggtt tgtttctaat gcttagaaga agaatggtgt 300

<210> 289
<211> 300
<212> DNA
<213> Homo sapiens

<400> 289

tgtccttatc tgaaattcag cgatcttcat gaataagcat ttctctgatt gtggnatatg 60 

cctttaattt tatttctaga gtgacaaatt tttggttttg acagtttttt tctagcttta 120 

tagtttcttc ttggggagag aatatgtcaa cctcactcca tcatgctgaa gtaaatcttc 180 

atctcttaat tttatctctc aaaaatatcc taaggattcc ctctggagcc tgataagtaa 240 

ttgcagtatc tggtttctat ggttggatga ttcaggattc caggaataat agttactttt 300 

<210> 290 
<211> 300 
<212> DNA 

<213> Homo sapiens 



<400> 290 

ggaaccatga gaaccgaagc tagaattgct attgaattac tttattttct cttcccttat 60 

tgggtagaga tacatcatta ctggcctcag gggtttaccc aaagaaaggg tatttttgag 120

caaataatgt gatttcctgg ctattttgtt gggggcttaa gatttttttt tttcaaatgc 180 
atttttagtc actaaaaatt aactgtcgta ccatctagaa ctatactgtc cagtaccata 240
gcctctagcc gtatgtagct atttgtatta agattaatgg aaattttaaa tccagttcct 300 

<210> 291 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 291 

tatgatttta tttttggcct aatataggaa tgtttaaaaa aggcttttct atgaaaatta 60 
gaaatttata cttgaaatta aaagtctaca agggggagga ccttaaagct aagctaccag 120 
taagacaatg aataattcag aagagaacac tattctttta ctgactgagt gcccaagatg 180 
ccaatttcca tgaagtcttg atttatatat atgtacacat gttatgcaca tacatgtttg 240 
ttttctaaca gttattcttt aagcttttga gataatttta gacttacaga agagttggaa 300

<210> 292 
<211> 278 
<212> DNA 

<213> Homo sapiens 
<400> 292 

cccagaccta tggagtcaga cagtaggttt gaggcccagc aatctatggt ttaacaagcc 60 
atccaggtgt ttctgatgca cagtgaaatt ggggtaccac tggtattagg tttggtatgg 120

caactttttc atcacttgtt ttatgtagtt gtctgatcaa ttgtgaaaac ataatgaatg 180 
ttggaaatgg aacagtaaaa taacgaaagc caactttttt tttttttttn nnnnnnnnnn 240 
nntgntttnn cccccaggnt gnanngcagg gncccaat 278 

<210> 293 
<211> 297 
<212> DNA 

<213> Homo sapiens 
<400> 293 

ggaaggcagt gggaggagag gaccaagtct caaactccag aagccccacc tccctgagct 60 
cagctcctct gccaagcccc ctcagcgcga agtcctcgtc cagagaaggc aacggcgaga 120 
aacaaatcca acatcctggg ctgctttttc cttcccccac tttttaaaag tttggtgtcc 180 
aagtcacttg acaaacccag accctaacaa tgatattttg tgtagaattc tgggatcaaa 240 
atataatttc aaaaataata tattttctga catcccccaa aaaaaaaaaa aaaaaaa 297 

<210> 294 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 294 

ggaacagttt gagcaaaggc tctcaagtaa tagggtgtct gacttgttca tttttgaaag 60 
tagaactaat aggatttctt attggaacgt agggtgtaag agaaaagagg agtcaaaaag 120 
agccacaaga tttttggtct cagcaattag aaggatagaa ttgacattta ctgagatttt 180 
tgtttttgtt tttgagacgg agtttcgcta ttgttgccca agctggcgtg caatggcgtg 240 
atctcggctc agtgcaacct ccacctccca gattcaagcg attctcctgc ctcagcctcc 300 
<210> 295

<211> 299 

<212> DNA 

<213> Homo sapiens 



<400> 295 

gtaatattga tgtgattgtt gtcgcttgag aaaaaaaggc aacagctgat tctttcaaca 60 

actgtcacag aatggctggg ctgagaacgc tgcccagggc cctgcagctg gcgggagnnn 120

nnnnnnnnnn nnnnngtgcn tgctgcaaca tntggttana tngtatcctt ccctanagnt 180 

gctacnnctt nnatcccctt gtnaatatgt tgagntnnct tngcnttcnn gntnntccng 240 

ntnnttgaca cntatgnaan ttntntngtc tngctctgct nga timet tn nangctgee 299 

<210> 296 
<211> 300 
<212> DNA 

<213> Homo sapiens 



<400> 296 

gcagaacctt ttcccctcta ctcttgtcta aaagttctgt gtggcacaca gagatgcgac 60

ctactcaatc tgacttagta aaaccatgct gaaaaatttt ggtctaaaaa ggacccatac 120

ccagcaccca tgaaataaaa gattcatctg taattgggat tcaaagggat taaattcctt 180 

tggtcatact cataaatagc actaaagtgt tataacattt tcatttacct atttttagtt 240 

ccttcatttt aacttaataa aaatcttgga ttgatattct tttttttttt ttttgggacg 300 

<210> 297 
<211> 300 
<212> DNA 

<213> Homo sapiens 



<400> 297 

gctaggatta caggtgtgag ccaccatgcc cagccactta tctttaaagg attaagttta 60 

tgtttcctac tatgggaaac catcccaccc caaacttgat gaccgcatta tgtgctttta 120

tagaacatgg cacttctcca ggatagcatt tattctgttt tgtaagtgtg aatgtaatta 180 

ccctacacac agcatacaca taatcttcat attctttgcc ttgtcttgtg aaggcaaggg 240

ccatgtctat cttattcgtc attagattcc cacatccaac atagtcctgg ggacagcacc 300

<210> 298 
<211> 300 
<212> DNA 

<213> Homo sapiens 



<400> 298 

ccaaatctgc ctagagattg agttcacagt gtatgttctg ggggcgctgg tgcagtcagc 60

ggtccagtct ccagcctgca ggcgtgcaca ctggggtgga cgatgggtgg ccccgcaggt 120 

gtacacattt gggtggcccc ggcccctata ccccagtgtt ctctttgatc cagtcccgaa 180 

acagagggag ccttgtgtac acgcctccaa agtggagctg ggaggtagaa ggggaggaca 240

ctggtggttc tactgaccca actgggggca aaggtttgaa gacacagcct cccccgccag 300 

<210> 299 
<211> 300 
<212> DNA 
<213> Homo sapiens



<400> 299 

ctccattgtg aagatccagg catttttccg agccaggaaa gcccaagatg actacaggat 60 

attagtgcat gcaccccacc ctcctctcag tgtggtacgc agatttgccc atctcttgaa 120 

tcaaagccag caagacttct ctgctgctgt gatctgcaca ccctccaacc tgggcaggga 180 

ctgsJSSg^t gcagtgtgtg ttagtgccca tgtggcattg tggcactgtt gccccccatg 240 

gcggcatggg caagatgacc ttccattagc ttcaagtctt gttctcttgt ctgtggtctg 300 

<210> 300 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 300 

agcaattcca ctcctagctc cacccacagg aattgaaagc aaagacgcaa acagatgcct 60 

gtgcaccaaa gttcacggca gcatccttcg ccatagtggc agcatccgtc gtcacagcgg 120 

catcatcctt catcatagcg gcagcatccg tcgtcacagc ggcagcatcc ttcgccacag 180 

cggcagcatc tgtcgtcaca gcggcagcat ccttcgccaa agcggcagca tccttcgtca 240 

tagcggcagc atcctttgcc atagcggcaa ggtggaaacc ctgtccatcc actgaggcgt 300 

<210> 301 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 301 

tcacagatat gaaagttcag tcagaggggc tgggccgaca tctgtgcttt tccctgcagg 60 

atttttagga tcagtgagac ggtgtgtatt tggaagcatt tcaaatgtgt taccatcgtg 120 

ttacttccgt gggcacctgg tgttattggt tggactagtc aggattctcc agagcagcag 180 

aagcaatggg atgtgtgtgc atgtgtttgt gcagagacag aaagagagat tttaaggaac 240 

tggcttatgc agttgtgggg gctagcaagt ctgaaatttg cagggcgggc cagcaagctg 300 

<210> 302 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 302 

tcaccaggaa tacagtgaca ttaaaagtgt gatatggttt agctgtgccc ccacccacat 60 

ttcaacttga actgtatcta tctcccagaa ttcccacatg ttgtgggagg gacccagggg 120 

gaggtaactg aatcatgggg gctggtcttt cccgtgctat tctcgtgatg gtgaagtctc 180

acgagatctg atgggtttat caggggtttc cacttttgtt tcttcatttt ctcttgccac 240 

cagcatgtaa gaagtgcctt tggtctccta ccatgattct gaggcctccc tagccatggg 300 

<210> 303 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 303 
gccctctcca ttttctgagg aggtgatatt tgggcagatt acaaactgag gaagcatact 60

ggatagacat caggatgaag agaataggca gttgaaaagt cccagaaagg ggagtgtgct 120 

tagagtgttt gaggaacagc aaggaagcaa gcccttgttg aaacagattg agcaaggtag 180 

aaagtggtaa aagatgaagt taaagaggta gctgagagcc agatcatgta aagccttggt 240

aaggactgac ttttatttta agagggttag gaagacattg gtaggttttg actctggctt 300

<210> 304 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 304 

aacaggaata tggaaagaaa ctcagagccg agttagtgga aaagtggaaa gcagagagag 60

aggctcggct ggcaagagga gaaaaggaag aggaggagga agaggaggaa gagatcaaca 120 
tctatgcagt caccgaggag gagtcggacg aggaaggcag ccaggagaaa ggaggggacg 180 
acagccagca gaagttcatt gctcacgtcc ctgttccctc gcagcaagag attgaggagg 240 
cactggtgcg aaggaagaaa atggaactcc tccagaagta tgcaagcgag accctgcagg 300 

<210> 305 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 305 

aatagtagaa agggtcccca ttcctgctca gcaccgcacc tctctacccc cccacagaca 60 

cacatgcaga cacacacatg cagacaacac gcagacacac acatgcaggc actcacatgc 120

aggcccatgc acacacacgt gcacacacat gcagagacat gcagacacgc aggcacacat 180 

gcacacatgc aaagacacgc atgcaggcac acgcagacgc acacagagac acacatgcag 240

atacacatgc acacacacat acacacactg gcccctgttt ttctgtggtg tcactgggtg 300 

<210> 306 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 306 

cagcaaagac tttatttttg tacagaagat ggtgaagtcc aagacggtgg ctcagtgcgt 60 
ggagtactac tacacgtgga aaaagatcat gcggctgggg cggaaacacc ggacacgcct 120 
ggcagaaatc atcgacgatt gtgtgacaag tgaagaagaa gaagagttag aggaggagga 180 
ggaggaggac ccggaagaag ataggaaatc cacaaaagaa gaagggagtg aggtgccgaa 240
gtccccggag ccaccacccg tccccgtcct ggctcccacg gaggggccgc ccctgcaggc 300 

<210> 307 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 307 

gctgcttctg gctggggggt ccttggcctt catcctgctg agggtgagga ggaggaggaa 60 
gagccctgga ggagcaggag gaggagccag tggcgacggg ggattctacg atccgaaagc 120 
tcaggtgttg ggaaatgggg accccgtctt ctggacacca gtagtccctg gtcccatgga 180 
accagatggc aaggatgagg aggaggagga ggaggannnn nnnnnnnnna ntggccttnt 240
gtggcctcca ccagcagctn tnnannatga catggagtcc caactgnacg nctccctcat 300 

<210> 308 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 308 

agttaagagt gtgaacccta gatttgccat ctgaaagtca tgtgtccttc agtgatgcat 60 

ttaacctctc tgtgcctcaa atttctccct ctggggtatg ttaggagtat acaaattaac 120

acatgtaaag tgcttagaat agattggtac tgttaaatat gagctaacgt cacatttgat 180 

atttttttaa aaagaaaaaa tcattatgga gtctcagtcc tagagattct gattcattaa 240 

ttctgcttct cggcaaggag cgatttgctg gtgtagacat tccgggtccg tgtaaagggt 300 

<210> 309 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 309 

ccaacaccca gttctcactc tgtcatccag gctggtgtgc agtggtgcaa tgtgggctta 60 

ctgcagcctt gacctccagg acaagtgatc tcccacctca gcctccggaa tagctgggac 120 

tacagctcaa caacgcccct ctgaaagtag gactcttgga aatgaacctt gttgggagta 180 

aagctgaacc ttcacctctc ctttccagga ttctactcca ttcatacggc ctcacactga 240 

attaatggtt ctagcagcca catcactttg ttacccaatt gatctagtag taaagtcttc 300 

<210> 310 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 310 
aggaaacacc cccttataaa accatcatat caggctgggt gatctgacag agctagacac 60
tgtcaaacaa acaaacaaac aaacaaaaaa accccatcac atctcatgag acttatttac 120
tatcatgaga gcagctcagg aaacacccac tcccgtgatt cagttacatc ccactgggtc 180
tgtcccacaa attgtgggag ctacaattca agatgaggtt tgggtgggga cacagccaaa 240
ccctatcacc atgtaaaata atatctaatt tgtagagatt aaagaacaag ataacttaaa 300

<210> 311
<211> 300
<212> DNA
<213> Homo sapiens

<400> 311

ttntgcagat ctccagcaca agcctctgct agttgatctc acggtagaag aaggtcaaag 60
attaaaggtt atgtttggtt cacacactgg tttccatgta attgatgttg attcaggaaa 120
ctcttatgat atctacatac catctcatat tcagggcaat atcactcctc atgctattgt 180
catcttgcct aaaacagatg gaatggaaat gcntgtttgc tatgaggatg anggggtgna 240
tgtaaacacc tatggccgga taacnaagga tgtggtgctc caatggggag aaatgcccac 300

<210> 312
<211> 275 
<212> DNA 

<213> Homo sapiens 
<400> 312 

cctccctgga tgtgcagaca tggaggagga cagaaggccc agctcagtgg cccccgctcc 60 
ccacccccca cgcccgaaca gcaggggcag aggcagnnnn nnnnnntaag ngtgtnnaan 120 
tntnnatttn ttcctntttt ttttnnnttn aaatatnntg nnnttttttn ntantantta 180 
ttatnntntn nttattannn tntttttcnt ntnttacttt gttnttgatt ttanncnttt 240 
natntttttt ttgttcttct nttntattnn atctt 275 

<210> 313 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 313 

tcctgtcttc ttgcccaaat gttgcatttt ccaagaccac tctggcctgc catgccaccc 60 
attctgtgcc tataaaaacc ctgagacccc agcgggcaca cacacaagcg gctggacgtc 120 
aagaggaaca cactggcaga agaacacatc gaaagacgct ggcaggccat tgatggtgga 180 
acgattcgga cgccaaggga aattcggcca aggacagtag gagatcccgg ctgctgagca 240 
gccagactcc agaggaagac taccttccca tgctatcccc cttctggctc cccagccatc 300 

<210> 314 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 314 

ataagggtgg ggccttaatt cagtagaatt ggtggcctcc taagcagagg aagagagatt 60 
tttctttctc tctctgccat gtgaagacag tgaggagtcg gccgtctgca agccaagaag 120 
agcccttatc aggaacagac ttggctagca ccttcatcgt ggacctccag cctccagaat 180 
tgcaagaaaa tacatttccg tcgttgaaac cacccagtct gtggtatttt gttatggcag 240 
cccaggcaga ctaatacgtg aagcctgctc taaatagata aaataagaaa ttactacaga 300 

<210> 315 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 315 

gtctcagtgt ggcctgtggg gctggtgggc ggcctgcgat tcgagggccc tcaggtacag 60 

gacggccggg tagtgggctt ccacacagca tgggagccca gcaggccctt ccctgtggat 120 

atggctggat ttgccgtggc cctgcccttg ctgttagata agcccaatgc ccaatttgat 180 

tccaccgctc cccggggcca cctggagagc agtcttctga gccaccttgt ggatcccaag 240 

gacctggagc cacgggctgc caactgcact cgggtactgg tgtggcatac tcggacagag 300 

<210> 316 
<211> 300 
<212> DNA 
<213> Homo sapiens
<400> 316 

gaaatgcctc tatgtaggtg aagtgttctc tctgcatgca acaggaaaaa ttaatataat 60 

attttcccca caaaagaaac acttaacaga ggcaagtgca atttataaat ttatatctaa 120

aggggaatca tgattataag tccttcagcc cttggactct aaattgaggg gattaaaaag 180 

aatttaaaat aattttgaac gaatttattt tcccctcagt ttttgagggc attaaaaagg 240

cattaaatca agacaaatca tgtgcttgag aaaaataaaa ttaatgaaaa cacagcactt 300 

<210> 317 

<211> 295 

<212> DNA 

<213> Homo sapiens 

<400> 317 

acactgtccc actccatcac ccaggctgga gtccagtggt gtgatcatag ctcgctgcat 60 

cctccagttc ctgggttcaa gccatccctc ctgcctcagc ctccccagta gctggaacta 120 

caggtgtgtg ccatcacacc tggctttaca tttttctgtg gggacttact atgttgccca 180 

ggccggcctc aaactcctga gctcaagtga tcctctgcct cagcctccag agtatctggg 240

attacatatg tcggctaccg tgtctggccg ttcacatctt tggccactat ttgct 295 

<210> 318 

<211> 261 

<212> DNA 

<213> Homo sapiens 

<400> 318 

cctgaatata aagaggagga ggaagaccaa gacatacagg gagaaatcag tcatcctgat 60
ggaaaggtgg aaaaggttta taagaatggg tgccgtgtta tactgtttcc caatggaact 120
cgaaaggaag tgagtgcaga tgggaagacc atcactgtca ctttctttaa tggtgacgtg 180
aagcaggtca tgccgaccca agaannnnnn nnnnnnnnnn nnntngccnn aacnnttcac 240
caaatncccc gggggggctt g 261

<210> 319
<211> 300
<212> DNA
<213> Homo sapiens

<400> 319

gggacctctg cccaagaaag cctgggtatt gaccaaggtt tcccccccac tgagacagcc 60
tgagatatgg cctcatggga agggaaagac ctgactgtcc cccagcccga cacctgtaaa 120
gggtcggtgc tgaggaggaa tagtgaagga gggaggcctc tttgcagttg agataagagg 180
aaggcttctg tctcctgctt gtccctggta atggaatgtc tcggtgtaaa gctgaccatt 240
cccattcgtt ctattctgag ataggagaaa accgccctgt ggctggaggt gagatatgct 300

<210> 320
<211> 289
<212> DNA
<213> Homo sapiens

<400> 320

caggctaagc ctcagattca gtgcaggaca 60
cttgaagcct cccgacttcc acagagtgct 120
tctttttttc ttcttttttt ttttnnnnnn 180
gcaggggcct gatntnggnt aantgnaacc 240
cctaancctc naaagtacc 289



<210> 321 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 321 

gaaagaccga gatagagaga gagacagaga cagagagcga gaccgtgatc gggacagaga 60 
aagagaacgc accagagaga gagagaggga gcgtgatcac agtcctacac caagtgtttt 120 
caacagcgat gaagaacgat acagatacag ggaatatgca gaaagaggtt atgagcgtca 180 
cagagcaagt cgagaaaaag aagaacgaca tagagaaaga cgacacaggg agaaagagga 240
aaccagacat aagtcttctc gaagtaatag tagacgtcgc catgaaagtg aagaaggaga 300 

<210> 322 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 322 

cgccctttaa ctgcagttct gctctatttt cttttctctc tctggagctg agagtcagag 60 
ggcccttctc ctcctccttt cagcccccaa cactaagctg atggattgat aaatacctca 120
gcccctcgcc ttcctcaacc cacctggcaa gtcttcttag gatctgatcc cagttttctg 180 
gaagcaatcc taccccagcc caagcttccc aagagtcgag ccttaatcct tctcacttct 240 
cagtgtcaga gcagaaatga atcctggggt tgactgtgtc cattcgggtt attagcagct 300

<210> 323 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 323 

agattatgag catgtagaag atgaaacttt tcctcctttc ccacctccag cctctccaga 60 

gagacaagat ggtgaaggaa ctgagcctga tgaagagtca ggaaatggag cacctgttcc 120 

tgtacctcca aagagaacag ttaaaagaaa tatacccaag ctggatgctc agagattaat 180 

ttcagagaga ggacttccag ccttaaggca tgtatttgat aaggcaaaat tcaaaggtaa 240 

aggtcatgag gctgaagact tgaagatgct aatcagacac atggagcact gggcacatag 300 

<210> 324 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 324 

gtctgagaag tcaaggatcg gggtgctggc ctattcagtt cctggtaagg gctgtcttcc 60 

tggcttgcag ttgaactact tcttgctgtg tcttcacaag catgccccca tcctgtgccg 120 

ataagaactc cagaccccaa actcagctca tacacacacg gaagagagaa gcatctgaac 180 
atcaagaaga gaagaagctg ctggacatca gaaactgtga aaggagagga gtttggctga 240

gctccagggg aagactgcct gcacattcta tccccttttc agttccccat cctgctgtca 300 

<210> 325 

<211> 283 

<212> DNA 

<213> Homo sapiens 

<400> 325 

gtccgaagaa aaagactgtg gtggcggaga tgctctctcc aatggcatca agaaacacag 60 

aacaagtttg ccttctccta tgttttccag aaatgacttc agtatctgga gcatcctcag 120 

aaaatgtatt ggaatggaac tatccaagat cacgatgcca gttatattta atgagcctct 180 

gagcttccta cagcgcctaa ctgaatacat ggagcatact tacctcgtcc acaaggccag 240 

ttcactctct gatcctgtgg aaaggatgcn ngtgtgtagc tgc 283 

<210> 326 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 326 

atgacatcct cattatccac actgcaaagc caaccatccc tatgatgggt tcattgtgga 60

tcatgactta gtgggtcaag agtttggaag tggctcagct gggcggttct tctgctccat 120

gtggctgcca gatggtaccc tgctggtggg cagtctggtc tagagggtcc atgatggctt 180 

tactcacatg cctggcatct tgacagggac agctggaagg caaggttcag ctgggactgt 240 

ccacagagct cctccctgtg gcctttccag catggtggtc tcagggtagc tggacttcct 300 

<210> 327 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 327 

ggtagactgg ctagggatcc tggacccagg gttccacgta gcaacacctg ctgagttctc 60 
tgggttttct tcctgcctca tgtagcccag acttggagct gaagaagctg gaaacatgga 120

aacaccaaca gctacagacc aaaaaaagtc ccaacaaagg cctgtcagtc tgccagcctg 180 
ttctgtggat ttccaactca agattgcagc atcaactcac acctgaagtt ctggcttccc 240 
tacaaacttt gaacttgcca gtccccacaa tggcataagc caattcctta aaatgaatgt 300 

<210> 328 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 328 

gtcacaggca ggtttaatgg ccagtttaaa acttatgcta tctgcggggc cattcgtagg 60 
atggtgagtg tttccctggg ctttgctcat cacttcggga catcgtggac tttaccgtgc 120 
gcattggagt gtgtgatggt gcctgagtag atctgctggc agagtagttt gagccagctg 180 
gactgggctg gccgcctgcc gcttcttgag ggtggaagag gggtgctctg agaagacact 240 
caggcagcag actctgcctc tcacttaagg tgcccccccg accccgctcc accatagtca 300 
<210> 329

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 329 

ttttggtcgt ctttaatttg tctcatcagt gcctccatgt gtttttgatg cctttgaact 60 

ggtattttta aaatttcaat ttctaattgt tcattataga aacacaattg ggttttatat 120 

attggcattg tattttgcaa ctttcctaaa ctcactagta attctagtag ctttttttgg 180 

tagattctta aggattttct gtgtaaatag tcatgtcatt tgtgaataaa gccatttttt 240 

tttccttttc aaattttgtg ccttttattt cttattctta ccatatcaca ttggcaaaga 300 



<210> 330 
<211> 300 
<212> DNA 

<213> Homo sapiens 



<400> 330 

tcaaggatcg gggtgctggc ctattcagtt cctggtaagg gctgtcttcc tggcttgcag 60 

ttgaactact tcttgctgtg tcttcacaag catgccccca tcctgtgccg ataagaactc 120 

cagaccccaa actcagctca tacacacacg gaagagagaa gcatctgaac atcaagaaga 180

gaagaagctg ctggacatca gaaactgtga aaggagagga gtttggctga gctccagggg 240 

aagactgcct gcacattcta tccccttttc agttccccat cctgctgtca gccacattta 300 

<210> 331 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 331 

accgccctgt ggctggaggt gagatatgct ggcagcaata ctgctctgtt actccttgct 60 

acactgagat gtttgggtaa agagaaacat aaatctagcc tacgtgcaca tctgggcaca 120 

gtacctttcc ttgaacttat tcgtgataca gattcctttg ctcacatgtt tccctgctga 180 

ccttcttccc acctgttgcc ctgctacact cccctcgcta agacagtaaa aataatgatc 240 

aataaatact gagggaactc agaggccagc gccggtgcgg gtccccccca tgctgagcgc 300 

<210> 332 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 332 

ggaaaaacaa caggtttgag tcctataaag ccataattta actccagtag ctgatgtcag 60 

acaagcttgt cctatgtcct atttgagtgg cagcagcgcc agcccagcaa gaaggctggg 120 

ggttgtcaag gttgtcccca gaccttgctt gcagtggttg gagaacccag ggggctgcct 180 

tgggccctct ggccagaggg aagcgggcag ctctagccct ggagattgtg gtcacattgg 240 

ggcttgttta ggattggagg gccaggtcac ctccccagcc accctccctt ctctcctctg 300 

<210> 333 
<211> 300 
<212> DNA 
<213> Homo sapiens



<400> 333 

cctcctactc ccaaacaaat ctttggggaa aaaaaaacta ccaactgtca gccatgggcc 60 

tgacggcgct aagctctggg gctccgtgca ctgacgtggg gccagccaca gggaggcggg 120 

gatcaagtag cggaggccag gattttggcc acctcccggg caagttgcag ggcagtggcg 180 

ccgggagcaa aagcagcatg atgcagctca tgcacctgga gtccttttat gaaaaaacct 240 

cctcctgggc ttatcaagga agatgacact aagccagaag actgcatacc agatgtacca 300

<210> 334 
<211> 262 
<212> DNA 

<213> Homo sapiens 



<400> 334 

gccatgccca tttgtttact cattgtctat ggttgctttc atgccctcac agcaaaggcg 60 

agtagttgtg atggatcaaa tggcccacaa agcctgaaat atttactctt tgacccttta 120 

cagaaaaaaa ccttgttgac ccctgcttta gagaatgaga agccatgcag ggatcagtga 180 

tgccagagga agggaaggaa ctgcttccag ctattgtgac aataataata ataataatat 240 

tgggtctttg actagaacgt gt 262 



<210> 335 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 335 

tcntntctcn ntattnttgn gtagtncctc ntttccttgt ncnntnntcn nctnttgnct 60 

tttgcggacc ctcgattcta tctcatatga gtgagaacgc ttaccagtgc agcgaatgtg 120 

ggaaagcctt ccgagggcac tcggactttt ctaggcatca gagtcaccac agcagtgaga 180 

ggccttatat gtgtaatgaa tgtggaaaag ccttcagcca gaactcgagc cttaaaaagc 240 

accaaaagtc tcacatgagt gagaagccct atgaatgcaa tgaatgtggg aaggctttta 300 



<210> 336 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 336 

gaggacccac tcccccagga ctcctttgaa ggcgtggacg aggacgagtg ggactagcct 60 

gcgcccccgt cacctccacc tcacctgtgc tgccacttcc tagtgcacac ctcacggctc 120 

atcctcaagc tggaagatac ctctctggcc ccggcacatg tcacccctgc actcctgcct 180 

tcccgtgggc acttccacat cctctgggcc tctggcagtt cccagggact gttttcacct 240

ctgctgtctc tggggtcagc tgctgctcat cagctgcccg ctagcatgtg gccaggggtg 300 



<210> 337 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 337 
agacaaccca gaaacaaatt catacatcta tggtgaccac ttttgacaaa ggaatgaaga 60
acatacactg gggaaaagat aatgtcttta ataaatggtg ctgggaaaac tggatatcca 120
tatgcagaag aatgaaacta gacccccatc tcttagcata tacaaaaatc aaaattaatt 180
aaaaagttaa atctaagacc tcaaactatg aaacagctaa aagaaaacat cggggaatct 240
ctccaggaca ttggagtggg caaagatttc ttgtgtaata cctgacaaac aggcaaccaa 300

<210> 338
<211> 292
<212> DNA
<213> Homo sapiens

<400> 338

tcaataacca tgaagatgca tcctaccacc gtcagggcaa tcattagata gctgatcttc 60
actcgcatct tcatggttat tgagggcaag aaggctgccc aaagacacga gactttaaca 120
agcttgaact tagaaaagaa agctcgtctg aaagaggaag cagctatgaa ggccaaaaca 180
gagtagcaga ggtatccgtg ttggctggat tttgaaaatc caggaattat gttataacgt 240
gcttgtatta aaaaggatgt ggtacgagga tccatttcat aaagtatgat tt 292

<210> 339
<211> 300
<212> DNA
<213> Homo sapiens

<400> 339

gaaatttgca ctgatggctc agaaggctta cgtcatggag agtatgacct acctcagagc 60
agggggggct ggaccaacct ggctttcccg actgctccat cgaggcagcc atggtgaagg 120
tgttcagctc cgaggccgcc tggcagtgtg tgagtgaggc gctgcagatc ctcgggggct 180
tgggctacac aagggactat ccgtacgagc gcatactgcg tgacacccgc atcctcctca 240
tcttcgaggg aaccaatgag attctccgga tgtacatcgc cctgacgggt ctgcagcatg 300

<210> 340
<211> 300
<212> DNA
<213> Homo sapiens

<400> 340

ctcagngcan cgatcatggc tcagtgcagc ctcaaactct tgggctcaan canagcgggn 60
acctcaacct cctgagtagc taggactata ggcacacagc accatgcccc ggctattttt 120
ttattttgta gagatggggt ctcactatgt tgcccaggct agtcttgaac tcctggcctc 180
aagcaatcct cccacctcgg cctcccaaag tgctgggatt aaaggcgtga gccaccgtac 240
ctggcccttg gtggaatctt tagggttttc tattcataca tataaaatca tatcattggc 300

<210> 341
<211> 296
<212> DNA
<213> Homo sapiens

<400> 341

atccaggtgt ttctgatgca cagtgaaatt ggggtaccac tggtattagg ttgggtatgg 60
caactttttc atcacttgtt ttatgtagtt gtctgatcaa ttgtgaaaac ataatgaatg 120
ttggaaatgg aacagtaaaa taacgaaagc caactttttt tttttttttt ttnnnnnnnn 180
nnnnnnnnnt tnncccccng ncngnanngc aggggcccaa nntnggntnn ntgnanccnc 240
cnccnccggg ntnnncccct ttntcnngcc taacccnccc nagnacnngg aactac 296

<210> 342 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 342 

ggcacgatca tggctcattg cagcctctaa ctccggggct caagcaatcc tcccacctca 60 
gcctaccaag tagctgtgac cacagctgcc cctcaccatg ctaagctaat ttttttaatt 120 
agatagtaca taaacgtccc aaaattagaa gataaaaaga catgagggat ccattctaat 180 
ttgtgtttgg agtgtaatgg tccagctcca ttcttctgca catggatatc cagttttaca 240 
caacactgtg aatgtaatga atgccactga atcatacact caaaaatagc taaaatggca 300 

<210> 343 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 343 

gttttcatca ctacatattc tacacacact gggaagctct gacaacttat tccctgctat 60 

tatcaactaa agatcaccct ttctactgct gtctctggag caggagctgg caaactatgg 120

cctgctgtct gtttttgtac agttttactg aaacacagcc atgcccattt gtttactcat 180

tgtctatggt tgctttcatg ccctcacagc aaaggcgagt agttgtgatg gatcaaatgg 240 

cccacaaagc ctgaaatatt tactctttga ccctttacag aaaaaaacct tgttgacccc 300 

<210> 344 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 344 

ccccaacctg cactctaccc acccccatca cctactccag ctcccaactt ttgtggactg 60 
agcggccgca gagactgggt cgccttggat tccctctgcc tccgaggacc ccaaaagaca 120

cccccaaccc caggccagcc ggccctgctc tggcgcgtcc aaaatactac ctagcacagg 180 
cctctgctcg aggcaccccc aaactaccta tgtatccagc cccagagggc ctccattccc 240 
aggaagtccc tatgtatccc aacactggca gacacccagc accaccctcc cagacccgca 300 

<210> 345 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 345 

cccccatcac ctactccagc tcccaacttt tgtggactga gcggccgcag agactgggtc 60 
gccttggatt ccctctgcct ccgaggaccc caaaagacac ccccaacccc aggccagccg 120 
gccctgctct ggcgcgtcca aaatactacc tagcacaggc ctctgctcga ggcaccccca 180 
aactacctat gtatccagcc ccagagggcc tccattccca ggaagtccct atgtatccca 240 
acactggcag acacccagca ccaccctccc agacccgcaa gaaagtgaat ctcactacta 300 
<210> 346
<211> 300
<212> DNA
<213> Homo sapiens

<400> 346

gtccacggtg ctgaacatca tcatctttga agactgtagg aaccagtggt ctatgtcccg 60
accactactt ggcttgatat tgcttaatga aaagtatttt tctgacctaa gaaacagtat 120
tgtgaacagc cagccaccgg agaagcagca ggccatgcac ctgtgttttg agaacctgat 180
ggaaggcatc gagcgaaatc ttcttacgaa aaacagagac aggttcaccc agaacctgtc 240
agcattccgt cgagaagtca acgactcaat gaagaattcc acttatggcg tgaatagcaa 300

<210> 347
<211> 300
<212> DNA
<213> Homo sapiens

<400> 347

gctctgagcc caggcgaggc cagggacatg gccatggacc tgtgtcggca ggaccccgag 60
tgtgagttct acttcagcct ggacgccgac gctgtcctca ccaacctgca gaccctgcgt 120
atcctcattg aggagaacag gaaggtgatc agaccccatg ctgtcccgcc acggcaagct 180
gtggtccaac ttctggggcg ccctgagccc cgatgagtac tacgcccgct ccgaggacta 240
cgtggagctg gtgcagcgga agcgagtggg tgtgtggaat gtaccataca tctcccaggc 300

<210> 348
<211> 300
<212> DNA
<213> Homo sapiens

<400> 348

gttctgtggc tggcatggtc tgcctgctac tggagagatc tcctgagaat tcaggtttgg 60
attggtgctg tcatcttcct gggaatgctt gagaaagctg tcttctatgc ggaatttcag 120
aatatccgat acaaaggaga atctgtccag ggtgctttga tccttgcaga gctgctttca 180
gcagtgaaac gctcactggc tcgaaccctg gtcatcatag tcagtctggg atatggcatc 240
gtcaagccac gccttggagt cactcttcat aatjgttgtag tagcaggagc cctctatctt 300

<210> 349
<211> 300
<212> DNA
<213> Homo sapiens

<400> 349

gtcagctttt gatgaagcta tgtcatactg tcgatatcat ccttccaaag ggnattggtg 60
gcacttcaaa gatcatgaag agcaagataa agtcagacct aaagccaaaa ggaaagaaga 120
accaagctct atttttcaga gacaacgtgt ggatgcttta cttttagacc tcagacaaaa 180
atttccaccc aaatttgtgc agctaaagcc tggagaaaag cctgttccag tggatcaaac 240
aaagaaagag gcagaaccta taccagaaac tgtaaaacct gaggagaagg agaccacaaa 300

<210> 350
<211> 270
<212> DNA
<213> Homo sapiens

<400> 350

ccatgctgnt aacgggtttc aaggggactc ttgaggaant gccccctaaa atagaacaca 60
gcaatanggn gggcttcctg tccccaggnc caccccacag tgctntntgg cactggnaac 120
tctgctangg agngantgna nnnnaccant aannnnnnan nnatcnacan nnnnnnnncn 180
nnnnnncntn tnnccnannn ntannctncc ntannnnanc cnnccannan cactcncnat 240
naacgnnnnn ttantgagan nttctcaact 270



<210> 351 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 351 

aaatgactcc ctgcaaaacc caacccatgc tgctggctgt gggatttttg gtgtaagcct 60 
atctatgcac tctatcagcc agaatttggc atttagctct tagttaaatc tagtaaagga 120
cagtctattg tttaaagaga aggtgcattt gttcctcaat caagcaagag cacctgtgtt 180 
gtactgcttt atatctcatg tatatttata gtaatgaaaa gactttttaa attgtacacg 240 
tttcagtgcc tttcttgtgt tatgaaaggc aggtagatat tatagccata ggtaaaaatc 300 

<210> 352 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 352 

aagaaatgcc tctatgtagg tgaagtgttc tctctgcatg caacagtaaa aattaatata 60 
atattttccc cacaaaagaa acacttaaca gaggcaagtg caatttataa atttatatct 120 
aaaggggaat catgattata agtccttcag cccttggact ctaaattgag gggattaaaa 180 
agaatttaaa ataattttga acgaatttat tttcccctca gtttttgagg gcattaaaaa 240 
ggcattaaat caagacaaat catgtgcttg agaaaaataa aattaatgaa aacacagcac 300 

<210> 353 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 353 

cccacactcg gacactgtgg aattctacca gcgcctgtcg accgagacac tcttcttcat 60 
cttctactat ctggagggca ctaaggcaca gtatctggca gccaaggccc taaagaagca 120 
gtcatggcga ttccacacca agtacatgat gtggttccag aggcacgagg agcccaagac 180 
catcactgac gagtttgagc agggcaccta catctacttt gactacgaga agtggggcca 240 
gcggaagaag gaaggcttca cctttgagta ccgctacctg gaggaccggg acctccagtg 300

<210> 354 
<211> 299 
<212> DNA 

<213> Homo sapiens 
<400> 354 
gaaggaggac ctaggcacac acatatggtg gccacaccca ggagggtagt ggggagttag 60

atttcagagt ccaggcccta ggttgggacc cactccaaat aatctcctcg gtgtgggtgg 120 

tggttctata gagggataaa tgaataataa acattgttaa aatatacgaa aaaaaaaaaa 180 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 240

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaacnncn ncnananaaa aaaaaaaaa 299 

<210> 355 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 355 

actgttcatc ctaagttcca ctataaacag gctcatgact cgggcacaga cacttcttgc 60 

gtgacttttt cctatgatgg taatgtcctt gcctctcgtg gaggtgacga ttcattaaaa 120 

ttatgggaca tccgacaatt taataaacca cttttttcag cctcgggtct tcccaccatg 180 

ttcccaatga ctgactgctg tttcagtcca gatgataagc tcatagtcac tggtacatct 240 

attcaaagag gatgtggcag cggcaaactt gttttctttg agcgtaggac tttccaaagg 300 

<210> 356 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 356 

ttcagaaaga aacatttaat agggacttac aaacaaatta atgtctgagt ctcaggtggc 60 
agcaagacaa gatggtggat ccccatgcca ttacctgcta gactcagggt ttatatactg 120 
tagtggagag gtgattccga aggaatgttg taagacaatt gaagagcagt aacatcaaag 180 
ttatttgacc taagggcagg agttacagta agtatccact tttatacaag aaacaataga 240 
taaactggaa atcttggagc ccttcctgga actggggtta atgagaagtc aacatggtgg 300 

<210> 357 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 357 

acaaaaccta cagatggaga taaaaattac tactgttatt caacatgtgt tccagaacct 60 

tattttgggg agtaaagtca attgggcaga ggatcctgcc cttaaggaaa ttgttctgca 120 

gcttgagaag aatgttgaca tgatgtaata agaattcatt tctgacatat tttacatttc 180 

tggcaatctc aactcttatt tggaatactt ctgtgcattt gtctgtccac cgtaatttta 240

gaaaagcata tccataacgt ttacagttgt agtacagttg tggttagtta tttgtagtgg 300 

<210> 358 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 358 

ggtgattaca gaagcccaga aggttgatac cagagccaag aacgctgggg ttacaatcca 60 

agacacactc aacacattag acggcctcct gcatctgatg gaccctgcac ttgatggacc 120 

agctggcacc acccagatca ataaactggc ttatttgaat ttgcggcccc ccacccagga 180 
actgactcag tgcaagaaga cagcttcgac tccctgtgat ttcatctctg accaatccgc 240
actcctggct cactggcttc cccaacccat gaagttttcc ttaaaaactc tgctcccgaa 300 

<210> 359 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 359 

atcaggtgtt cctcccatgg caggagggaa gaaacccagc aaacggccag cctgggactt 60 
aaagggtcag ttatgtgacc taaatgcaga actaaaacgg tgccgtgaga ggactcaaac 120

gttggaccaa gagaaccagc agcttcagga ccagctcaga gatgcccagc agcaggtcaa 180 
ggccctgggg acagagcgca caacactgga ggggcattta gccaaggtac aggcccaggc 240 
tgagcagggc caacaggagc tgaagaactt gcgtgcttgn gtcctggagc tggaagagcg 300 

<210> 360 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 360 

tctgtctggt gatttttatt ttaagtgaac ctttggatct atctttaact ctctttattg 60 
tgagtggtaa attccaattc tgcagcagat cagtaaactc acagtatttt tcctgtggaa 120 
atctattcaa taaggaaacc aagacaggat aataaaattt aaaaaaaaac aactttgaat 180 
tcccctgcct aggtcttcca gttgttttcc agcgcatacc tcaggtatga ctttgctagc 240 
cggggacaaa attagcacct tccgattctc tagtccaaat gaactttgtg ctaaataaaa 300 

<210> 361 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 361 

gtagaacaga aaatgagcat ccgatttctt cactaaagga gaccaaactg ttccttgcgg 60 

tctagtattg aagaactgga acttgaaagt cctccttcta ccaactccac ctccaccccc 120 

tcattcccct tctcccaaag tactactgct gttgcatgac aaccccaaat atgttctgtc 180

aacacaaacc tgcctttggt gtataaacag ggcattacag aatggtacac cctatatatt 240

tctgttcagt atccattcac tagttcttca tttataaata tcatcttccc cattctgctg 300 

<210> 362 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 362 

actaccccgg ctacggttcc cccatgcctg gcagcttggc catgggcccg gtcacgaaca 60 

aaacgggcct ggacgcctcg cccctggccg cagatacctc ctactaccag ggggtgtact 120 

cccggcccat tatgaactcc tcttaagaag acgacggctt caggcccggc taactctggc 180 

accccggatc gaggacaagt gagagagcaa gtgggggtcg agactttggg gagacggtgt 240 

tgcagagacg caagggagaa gaaatccata acacccccac cccaacaccc ccaagacagc 300 
<210> 363

<211> 271 

<212> DNA 

<213> Homo sapiens 



<400> 363 

ggcaattagc ctcgcttaag ttgccttttt tacacaccaa aactttttac atgaagggct 60 

ggtttcacat gaatactata ctgaaatctg tgctctcaag atctagcagt gaccagggct 120 

gcccggcggg ggctctcctg gcaagtcagg aaggtnnnnn nnnnnnnnnn nnnnnnnnnn 180 

canattantn nctgatcntc tntnangaan nnngantngc tctnttggnc nttgtnnnnn 240

gncntnnnnt naantntttn ntnatgtngc t 271 



<210> 364 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 364 

agaggaccct gcagttaggg ggtgttactt tgtcgcccag gatggcctgg acccccaggt 60 

tcagggattc tcccgccgct gcttcctgag tagctgggac ctcaggcttc cgcctcgtgc 120 

ccgcatccct gctgtgttta ggcagcaggt ggtgacctca ctcctccctg gcctgagctc 180 

tccgtcccgc atcccaggcg gaggccctag ggaacacttt gaagctgagc acggggtgga 240 

ccctccctcc tgagtgaatg gagaatagaa agggagagga tttctgttct gttctgtggg 300 



<210> 365 
<211> 300 
<212> DNA 

<213> Homo sapiens 



<400> 365 

gttcttcaaa gccaaccaag acaggcttag cagttttaga gcttcagaac aaattgccaa 60 

aagccagagt tgtttatgct agtgcaactg gtgcttctga accacgcaac atggcctata 120 

tgaaccgtct tggcatatgg ggtgagggta ctccatttag agaattcagt gattttattc 180 

aagcagtaga acggagagga gttggtgcca tggaaatagt tgctatggat atgaagctta 240

gaggaatgta cattgctcga caactgagct ttactggagt gaccttcaaa attgaggaag 300 

<210> 366 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 366 

gccagtcctc accttcccta gtcctcgtgt gtattttagg agatgcgtgg gtgtggaaca 60 

gcctcctgcc tccggtccag gtgtactggg gtctgtgtgt tgtgtttctg cgtgttctcg 120 

gcagaaagtg gcatgctgtc ccgcctgggt gatttgctct tttacactat tgctgaagga 180 

caggaacgaa tccctatcca caagttcacc actgcactaa aggccactgg actgcagaca 240 

tcagatcctc ggctccgaga ctgcatgagc gagatgcacc gcgtggtcca agagtccagt 300 

<210> 367 
<211> 300 
<212> DNA 
<213> Homo sapiens



<400> 367 



cattgccaga gagcggtttc agcaagctgc agatctgatt gatgctgagc aacgaatgaa 60
gaagtccatg tggggtcagt tctggtctgc tcaccagagg ttcttcaaat acttatgcat 120
agcatccaaa gttaaaaggg ttgtgcaact agctcgagag gaaatcaaga atggaaaatg 180
tgttgtaatt ggtctgcagt ctacaggaga agctagaaca ttagaagctt tggaagaggg 240
cgggggagaa ttgaatgatt ttgtttcaac tgccaaaggt gtgttgcagt cactcattga 300



<210> 368 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 368 

gcccggcccg gcgacgctgg cgacgctttc gcccctgagg tagtttggcg accgcgaaga 60 

aggaaaaagg gcgggcgggc ggctgtcctc tcaccgtcct caccccgcga ggcccggccc 120

gctcctccgt cgtggatttc gcggcgatcc ccccggcagc tctttgcaaa gctgcttgaa 180

acttctccca aactcggcat ggatacgact gcggcggcgg cgctgcctgc ttttgtggcg 240

ctcttgctcc tctctccttg gcctctcctg ggatcggccc aaggccagtt ctccgcaggt 300 

<210> 369 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 369 

gtggggtgtg cctcgtgtgc gtggattcgt gtgtgtgtgt gtgtcttgta tatgtgtgcg 60 

cagagtgcat cattttcaga ctctactatt tccgtcaagt attctgtttg atttggatca 120 

tctcaggatc ggattctgtt ttagagtgtt tctgggccag gatccgggcc cctgccctcc 180 

tctgcacctg accacactcc ctactcaggg ctagtctgtt cttcccggac atcttctggt 240 

agccgtgcag gagagggctg ggtggggcag aggccacaga ggggacctgg tgtgtcacct 300 

<210> 370 
<211> 273 
<212> DNA 
<213> Homo sapiens 

<400> 370 

cagaggctgg ttcagaaaag gaggaagagg cccggctggc agccctggaa gagcagagga 60 

tggaggggaa gaagcccagg gtgatggcag gcaccttgaa gctggaggat aagcagcggc 120 

tggcccagga tgaggagagt gaggcctagc gcctggccat tatgatgatg aagaagctnn 180

nnnnnnnnnn nnnnnnnnnc atcatgtccn ntgcatggct acctatccca tatttnatnt 240 

ccctnncgtt gnttcnaatt ncacattntc ttt 273 

<210> 371 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 371 






gatgaggagt gtttaatcat tgatacagaa tgtaaaaata atagtgatgg aaagacagct 60 

gttgtgggtt ctaacttaag ttccagacca gctagtccaa attcttcctc aggacaggct 120

tctgtaggaa accagactaa tactgcttgt agtcctgaag agtcatgtgt tttaaaaaaa 180

cctatcaaac gagtatataa aaaatttgat ccagttggag agattttaaa aatgcaggat 240

gagctcttaa agccaatttc cagaaaagta ccagaattgc ccttaatgaa tttagaaaat 300 



<210> 372 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 372 

gggccccaat gcagctgccc tctccagata cctggcagcc tcatatatca gccaaagcct 60 

ggctcggcgg caggggcctg ggggaggggc ccccgcagcc tcccggggct cctggtcctc 120 

tgctcccacg tcacgggcat cttcgccgcc cccccagccc cagccaccac ctcccgcagc 180 

caggcggctc agctatgcca cgacggttaa catccacgtg ggcgggggtg ggcggctgcg 240 

gccagccaag gcccaggtcc ggttgaacca ccctgctctc ttggcctcca cacaggaatc 300 

<210> 373 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 373

tggtgttctt tggtttgctt tcttcaggct 60
catgcggctg tttaaagtta agccaattaa 120
gttgcactaa ccacgtttct agaggcgtca 180
agcgagagca tgtccatctg ttggacagca 240
gtcacagcct cagctgtgct gggacgaccc 300

<210> 374
<211> 300
<212> DNA
<213> Homo sapiens

<400> 374

tcaaggccta cgaacaggtg atgcactacc ccggctacgg ttcccccatg cctggcagct 60 

tggccatggg cccggtcacg aacaaaacgg gcctggacgc ctcgcccctg gccgcagata 120 

cctcctacta ccagggggtg tactcccggc ccattatgaa ctcctcttaa gaagacgacg 180 

gcttcaggcc cggctaactc tggcaccccg gatcgaggac aagtgagaga gcaagtgggg 240 

gtcgagactt tggggagacg gtgttgcaga gacgcaaggg agaagaaatc cataacaccc 300 

<210> 375 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 375 

cttcagtgca cacaacagga gagaggagaa agaagaaacg ctagtaattc caagcactgg 60 

aattaagttg ccttcatcag tgtttgcttc agagtttgag gaagatgttg tgattgttaa 120 

ataaagcagc tccagtttca ggacctcgac tggattttga tcctgacatt gttgcagctc 180 






ttgatgatga ttttgacttt gatgatccag ataatctgct tgaggatgac tttattcttc 240
aggccaataa ggcaacagga gaggaagagg gaatggatat acagaaatct gagaatgaag 300 

<210> 376 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 376 

gggagactgg ggtctatttc acccctgcag tctcgaccat aagagatggc tacacccagg 60 

ggggccagtt cagagaccca ctcccaggtg tgcattctct ttctcaagga tgttccttgc 120 

tgagaaaaag aattcagtga tatttctccc atttgcttgt gaaagaagag aaatgtggct 180 

ttgttccacc tggctcaccg gcggtcagaa tttaaggtta tctctcttgt ttcctaaaca 240 

ttgctgttat cctgttcttt tttcaaggtg cccagatttc atattgctca aacacacatg 300 

<210> 377 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 377 

gatcagccca cctcggcctc acaaagtgct gggattacag gcgtgagcca ccttgcccag 60 

cccacatcat acagtttgaa atgaaacttt gccacaacca gcctttgctg tagcacacac 120 

atatatcact gaacctgttt gaaataaagt tttttttctt tttcctctgg tattctgggt 180 

tctgaagtct ggtattctgg tattctgggt tcaaaagtat gacttgagag tgttgctctg 240 

gtattctgag agttgctctg tattctgggt tctgaagatt atttgaaaaa taactcctac 300 

<210> 378 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 378 

tcgctgtgat ccaaggataa aaaagttcaa ggaagaagaa aaagccaaga aagaagcaga 60 

aaagaaagca aaagcagaag ctaaacggaa ggagcaagaa gctaaagaaa aacaaagaca 120 

agctgaatta gaagctgctc ggttagctaa ggagaaagaa gaggaggaag tcagacagca 180 

agcattgctg gcaaagaagg aaaaagatat ccagaaaaaa gccattaaga aggaaaggca 240 

aaaacttcga aactcatgca agacctggaa tcatttttct gataatgagg cagagcgggt 300 

<210> 379 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 379 

acactataga atacaagcta cttgttcttt ttgcaggatc ccatcgattc gaattcggca 60
cgaggcagct tcgagccaat ggtgagctcc ttctggatca gctccttcag ctccttcttg 120 
ctcaggatgc tgaaattgca aggctgatgg aagacttgga ccggaacaag gaccaggagg 180 
tgaacttcca ggagtatgtc accttcctgg gggccttggc tttgatctac aatgaagccc 240

tcaagggctg aaaataaata gggaagatgg agacaccctc tgggggtcct ctctgagtca 300 






<210> 380 
<211> 296 
<212> DNA 

<213> Homo sapiens 
<400> 380 

acctggacag ggccagctgc tgggggagcg gcactgggga ctggaggctg gaagcgggtg 60 
gtgtgtgtcc cctgtttact tttagctgag ctggggttgg gtgtacgggt tctgttcctc 120 
tgagccctgc ggcccacctg atgtttacgt gtgtgtgtga gggggggcnn nnnnnnnnnn 180 
nnnnnnnnnn ngtnatangc ttaacanatg nanagncnac tnactnctga ttntttatnc 240 
atttgtgcat tnnaactatg cttttncgat cttnctgntg nnatnacngg catgat 296 

<210> 381 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 381 

cagaaaagag tatagtaggg atgaccaagg tcaaagtggg taaagaagac tcatcatcca 60 
ctgagtttgt agaaaaacgg agagcagctc ttgaaaggta tcttcaaaga acagtaaaac 120 
atccaacttt actacaggat cctgatttaa ggcagttctt ggaaagttca gagctgccta 180 
gagcagttaa tacacaggct ctgagtggag caggaatatt gaggatggtg aacaaggctg 240 
ccgacgctgt caacaaaatg acaatcaaga tgaatgaatc ggatgcatgg tttgaagaaa 300 

<210> 382 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 382 

gccaccggtc tcttcctaat ctgcacagac tattttgggt atttctgggc gggcagttcc 60 
tttgcatgtt tcgggagagg tttgctgatt tggggcttat atgtcaggcc tttggtttgc 120 
gtcttatttt aggggttgtt tgggggcctg ggtggtcggc ctcacatggg aaggggatgg 180 
gtagtggatg gggtttctgt tgtatcttgt gggcgggtga ttttgctttt gtttttgttt 240 
cacattcttc cccctccaca agccaaagtc gtttcatttg gtttccactg tgtggactgt 300 



<210> 383 
<211> 273 
<212> DNA 

<213> Homo sapiens 
<400> 383 

gagatttgat attcgagtgc tgggcttagg tctgttgata aatctagtgg agtatagtgc 60 
tcggaatcgg cactgtcttg tcaacatgga aacatcgtgc tcttttgatt cttccatctg 120 
tagtggagaa ggggatgata gtttaaggat aggtggnnnn nnnnnnnngc cngcnttnac 180 
ttnatngcnn ctttttcttg atcnacgncn gnnatncnna nnngtntata ntaatncnga 240 
ananttnttt gnnntgcttt atcaantntt cnt 273 

<210> 384 
<211> 259 
<212> DNA

<213> Homo sapiens 



<400> 384 

aagagaagga cctagagatt gagaggctta agacgaagca aaaagaactg gaggccaaga 60 

tgttggccca gaaggctgag gaaaaggaga accattgtcc cacaatgctc cggccccttt 120 

cacatcgcac agtcacaggg gcaaagcccc tgaaaaaggc tgtggtgatg cccctacagc 180 

taattcagga gcaggcagca tccccaaatg ccgagatcca catcctgaag aataaaggcc 240 

cgaagagaaa gctggagtc 259 

<210> 385 

<211> 296 

<212> DNA 

<213> Homo sapiens 



<400> 385 

agagcctgca agtgacaaag gaagtgaggc agaggcccac atgcccccac cgttcacacc 60 

ctacgtgcct cggattctga acggcttggc ctcggagagg acagcactgt ctccgcagca 120 

gcagcagcag cagacctatg gtgccatcca caacatcagc gggactatcc ctggacagtg 180 

cttggcgcat agcgccacgg gcagtgtggc ttgctgcccc ccaggaggcc tgaggctggg 240 

tctcactgct ctgaaaaaga cccnnccaaa atgggccttg gggctnnagg cccttg 296 



<210> 386 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 386 

gaagaggagg ctgtgtatga ggaacctcca gagcaggaga ccttctacga gcagccccca 60 

ctggtgcagc agcaaggtgc tggctctgag cacattgacc accacattca gggccagggg 120 

ctcagtgggc aagggctctg tgcccgtgcc ctgtacgact accaggcagc cgacgacaca 180 

gagatctcct ttgaccccga gaacctcatc acgggcatcg aggtgatcga cgaaggctgg 240 

tggcgtggct atgggccgga tggccatttt ggcatgttcc ctgccaacta cgtggagctc 300 



<210> 387 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 387 

ccgcagaggg cctggaagag gtgctcacca cgccagagac tgtgctcaca ggccacacgg 60

agaagatctg ctccctgcgc ttccacccac tggcagccaa tgtgctggcc tcgtcctcct 120

atgacctcac tgttcgcatc tgggaccttc aggctggagc tgatcggctg aagctgcagg 180

gccaccaaga ccagatcttc agcctggcct ggagtcctga tgggcagcag ctggccactg 240

tctgcaagga tgggcgtgtg cgggtctaca ggccccggag tggccctgag cccctgcagg 300



<210> 388 

<211> 300 

<212> DNA 

<213> Homo sapiens 
<400> 388

tggaggtctc ctttcgcccc agcccaggtg gccaagccca tcctggcctc agaacatgct 60
gagcacattt tgtagggtgg caccttttta tccaagttac tagctacaca tcagtgttta 120
aagagaaaaa agtgaccttt catttttttt tcttgaaact tgaggaaaca agatacatac 180
tactgatttt ttttttctta aaactaaatg catgactgca gagcggtaga ggtgtatatt 240
tttcatactg tggggcaaag tatttgtgct gctttttgga gatggactgg aacgtctggt 300



<210> 389 
<211> 293 
<212> DNA 
<213> Homo sapiens 

<400> 389 

gtcaagctgg ccctggatgt ggagatcgcc acctaccgca agctgctgga gggcgaggag 60 

tgcaggctga atggcgaagg cgttggacaa gtcaacatct ctgtagtgca gtccaccgtc 120 

tccagtggct atggcggtgc cagcggtgtc ggcannnnnn nnnnnnnnnn nnnatgaanc 180 

agntactcct atggnnttag cnttntanct atnacctgcn cnaactannc tnangtgcta 240 

gnncttgccc caacccctac ttttgtattt atattgtgtg tgcgtgtgtg cgt 293

<210> 390 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 390 

ctcacacctg ctttggatgc ttcaagcacc tcagccctct gaactacaaa acagangagc 60 

ctgcaagtga caaaggaagt gaggcagagg cccacatgcc cccaccgttc acaccctacg 120 

tgcctcggat tctgaacggc ttggcctcgg agaggacagc actgtctccg cagcagcagc 180 

agcagcagac ctatggtgcc atccacaaca tcagcgggac tatccctgga cagtgcttgg 240 

cgcagagcgc cacgggcagt gtggctgctg ccccccagga ggcctgaggc tgggtctcac 300 

<210> 391 
<211> 257 
<212> DNA 
<213> Homo sapiens 

<400> 391 

acccgtccgg ggccggccaa tttgcatatt tggaatgcgc cgctataaac ccggctgggg 60 

ttttgcagcg atttcttaga tgtaaaaatg agatctcaat agcagcgggc tgggcacatc 120 

ctctcctctc tccttctctc tctgcccgga gctggtttcc gtctctcggc tcggggctgg 180 

aactccggcc caacctaggc gcgcagccgc cacgagatgg cgcacttccg atcaatgtca 240 

aagccgccgg ggagccc 257 

<210> 392 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 392 

gcgcgagcgt cggctccgcc tgggcccttg cggtgcgctg cgggcaggcg gtgaggctca 60 
cgcatgtgct tacgggcaag aacctgcaca cgcaccactt cccgtcgccg ctgtccaaca 120 
accaggaggt gagtgccttt ggggaagacg gcgagggcga cgacctggac ctatggacag 180
tgcgctgctc tggacagcac tgggagcgtg aggctgctgt gcgcttccag catgtgggca 240 
cctctgtgtt cctgtcagtc acgggtgagc agtatggaag ccccatccgt gggcagcatg 300 

<210> 393 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 393 

gcgcgagcgt cggctccgcc tgggcccttg cggtgcgctg cgggcaggcg gtgaggctca 60 
cgcatgtgct tacgggcaag aacctgcaca cgcaccactt cccgtcgccg ctgtccaaca 120 
accaggaggt gagtgccttt ggggaagacg gcgagggcga cgacctggac ctatggacag 180 
tgcgctgctc tggacagcac tgggagcgtg aggctgctgt gcgcttccag catgtgggca 240 
cctctgtgtt cctgtcagtc acgggtgagc agtatggaag ccccatccgt gggcagcatg 300 

<210> 394 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 394 

gtccatacat ggagctccct ggagcccgtg tgctctcgtg tgactgaacg ttttgtgatg 60
aaaggaggag aggctgtctg cctttatgag gagccagtgt ctgaattgct gaggagatgt 120
gggaattgca cacgggaaag ctgtgtggtt tccttttacc tttcagctga ccatgaactc 180
ctgagcccga ccaactacca cttcctgtcc tcaccgaagg aggccgtggg gctctgcaag 240
gcgcagatca ctgccatcat ctctcacnag gngaccatat tggtttttga cctggagacc 300

<210> 395
<211> 300
<212> DNA

<213> Homo sapiens
<400> 395

gcaaaatcaa tgtggactga acataaatca cctgatggaa ggacttacta ctacaacact 60
gaaaccaaac agtctacctg ggagaaacca gatgatctta aaacacctgc tgagcaactc 120
ttatctaaat gcccctggaa ggaatacaaa tcagattctg gaaagcctta ctattataat 180
tctcaaacaa aagaatctcg ctgggccaaa cctaaagaac ttgaggatct tgaagcaatg 240
atcaaagctg aagaaagcag taagcaagaa gagtgcacca caacatcaac agccccagtc 300



<210> 396 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 396 

aagagcacaa gaggaagaga gagaccctca ctgctgggga gtccctgcca cactcagtcc 60 
cccaccacac tgaatctccc ctcctcacag ttgccatgta gaccccttga agaggggagg 120 
ggcctaggga gccgcacctt gtcatgtacc atcaataaag taccctgtgc tcaaccaaaa 180 
aaaaaaaaaa aaaaacnnnn nnnnnnnnnn nntntngggn gnctnntnnc nnaaanccan 240 
ncttnataaa anccttngnt natttggaac aacccncann taaanngcag ggaaaaaaag 300

<210> 397 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 397 

gataaatacc tcagcccctc gccttcctca acccacctgg caagtcttct taggatctga 60 

tcccagtttt ctggaagcaa tcctacccca gcccaagctt cccagagtcg agccttaatc 120 

cttctcactt ctcagtgtca gagcagaaat gaatcctggg gttgactgtg tccattcggg 180 

ttattagcag ctaagaagcc cagacgagta gtgtgagctg ccttgggagc ctcagtgagg 240 

gcactgggac tggcctcact ctcttgcccc cagcctagtg ggctttctcc tctgtctctc 300 

<210> 398 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 398 

ctgaacccta aaggaaagcc agcaaaccag ctgcttgctc tcaggacttt ttgcaattgt 60 

tttgttggcc aggcaggaca aaaactcatg atgtcccaga gggaatcact gatgtcccat 120 

gcaatagaac tgaaatcagg gagcaataag aacattcaca ttgctctggc tacattggcc 180 

ctgaactatt ctgtttgttt tcataaagac cataacattg aagggaaagc ccaatgtttg 240 

tcactaatta gcacaatctt ggaagtagta caagacctag aagccacttt tagacttctt 300 

<210> 399 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 399 

gctgacctac agcagaagct gctggatgca gaaagtgaag acagaccaaa acaacgctgg 60 

gagaatattg ccaccattct ggaagccaag tgtgccctga aatatttgat tggagagctg 120 

gtctcctcca aaatacaggt cagcaaactt gaaagcagcc tgaaacagag caagaccagc 180 

tgtgctgaca tgcataagat gctgtttgag gaacgaaatc attttgccga gatagagaca 240

gagttacaag ctgagctggt cacaatggag caacagcacc aagagaaggt gctgtacctt 300 

<210> 400 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 400 

ggctagcgat ttctacctgc gctactacgt agggcacaag ggcaagtttg ggcaccgagt 60 

ttctggagtt cgaatttcgg ccggacggaa agcttagata tgccaacaac agcaattaca 120 

aaaatgatgt gatgatcaga aaagaggctt atgtgcacaa gagtgtaatg gaagaactga 180 

agagaattat tgatgacagt gaaattacaa aagaagatga tgctttgtgg cctccccctg 240 

atagggttgg ccgacaggag cttgaaattg taattggaga tgagcacata tcttttacca 300 

<210> 401 
<211> 300
<212> DNA 

<213> Homo sapiens 



<400> 401 



acccccttca tggacagatc ccccacagcc tggggcagaa gaggcgtcga gggcgccaga 60
agtggcggca gcagcagccg cagcagccaa agagaggcaa gagaaagaga aagcgggcgg 120
tggaggggtc ccggaagagc tggtccccgt ggttgagctg gtccccgtgg ttgaattgga 180
agaggccata gccccaggct cagaggccca gggcgctggg tctggtgggg acgcgggggt 240
gcccccaatg gtgcagctgc agcagtcacc actagggggt gatggagagg aagggggcca 300



<210> 402 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 402 

ggatcctttc cagacagaag accccttcaa atctgaccca tttaaaggag ctgacccctt 60

caaaggcgac ccgttccaga atgacccctt tgcagaacag cagacaactt caacagatcc 120
atttggaggg gaccctttca aagaaagtga cccattccgt ggctctgcca ctgacgactt 180

cttcaagaaa cagacaaaga atgacccatt tacctcggat ccattcacga aaaacccttc 240

cttaccttcg aagctcgacc cctttgaatc cagtgatccc ttttcatcct ccagtgtctc 300 

<210> 403 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 403 

aattccgttg ctgtcggaca gattgcccta gtacccaccc acctatcagg gttatgcaat 60 
ggaacatcct cgcccaagct cttggagaag gcaaagacaa ctttgtacag tgccctgttg 120 
aagcactcaa atgggaagaa aggaaatgtc tcatcctgga agaaatcctg gcctaccagc 180 
ctgatatatt gtgcctccaa gaggtggacc actattttga caccttccag ccactcctca 240 
gtagactagg ctatcaaggc acgtttttcc ccaaaccctg gtcaccttgt ctagatgtag 300 

<210> 404 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 404 

agtgggataa aatgagacga gccctggaat ataccattta caatcaggaa cttaacgaga 60 
ctcgtgccaa acttgatgag ctttctgcta agcgagagac tagtggagaa aaatccagac 120 
aattaagaga tgctcagcag gatgcaagag ataaaatgga ggatatcgaa cgccaagtta 180 
gagaattgaa aacaaaaatt tcagctatga aagaagaaaa agaacagctt agtgctgaaa 240
gataagagca gattaagcag aggactaagt tggagcttaa agccaaggat ttacaagatg 300 

<210> 405 

<211> 856 

<212> DNA 

<213> Homo sapiens 
<400> 405

tggtgccngt tcctattccg tgctntcgtn ctncnccagg ancnangcgt ntcgaattcg 60 

gcacgaggaa ggaggaccta ggcacacaca tatggtggcc acacccagga gggtagtggg 120 

gagttagatt tcagagtcca ggccctaggt tgggacccac tccaaataat ctcctcggtg 180 

tgggtggtgg ttctatagag ggataaatga ataataaaca ttgttaaaat atacgaaaaa 240 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaan aaaaaaaana aaaaaaaaaa anaanaaaaa 300 

aaaananaaa aatnaaaaan annanaaaaa aaaaaaaaaa aanncccctn cnccctaaaa 360 

nattcngggg ggntttttcc tccannccnn ntntttaata nnctncttnt tgnntcttng 420 

nctcaccnnt tcttttggtn ggcnntaana naaaatnttn nttttttttn ggntanaaat 480

ncnntnncng ttttttntnn ttttttttcn aaaccctcct nttntanctc ncgtntcnaa 540

aaanntnttt ntccncnncn nttnnntnnt nctntttcta tttttnnttc ttntncaann 600

ttccnangtg nnnngngtnt nntgnggctt gtttnttttt ncnncctngc gtcatccnnc 660 

caataatttc ttnncncccc nannccnnat tttttntnnc ctctatntnn gnngngnnat 720 

atnantcccc tttattnttn atnantagtc ntntnttttn ttntccntng tnatannatt 780 

ttntntcccn ntntaanttc ctcannnnat ttnntnnncn ncgngntata tttnangnta 840 

nntcnncggg gttnct 856 

<210> 406 
<211> 843 
<212> DNA 
<213> Homo sapiens 

<400> 406 

tnntnnnnnc gnangctggn nnnttctncc cntttctaat ngttnctaat actanggatn 60 

gtcacgaggn tcccangtag gcatagcgca ctgctgtacg tttttggttg tttttaagaa 120 

actcgatgaa gaggggtgtc attctgggct cggggtggtt gccaattttt caccagaaag 180 

ggagccaccc cttgcaacca cttctgtctc cgttagcccc ccctctgccc tcctccaagc 240 

caaagcgtgg cctggctttt gtcttcccat ttagttttcc tcttttaccc ttccttttgt 300 

gcttaattta ttaaaatagt tgctgtataa tttattttca taaactataa aaaaatacta 360 

aatggttaaa atagacttgc aggccaatct taaatggggt gggaggggtc tgagggtggg 420 

atggggaaag ggaaagaggt tttgatntaa acaaaacaaa tgcactttgg gtgtgtnnng 480

gnattttnct ggggatanan ggggtggggg nnnnngnann nnnnnnnnnn nnnnnnnnnn 540 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 600 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 660 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 720 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 780 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 840 

ncn 843 

<210> 407 
<211> 743 
<212> DNA 
<213> Homo sapiens 

<400> 407 

tgggcnggnn ctantngngg gctctcgact tcntacganc ttccaatggt tnggnggcac 60 

gagcccccat cttcactggt tattccactt atttaaaatg tccagaataa gcaaatctcc 120 

atatagagga agtagattag tggttgcttc gggatgggag gaatgggaag attgaggtct 180 

ttcttttgca gtgataaaaa tgtcctaaaa ttgactgtag cgatggtcac acaactctga 240 

atatgcttaa gaccattgaa ttacacactt tacgttggtg aattgtatgg tatgtaaatt 300 

atagttcaat aacatagtta caaaagataa tcaaaagcat gaaagcactg ttgatgtggt 360 
ttggatctgt gtcctcaccg agtctcatgt tgaaatgtaa gccccctggt gggaggcgat 420

gggattatgg ggcagagtcc tcacaaacgg tttacaccac ccgctcagtg ctggtctcct 480 

gatattgagt cctcatcaca tctggttgct tcaaagtgtg tggtgcctcc cctctatctc 540 

cctnctgctc tggccatata agatgtgcct gcttctcttc gccttctaac atgattgnaa 600 

gtttcctgag gcctncctag aacaaaactg ctgtgctttc tgnncccatc tacaggaccc 660

ggagccaatt naaccccttt tctttataaa aaaaaannnn nnnnnnnnnn nnnnnnnnnn 720

nnnnnnnnnn nnnnnnnnnn nnt 743

<210> 408 
<211> 746 
<212> DNA 

<213> Homo sapiens 
<400> 408 

tgtccgnttc ttangntgng ctctngcttc ctaggagtnt cnaatcgctt ggtgcagacc 60 

tccagcacaa gcctctgcta gtngatctca cggtagaaga aggtcaaaga ttaaaggtna 120 

tttttggttc acacactggt ttccatgtaa tngatgttga ttcaggaaac tcttatgata 180 

tctacatacc atctcatatt cagggcaata tcactcctca tgctattgtc atcttgccta 240 

aaacagatgg aatggaaatg cttgtttgct atgaggatga gggggtgtat gtaaacacct 300

atggccggat aactaaggat gtggtgctcc aatggggaga aatgcccacg tctgtggcct 360

acattcattc caatcagata atggctgggg cgagaaagct attgagatcc ggcagtggaa 420

caggacattt ggatggagta ttnatgcata agcgagctca aaggttaaag tttctatgtg 480

aaagaaatga taaggnattt tttgcattcc gtgcgatctg gaggaagtag cccaagtgtt 540 

tttcatgacc ctcaacagaa attccatgat gaacctggta accagaagaa ccccttggca 600 

cttatcttca tggcgttatt ctaatttaaa aagaacataa ctcatgngga cttatgccca 660 

gtctagaggc agaatcagaa ggcttgggtg gaacatatcg ntttcctttt tcctttcctt 720 

cggccctncc agnccagtcc atnttt 746 

<210> 409 
<211> 761 
<212> DNA 

<213> Homo sapiens 
<400> 409 

ggatccggtt tccaatgctc gggcnctcga gctncctaag annttgctaa tgcttggngg 60 

ngtgcctcgt gtgcgtggat tcgtgtgtgt gtgtgtgtct ngtatatgtg tgcgcagagt 120 

gcatcatttt cagactctac tatttccgtc aagtttctgt ttgatttgga tcatctcagg 180

atcggattct gttttagagt gtttctgggc caggatccgg gcccctgccc tcctctgcac 240

ctgaccacac tccctactca gggctagtct gttcttcccg gacatcttct ggtagccgtg 300 

caggagaggg ctgggtgggg cagagccagg aggggacctg gtgtgtcacc tgcccaccac 360 

ctggctcatc cctcangccc accctgaccc tacattacat aggttacgtc agcctactgt 420 

ggctgttgag caaagcattt ctcctttctn gggcctcatt gcactagatg ggcctgtggt 480 

cccaaagtag gtcagtaggt tggggttgct gacacccctt gggtgcaact ttgggacaag 540 

atgaantggc tctgtcctgt cactggcctc tccttgcctg ggggctatgt gcacttcaaa 600 

accctggcca agctcaagcc catgaagnat tggagaacac cctgggcccc caagaactgg 660 

angcacccgg ccanttcccc tgggattcca nctttgccan ggtgaaccct tcttttaccc 720

naaacttntg tccccctgnt tccacttcca aaaanaactg g 761 



<210> 410 
<211> 748 
<212> DNA 
<213> Homo sapiens
<400> 410 

gatgccggtt cctatgatgn gctctcggct tcctaggagt tccaanactn ggctngcncg 60 

aggncttnta aatatatctn ggntttanta ggtgataagt nctgtcantt agtancatct 120 

gaaaaancag ctttgtcctg ggtgaaaaag gatgccaaaa ttgcctggaa aagagcagtg 180 

anaggagtcc gggagatgtg tgatgcntgt gaagcancat tgtttancat tcactgggtc 240

tgccaaaaat gtggatttgt ggtctgctta gattgttnca aggcaaagga aaggaagagt 300 

tctagagata aagaactata tgcttggatg aagtgtgtga agggacagcc tcatgatcac 360 

aaacntttaa tgccaaccca aattatacct ggttctgttt tgacagatct tctagatgcc 420 

atgcacactc ttagggaaaa atatggtatt aaatcccatt gncattgtct aacaaacaga 480

atttacaagt tggaaatttt cctncatgaa tggtgtatct caagtttaca gaatgtctta 540

atcacagtat aaaattctct gngcatgcct gagtctcagc gccaaaatcc tcctccgaag 600 

tctgagaaaa atggtggcag cnnccccana aagtgatgtt nggcnccaga ttaccaggtt 660 

aacttcctcc agaatnccag tcaccactgn actggntagc anatcttgcc gagccaaaaa 720 

gcccnaagng ggaaaaaaaa aaaaaaaa 748 

<210> 411 
<211> 773 
<212> DNA 
<213> Homo sapiens 

<400> 411 

gnangnnngn ttcntagtgc ccgtgggagt cttagatncc ctaaaaaatt gntaatgctn 60

ggtcggcacg agtcaaggcc tacgaaacag gtgatgcact accccggcta cggttccccc 120

atgcctggca gctnggccat gggcccggtc acgaacaaaa cgggcctgga cgcctcgccc 180 

ntggccgcag atacctccta ctaccagggg gtgtactccc ggcccattat gaactcctct 240 

taagaagacg acggcttcag gcccggctaa ctttggcacc ccggatcgag gacaagtgag 300 

agagcaagtg ggggtcgaga ctttggggag acggtgttgc agagacgcaa gggagaagaa 360 

atccataaca cccccacccc aacaccccca agacagcaat cttcttcacc cgcttgcaac 420

ccgttccgtc ccaaacagag ggccacacag ataccccacg ttctatataa ggaggaaacc 480 

gggaaaagaa tataaagtta aaaaaaaagc ctccggtttc cactactgng tagacttcct 540

gcttcttcaa cacctgcaga ttctgatttt tttgttgttg gttgttctct ccattgctgn 600 

tggtgcangg aagtcttact taaaaaaaaa aaaattttgn gagtgactcg gtgtaaaacc 660

atgttanttt taacagaacc nanaagggtt gncctattgg ttaaaaaaaa aaaaaaaaaa 720 

aaacttngng cctttagaac tattanngag nccnatttac nttaatccan nct 773

<210> 412 
<211> 774 
<212> DNA 
<213> Homo sapiens 

<400> 412 

gnannccgga ttcntagcgn tcgtggaagt gcatcggctg ntaacaattt gctaatgctt 60

ggagttccaa ttccagagaa aatgagtgaa tgggcacctc gacctccccc agaaatttgt 120

ccgagatgtc atgggttcaa gtgctggggc cggcagtgga gagttccacg tgtacagaca 180

tctgcgccgg agagaatatc agcgacagga ctacatggat gccatggctg agaagcaaaa 240

attggatgca gagtttcaga aaagactgga aaagaataaa attgctgcag aggagcagac 300

cgcaaagcgc cggaagaagc gccagaagtt aaaagagaag aaattactgg caaagaagat 360

gaaacttgaa cagaagaaac aagaaggacc cggtcagccc aaggagcagg ggtccagcag 420

ctctgcggag gcatctggaa cagaggagga ggaggaagtg cccagtttca ccatggggcg 480
atgacaatgt ttgccacagc ctctgcctgg aacctggctc gtgctgtgac cagaagggaa 540

aggcggctgt ttggctcttt cttccccgca aggaccccgc ttacccgctg gatggagagc 600 

aaaggagacc cccttccgag cccgntcaca gtcctgtatt tggcaaggtt tgggaacctg 660 

aaggggccaa tntnccttga cacttanang cacttgcctt tcagacacca ttccgngcnt 720 

ctggtaaaag gggacaagaa aagccttaac cttggcnnca tattttgaca gggg 774 

<210> 413 

<211> 773 

<212> DNA 

<213> Homo sapiens 



<400> 413 

gnngnnnnnn tttctaatgc ttgggnnnnn ngtcnatgcn taagagccan gcggntcgaa 60 

ttcggcacga ggcgggcccg gccagcggaa gcccctgcgc ccgcgccatg tcaaagaaaa 120 

aaaggactga gtgcagaaga aaagagaact cgcntgatgg aaatattttc tgaaacaaaa 180 

gatgtatttc anttaaaaga cttggagaag attgctccca aagagaaagg ctttactgct 240 

atgtcagtaa aagaagtcct tcaaagctta gttgatgatg gtatggttga ctgtgagagg 300 

atcggaactt ctaattatta ttgggctttt ccaagtaaag ctcttcatgc aaggaaacat 360 

aagttggagg ttctggaatc tcagttgtct gagggaagtc aaaagcatgc aagcctacag 420 

aaaagcattg agaaagctaa aattggccga tgttgaaacg gaagagcgac caggcttagc 480 

aaaagacttt cttcacttcg agaccaaang ggaacagcta aaggcagaag tagaaaaaat 540 

ncaaagactg tgatcccgca agttgtngga agaaatcgcc aagcaaatna agtagcccaa 600 

ggaactgctt acagatggac tgattacata ttcgcaataa aatcttnggc ccaaagaaaa 660 

atttngggtt tgaaggaaaa ttaaattggt tngaaccttt tggaatttcc cgaaagactt 720 

ttgcctncnt ngacttaaaa tatttccatg gnggtgaaag gttgtccaan ctt 773 



<210> 414 

<211> 755 

<212> DNA 

<213> Homo sapiens 



<400> 414 

gnagnnnnnn nttctaatgc ttggggnnnn nngtcaatnc ctnngancna ggcggntcgc 60 

tcatccagaa angtcagatc ancaaagaag tccangaaaa antgcgaccc agctngaccn 120 

tttgatccca ggcttagcac acgattgcat ggcntcccct ttagccactt naaccactgc 180 

agacntccag gaagctggac tctctcctca gtccntccag acttctggcc accacagant 240

gaaaacccca ttttcaactg agctatcttt gctccagcct gatactccag actgtgctgg 300 

agatagtcat accccactgg ctttttcctt caccgaggac ttggaaagtt cttgtttgct 360 

agaccgaaag gaagaaaaag gggattctgc caggaaatgg gaatggcttc atgagtctaa 420 

gaagactatc agagtatgga gaaacacacc aaactacctg gggacaaatg ctgtcagccc 480 

ttaggcaaga ctaaattgga aagaaaggtg tctgccaaag aaaacaggca ggcccctgtc 540

ctccttcaaa catacaggga atcctggaat ggagaaaaca tagaatcagt gaaacaaacc 600 

cgtagtccag ttctgngttt tcctgggata tgaaaagaat gaccanggac tnctggagtc 660 

aacttttcac ttgaagaatc tcaaggccac cggtcattgg ccacacactn gaactccttt 720 

ttaagatgta cccattactg gaattgggct taggg 755 



<210> 415 

<211> 852 

<212> DNA 

<213> Homo sapiens 
<400> 415

gnagnaannn ttctaatgct tgggnnnnnn ngtcaaacct tannaacctg gcntgncgaa 60 

ttcggcacga ggtcacaggc aggtttantg gccagtttaa aacttatgct ntctgcgggg 120 

ccnttcgtag gatggtgagt gtttccctgg gctttgctca tcacttcggg acatcgtgga 180 

ctttaccgtg cgcnttggag tgtgtgatgg tgcctgagta gatctgctgg cagagtagtt 240 

tgagccagct ggactgggct ggccgcctgc cgcttcttga gggtggaaga ggggtgctct 300 

gagaagacac tcaggcagca gactctgcct ctcactagga ggtgcccccc cgaccccgtt 360 

ccaccatagt caaggctgca ggctgccccg ggagaagtgg ctccccttct tgcgcctgtc 420 

ttccattcgc ttcaccgggg gganaagacn ttgggcttgg ttggcacagc ntgacccttc 480 

tgcccatctt naaggcagnc ccggaantgg gaaaaatatt tctttaaatg gtggcctttn 540 

nttttttttt nctttnaaag gggttgaagt tccannaatg natttcccaa tttccttccc 600 

gaattgggnc ccaaagggcc ccaatggggc antcggtcct ttaaaaagna acctttttgg 660 

acctgggaag aagaaaatca cccagattgt tgggaaatat tttggncatt aaaataaant 720 

aatggaaaac ctnaaaaaaa aaaaaaaaaa aaaactcgag cccnttaaaa acttttagtg 780 

agtcnnatta ccnttanatc canacnttga tangaanctt tggataattt tgggncaaac 840 

cnnaacttng at 852 



<210> 416 
<211> 754 
<212> DNA 
<213> Homo sapiens 



<400> 416 

ggnnnnnnng tnaaaccttc cnaannaggc tnggcgtcac tgnccccggt caacaaaccc 60 

acttttatga cagttttctt ccgcagcttg gctnttaaat tttactggca ggtgtatggt 120 

tgttggaggg ttcctagtga gttgggggac ctggcantan agctgcttgg ttggaggaag 180 

tgaanctggc ttantaccag cagctgatct cttccacgtg ctgctgcttt ttttgccact 240 

ctgatactaa accagagaaa gctgcaggtg gataaagaag ctgtggctgt tttttgcttt 300 

tgggtggcaa tgagaaagag tcacagtgtg ggttaaaggg atctgcagtg gggccaagga 360 

tgccacccca ccctcagctg tangcaagct tgcacataaa taacccccgt cagtggagtg 420 

ttcgggatgc agggggcant atagtgttct tggactttgt ccgtcctggg gcagttttta 480

agttctttat atttaagtgg ggtcagtgcc aagtgctacc actttcccaa taaangaatg 540 

ggggacccan aaggctgggg tccctggcta ccttgttatg aaggttttgn tntttctctg 600 

acaaganttg ctttggaaag ancctgtttt taggggatta ttttttgnat accccgatgg 660 

gganccaggg ttctnctcaa aacccttaca acccttagga tcatagggaa aaggggcccn 720 

tnttttnctg ctggcttncc caacttaaaa acnt 754 



<210> 417 

<211> 755 

<212> DNA 

<213> Homo sapiens 



<400> 417 

ngtntatagc ttnctaatgc ttcntancga attcggancg agagaagccn tgagcagcaa 60 

agtctntcgc gacaccctgt acgaggcggt gcgggaagtc ctgcacggga nccagcgcaa 120 

gcgccgcaag ttcctggaaa cggtggagtt gcagatcagc ttgaagaact ntgatcccca 180 

naaggacaag cgcttttcgg gcaccgtcag gcttaagtcc actccccgcc ctaagttctc 240 

tgtgtgtgtc ctgggggacc agcagcactg tgacgaggct aaggccgtgg atatccccca 300

catggacatc gaggcgctga aaaaactcaa caggaataaa aactggtcaa gaagcttggc 360 

caagaagtat gatgcgtttt tggcctcaga gtcttttgat caagcagatt ccacgaatcc 420

tcggcccagg tttaaataag gcaggaaagt tccctttcct gtnacacaca acgaaacatg 480 

gtggccaaag tggatgangt gaagtncaca atcaagttnc aaatgaagaa ggtgttatgt 540

ctggctgtan cttgttggtc acgttgaaga tgacnngacg atgaancttg gggtataaca 600

ttcacctggc tgtcaacttc ttggnggtca attgcntcaa agaaaaaact tgggcagaaa 660

tgttccnggc cttatnttnt caagaaccnc catggggcna agccccaacg ccctttnttt 720

aaaggcncat ttggaattaa attcntnttt ncccg 755



<210> 418 
<211> 757 
<212> DNA 
<213> Homo sapiens 

<400> 418 

tggggnntnn nttctaatgc tgggatgttc taaangntgg gctactcgtt ctttccgcag 60 

gancccntcg attcgaattc ggcacgagga aaggtggcgc gcttctcacg gctgagttgc 120 

tgcgcctgca gacggaagct ccccacaggc agagctgctt ggatgtgtga gtcatgaagc 180 

cagagaagcc ccgctccatg agcagtgact ccccaggccc tgtgacctcc ctcctgtctt 240

gcagctcctc ctggcaccag tccccagggc tctcctgttg gtagttcctg cttttcttct 300 

tggaaattcc tcgtggacct cgagatcttt accctaaaat agttctgttg aatttcaccc 360 

tggcaatgta aattgatagc ttatcttcac agatgccaga caatggacaa ctcaccatca 420

gtcctctgct cacctgagac aaatgcatgt ctgattgctt cctctgccct attgnttatg 480 

tgaaaatgca gattcactga gccagactaa ggcatcagtg actgttcctc tactgcctct 540 

cacatggaga ttgtgtattc agtgaaaggc tgatcaaaga ccccaaagga atgcaccagt 600 

ttatctctta tctacctatg acctgcgagc tgnccaccac ccccagttgt tgcgcctttc 660 

cagacagaac cagtgtcatc ttacacgtat taattggatg tcctgngnct tccttaatat 720 

gtatcaaaac aagctngcct tgaacacctt gggcacn 757 

<210> 419 
<211> 738 
<212> DNA 

<213> Homo sapiens 

<400> 419 

gnnngncgtt cnaattncgn ggnntctttc tngccnanna nnannngcgt gngngaattc 60 

ggcacgagac tgttcatcct aagttccact ataaacaggc tcatgactcg ggcacagaca 120 

cttcttgcgt gactttttcc tatgatggta atgtccttgc ctctcgtgga ggtgacgatt 180 

cattaaaatt atgggacatc cgacaattta ataaaccact tttttcagcc tcgggtcttc 240 

ccaccatgtt cccaatgact gactgctgtt tcagtccaga tgataagctc atagtcactg 300 

gtacatctat tcaaagagga tgtggcagcg gcaaacttgt tttctttgag cgtaggactt 360 

tccaaagggt gtatgaaata gacatcacag atgcgagtgt tgttcgctgc ctgtggcatc 420 

caaagctgaa ccagatcatg gttggaactg gaaatggatt ggctaaagtc tattacgacc 480 

ccaacaagag tcagagggga gcaaaattat gtgtggttaa aacccancgg aaggcaaaac 540 

aagctgagac tctactcagg actacatcat cacccctcat gccttgccta tgttcccgtg 600 

agccccgnca acggagtaca aaggaaacag ctggagaagg acagactgga tccctgaagt 660 

cgcattaacc tgaacctcct gtancangcc cangtcgtgg tggccgattt ggaacccacg 720 

ggggcactnt tttttcct 738 

<210> 420 
<211> 739 
<212> DNA 
<213> Homo sapiens 
<400> 420

gcgntrmtat tagcgtgggc tcgntctcgc tcnacncanc nngngctggn cgaattcggt 60 

acgagaatca gaggaggctt cttcatcctt caactccatg atgaactcct atatgaagtg 120 

gcagaagaag atgttgttca ggtagctcag attgtcaaga atgaaatgga aagtgctgta 180 

aaactgtctg tgaaattgaa agtgaaagtg aaaataggcg ccagctgggg agagctaaag 240 

gactttgatg tgtaactgtg ctgttgatga agtcctccca gggaagcctg tgcagatgca 300 

gtcacctgga aagaacagag attccctttc acctacctca gcaaaacaaa ctttcaagtc 360 

ttgatagact tagcctagta attttatagt gagagtttca aactatatat caagtgtcta 420 

tagcatcaaa aacttctggg ggcgtggggg aaagtagaat accaagtata atagttacat 480

tcactttcaa agagcatcta tgaatttgcc ttttgtaact tactgtggct ttaaacatat 540 

tcagaacaga tgcttgaaat atgcacttag cactttggtt ccacatctgt ctgggtaaac 600 

catgaagaaa atgaagctgc tgcctcaatc gancccagac agcagccata ggcagataaa 660 

gatttnggtt cacccttggt ggtgggaggc atcgtgtgtg cctttttttc ctctaatatc 720 

aattttacag tccgggaan 739 

<210> 421 
<211> 727 
<212> DNA 
<213> Homo sapiens 

<400> 421 

gtgatctttn tgagtggggg ccntnctngc tctannanat aggttnggng ggctagcgat 60 
ttctacctgc gctactacgt agggcacaag ggcaagtttg ggcacgagtt tctggagttc 120

gaatttcggc ccggacggaa agcttagata tgccaacaac agcaattaca aaaatgatgt 180 

gatgatcaga aaagagctta tgtgcacaag agtgtaatgg aagaactgaa gagaattatt 240 

gatgacagtg aaattacaaa agaagatgat gctttgtggc ctcccctgat agggttggcc 300 

gacaggagct tgaaattgta attggagatg agcacatatc ttttaccaca tcaaaaatag 360 

gttctcttat tgatgtaaat caagtcaaag gatcctgaag gccttcgagt attttactat 420 

ttggtacaag acttgaaatg tttagttttc agtcttattg gattacactt caagattaaa 480 

ccaatttaaa ttgtatgttt tcaagctggt tgnatattta attaaaggga tgggaagggg 540 

ttatttgtca tttacagtat tggggtttta tgaatgtgaa gcaaccaaaa aaaattnnaa 600 

tgtaaaactg gaaaatagga aaattcatta ncagcttaat gggtatcctt acttgatncn 660 

ctgsgtttgg aagtccccac acacattaaa tctgtaatga aancnctttt ggttaaaatt 720 

tctctat 727 

<210> 422 
<211> 753 
<212> DNA 
<213> Homo sapiens 

<400> 422 

gtntngnnng nngttnnatt atatggntcg nctnnctcna nnancnangc ttgngctgac 60 

aacttgattg ggttctcctt caggtttgaa gcgccctcna gaagtgtcta aaggagacag 120 

ttgatagcca aacaacagtt ttggattcac tgactgatta tgaaagaagc agtagactgg 180 

tatcaagaat cagtcagcaa ggaggccctc accagacgcc agtgccatgt tcttggactt 240 

ctcagcctcc atattcatga actaagtttt tggaatcctt aggcttccac gtgtggaaag 300 

cctgagctaa cctactggag gatgagccat cacctggagc agattcaggc catcctagtt 360 

gaagcctccc taggccaagc aaccgtccaa ctaccagaca ttgaccattc agccttgaac 420 

attcagcaca aagacaaaac agaccagacc agaagagtcc cacagaatag gggaaactat 480 

tcagagaaaa cttaagccac taagttttat ggtgttttgt tcttgtagcc agaagcatag 540 

gcatactggc caatacaaac cgaaatcctt ctaacgtant ggaccctttt caggccagca 600 



ttttttccct tgaaaacctg ggagccttgt attccatctt attagcagaa gatcactttc 660 
accaatggtt tgggctcttg atttggaatt gatgatgtaa tgagcctnta ttcnanatgn 720 
gacttaatac ctctgcgaat tgactggatt ccn 753 



<210> 423 
<211> 844
<212> DNA 

<213> Homo sapiens 



<400> 423 

nggnnnttnn nnnnnatncc ntgatcgtgt ntcgttcttt ctncaggatn nnntcgtttc 60 

gaattcggca cgaggaaaag ggagccgcgc agngcctacg ggagtncggc ggcagcagcc 120 

ggtaccggca accacgggca gctctcaggg aatctccgtc gttgaggcca naggctccag 180

tccccgcgag tccagatgcc tgtccagcct ccaagcaaag acacagaaga gatggaagca 240 

gagggtgatt ctgctgctga gatgaatggg gaggaggaag agagtgagga ggagcgganc 300 

ggcagccaga cagagtcaga agaggagagc tccgagatgg atgatgagga ctatgagcga 360 

cgccgcancn agtgtttcag tnagatgctg gacctggaga agcagttctc ggaagctaaa 420 

nggagaagtt gttcaaggga acgacttgan tcanctgccg gnttgcggct tggaaggaaa 480 

nfc gggggggc ttgaanaaga agcccctgga atnccaccgg aagccccctt ttgggggggg 540 

gccttgcaaa ccgggaancc ctttnaaagg aatttcngcc antttcaang gttgggccaa 600 

ggggaatcnt accnaagggg ccttctnggc cttggnatgg tgaatccang gnaaattaag 660 

gtncccaatt gntgaancct tccaanggga ancccaaacc agcacccttg naanaagttg 720 

agaaaacttg cttgcntctt ntgacacccc tncnaggggg aacttcaagg aaccggttcc 780 

tnaggcttgg aaggaggacc cccananccc tggancctaa attnttaaat gggtnggacc 840 

accn 844 



<210> 424 

<211> 799 

<212> DNA 

<213> Homo sapiens 



<400> 424 

ggagnnnngn ntccnaattn nntgggnnnn nnngtcaaan nctngctact cgttctttcc 60 

gcaggatccc atgcgattcg aattcggcac gagcccagac ctatggagtc agacagtagg 120 

tttgaggccc agcaatctat ggtttaacaa gccatccagg tgtttctgat gcacagtgaa 180 

attggggtac cactggtatt aggtttggta tggcaacttt ttcatcactt gttttatgta 240

gttgtctgat caattgtgaa aacataatga atgttggaaa tggaacagta aaataacgaa 300 

agccaacttt tttttttttt tttgagacgg agtcttgctc tgtcgcccag gctggagtgc 360 

agtggcgcga tctcggctca ctgcaagctc cgcctcctgg gttcacgcca ttctcctgcc 420 

tcagcctccc gagtagctgg gactacaggc gcccgncacc acgcccggct aattttttgn 480 

atttttagta gagacggggt ttcaccgtgt tagccaggat ggtctcgatc tcctgacctc 540 

gtgatccacc cgcctngggc ttccaaagtg ctgggattac aggcgtgagc caccgggccc 600 

gggccaaaag ccaactcttt atgcctagaa aatattgtgc accctatgac ccaagcccat 660 

tgaatttttn cngggaaatt tatggtaaat tattgaaatg gatggtacct ttaaaaagtt 720 

atttggcaca ttccccttgg gttacctttg gnatggtttg ccagggaatt naaaactttg 780 

ggntnaaacc ttttttann 799 



<210> 425 

<211> 750 

<212> DNA 

<213> Homo sapiens 
<400> 425

gangccggat tccaattntc nggctnctct naaannctgt ntaatgcttg gtccgcanga 60

ncccatgcga ttcgtggagg tctcctttcg ccccagccca ggtggccaag cccatcctgg 120

cctcagaaca tgctgagcac attttgtagg gtggcacctt tttatccaag ttactagcta 180

cacatcagtg tttaaagaga aaaaagtgac ctttcatttt tttttcttga aacttgagga 240

aacaagatac atactactga tttttttttt cttaaaacta aatgcatgac tgcagagcgg 300

tagaggtgta tatttttcat actgtggggc aaagtatttg tgctgctttt tggagatgga 360

ctggaacgtc tggtttctgt ccccgggccc ggcagctacg tctattttct gtagaaggtg 420

ccacagtgag acctggagcc accccttcct gcctggcgcc gtttanagct gggagcccgt 480

ggactccggc ctgtttctac cttctattca accactctga cgtggggaga caagaagaaa 540

tagaactttt tgatagtgtg gtaaaaacat tggattttga actattttag taaaaggagt 600

taccaacaag aatgtnatag gtgctacttt gagctagata aataaaggct ctttgtgagc 660

ctcctgaaaa aaaaaaantt nnnnnnnnnn atnannnnnn annaaaaaaa ctggnccttt 720

aaaactttan gggncgttta cctanaccct 750



<210> 426 
<211> 819 
<212> DNA 

<213> Homo sapiens 
<400> 426 

gnagnncggn ttcttatgat cgtggctnct cntctanngg ttgtgtaatg ctnggtcnnc 60 

angannnnnt gcganncgaa ttcggcacga aggggggttc ccaatagtag aaaagggtcc 120 

ccattcctgc tcagcaccgc acctctctac ccccccacag acacacatgc agacacacac 180 

atgcagacaa cacgcagaca cacacatgca ggcactcaca tgcaggccca tgcacacaca 240 

cgtgcacaca catgcagaga catgcagaca cgcaggcaca catgcacaca tgcaaagaca 300 

cgcatgcagg cacacgcaga cgcacacaga gacacacatg cagatcacat gcacacacac 360 

atacacacac tggcccctgt ttttctgtgg tgtcactggg tgccagcaac tcggtatctn 420 

ccaccttcca ctaaaacctg ggccttaatt tctctcccgt ccccacccct aaattcctga 480 

tggatgaacc tagagctgtc ctgtccactc caggccggac tgacgtancc tatgggccca 540 

gcaggtccag ggcccacgtt ttaatttctt tttnaaaagc tttaggtctt ggccnggccg 600 

ccggtggttc acgccttggg agttcccagc atttttnggg aaggccnaag gccgggttgg 660 

attcacaaag gtcaagcaag tttcaaggaa ccaagccttg aaccaggcca ttgggtgagg 720 

aaccctgggc ttnttactng ggnaaattcc caaaaaaaaa ttggccttgg gccnaagggt 780 

gggcaagggc acccttgttg gggtccccaa antttacct 819 

<210> 427 
<211> 750 
<212> DNA 

<213> Homo sapiens 
<400> 427 

gagnnngatt cnaattnctg ggctnctctc ttnntatnta atgctgggtc cgcangancc 60 

nntgcgattc gaattcggca cgaggtccaa ggacaacttc gagacatttc tttttgccac 120

cgtatctaac agggagcagg aagatctctg ccgaggaatt gtccagctct gcttcaatga 180 

gcaaagccaa cagctgctag cagaggtcca gccctctgac tctttcctca tggtagagac 240

aactgcatac tttgaggcct acaggcacgt cctggaagga ctccaggagg tccaggagga 300 

agatgttccc ttccagagga atatcgtgga gtgtaactct catgtgaagg agccaaggta 360 

cttgctaatg gggggcagat atgactttac ccccttaata gagaatcctt cagccactgg 420 

ggaatttcta agaaatgtcg agggtttgag acatcccaga attaatgtct tagatcctgg 480 

ccagtggccc tcaaaagaag ccctgaactg gatgactcca gatggaagcc ttgcagtttg 540 

ctctcacaag ggaactggct attattcaag gaccttctgg aacaggcnaa acctatgtgg 600

gtctnaaaaa ttgttcaagc ccttctacca acgagtcttg tttggcaaaa ttaaccttca 660 

gaaattccca tcttggttgn gtgtatacta atcatgcttt ggaccanttc tggaangctt 720 

ttccattgtc agaaaaccan atttggccgg 750 

<210> 428 
<211> 943 
<212> DNA 
<213> Homo sapiens 

<400> 428 

gnngnccggt ttcctattct cnggcanctc tcttcctncn acctattanc tggactctaa 60 

anaaaagnnt gnngcggttg gctcaagggc caccanaaca tttctttatt attattattt 120

tttaacctgn acatgcntta aagggtctat tacctttctt tccgtctgtc tcaacagctg 180 

aaatggggcc nccaaggagt gccttccttt tgctccctcc tactgggact gacggntggn 240 

antgtntgnn cccanntggg ggtgtctcct gnctgggaag ganggaaagg gaggcanagt 300 

tttgccgggg ttgcanntng acancangct gnanaggana tggctaataa ctgtttaatg 360 

gaaacctgct tgggcttgga nggaacttag nctgaatttt cccgacttcc tctgccagtt 420 

attgacacan tctctttnta agacangaaa taaactaaac cccaccccaa ggnantnatn 480 

ncangcngaa aacnncncat ngcccacatt ncctnatccc ntancaccnn ctcnttnttt 540 

nncccaanac tncttcccan ntntcnccnt ttacccntan ncntnnttnt atcccnctaa 600 

tncctnannn ccntnnttnt ccnnatnctt acncncncnn ntnnnncccn nntctttnnn 660 

cccaaanctn ncctcncnnt tcnnctnaac cntntnnnca nnanacaccc ttctnatnnc 720

ccannntctn cacnntnnnt ntctccnnnt nnncnccnnn ntcntnnnna nancntntnn 780 

nanancnatc tnntncncnn cnantnnnnt tcanttcacn ctctnnnnnn tntanccnat 840 

tnnccntcnn tnnccnntta nnncntnnnn nncnaantnn nnnnanctct ncnnncnnct 900 

canntnnnnt nnnnnncnnt cnanaccntn nnnntctatn ccc 943 

<210> 429 
<211> 775 
<212> DNA 
<213> Homo sapiens 

<400> 429 

gnangnnnnn nntttctaan tncttgggnn nnnngtcann gattnngcta aaggttngga 60

tcncncgcag naangctgtg gcgctccatt gtgaaagatc caggcatttt tccgagccag 120 

gaaaagccca agatgactac aggatattag tgcatgcacc ccaccctcct ctcagtgtgg 180 

tacgcagatt tgcccatctc ttgaatcaaa gccagcaaga cttctctgct gctgtgatct 240 

gcacaccctc caacctgggc agggactggg gggatgcagt gtgtgttagt gcccatgtgg 300 

cattgtggca ctgttgcccc ccatggcggc atgggcaaga tgaccttcca ttagcttcaa 360 

gtcttgttct cttgtctgtg gtctgtttaa tatgtgggtc actagggtat ttattctttc 420 

tcccatcctt acactctgga tcattgtgca gacttaatca gggttttaac gctttcattn 480 

nnnnnnnnnt ttttttgagc tcaaagaaag ttctcatttt ccctattcaa ctaataccca 540 

tgccgngttt tttaccttgg atttaaaggc accttangtt ggggcaacag attctcactc 600 

atgtttaana cctggnattc ancttcataa gaccaaagan ggagctttcc ctttctcttt 660 

acccctnagg attctcatcc tttacanntn gactttttcc aggccaattt cccatnnaat 720 

ctgcnanncc cngccttttg ncccaagctt ttntgntngn ccccccattt acccn 775 



<210> 430 
<211> 763 
<212> DNA 
<213> Homo sapiens



<400> 430 

ngggtgnnnn nntttctaat nctggggnnc nntnnncnnn ntttcctaat ncttaggngc 60 

tcgttctttc tccangcagn nnngcgtttc gcgacagctc tccaatactc aggttaatgc 120 

tgaaaaatca tccaagacag ttattgcaag agtttaattt ttgaaaactg gctactgctc 180 

tgtgtttaca gacgtgtgca gttgtaggca tgtagctaca ggacattttt aagggcccag 240

gatcgttttt tcccagggca agcagaagag aaaatgttgt atatgtcttt tacccggcac 300 

attccccttg cctaaataca agggctggag tctgcacggg acctattaga gtattttcca 360 

caatgatgat gatttcagca gggatgacgt catcatcaca ttcagggcta ttttttcccc 420 

cacaaaccca agggcagggg ccactcttag ctaaatccct ccccgtgact gcaatagaac 480 

cctctgggga gctcangaag gggtgtgctg agttctataa tataagctgc catatatttt 540 

gtagacaagt atggctcctc cgtatctcct cttcctagga gaggagtgtg aacaaggagc 600 

ttagataaga caccccttaa acccattccc ttttccagga gacctaccct tcacaggcac 660 

aggtccccaa atgagaagtc tgctacctca tttctcatct ttttactaaa ctcaaangca 720 

ntgacagcag tcagggacag acattcattt cttnatacct tcc 763

<210> 431 
<211> 761 
<212> DNA 
<213> Homo sapiens 



<400> 431 

tggtgttnnn ntcctaatgc ttggnngnnn ggtnannctt ctaattactt tggggctcgt 60 

tctntctcna cnnngcnngg cgttncgaat tcggcacgag cttgaagcgc tggtttttct 120

cgaagcaatc cttattatat tgttaaacaa ggaaagatca accagatggc aacagcacca 180 

gattctcaga gattaaagct attaagagaa gtagctggta ctagagtgta tgacgaacga 240

aaggaagaaa gcatctcctt aatgaaagaa acagagggca aacgggaaaa aatcaatgag 300 

ttgttaaaat acattgaaga gagattacat actctagagg aagaaaagga agaactagct 360 

cagtatcaga agtgggataa aatgagacga gccctggaat ataccattta caatcaggaa 420

cttaacgaga ctcgtgccaa acttgatgag ctttctgcta agcgagagac tagtggagaa 480

aaatccagac aattaagaga tgctcancag gatgcaagag ataaaatgga ggatatcgaa 540

cgccaagtta gagaattgaa aacaaaaatt tcagctatga aagaagaaaa agaacagctt 600 

aatgctgaaa gacaagaacn gattaagcag aggactaant tggagcttaa agcccaagat 660

ttacaagatg aactaccggc aatagtgaac aaaggaaacc gttttttaaa agaaangccn 720

aanctgcttg aaaaaaaaaa aaaaaaactc ggcctntaan t 761

<210> 432 
<211> 748 
<212> DNA 

<213> Homo sapiens 



<400> 432 

gnngantnng tcttattatc gtggngctct nactnnctct aaatanaatt gtgttgnggg 60 

aattcggcac gaggccaccg aagcttcagg atgacatctt agactctctt ggtcagggga 120

tcaatgagtt aaagactgca gaacaaatca acgagcatgt ttcaggcccc tttgtgcagt 180

tctttgtcaa gattgtgggc cattatgctt cctatatcaa gcgggaagca aatgggcaag 240

gccacttcca agaaagatcc ttctgtaagg ctctgacctc caagaccaac cgccgatttg 300 

tgaagaagtt tgtgaagaca cagctcttct cacttttcat ccaggaagcc gagaagagca 360 

agaatcctcc tgcaggctat ttccaacaga aaatacttga atatgaggaa cagaagaaac 420 

agaagaaacc aagggaaaaa actgtgaaat aagagctgtg gtgaataaga atgactagag 480 

ctacacacca tttctggact tcagcccctg ccagtgtggc aggatcagca aaactgtcag 540

cttccaaaat ccatatcctc actctgagtc ttggtatcca ggtatttgtt tcaaactggt 600

gtctgagatt tggatccctg gnattggatt tcttaaggac ttttggangg ctcttgacac 660

catgcttcac agaacttggg cttcanaagc ttcanttttt tgcanaggtg ccccaggtta 720

ggaaaacagt tntncttgtt ttgtannt 748



<210> 433 
<211> 769 
<212> DNA 
<213> Homo sapiens 

<400> 433 

gggnaaaagt ttnnnannng ggnagnnnng ntnnaccntt cctattactt tggagctcga 60 

actcgcncca canannnagt gncntgngct gttttgcaga tgaggaaaac tgaggtacag 120 

aattcttagg gaacttaccc aaaatggctt ttctgcactc tgccctttgg tattgtccca 180 

tgtgaattgt ttaaaactta tgtgtatagt ggcatgagta ggtgatttca gaaacagaac 240 

tcacttttgt tgtttggtct taaaattagg aacttttctt catctgggct tcatttccct 300 

gcaccttccc agctttctag tcatgcaagc cacatgtctc cacgtgaggg gttcattgga 360 

aagcagccac agagccaccc cctggctggg ttcttcccca gctctgcttc ctccttcccc 420

aagtcctgca gctgctctct ccatggcaga accacttctc cccttactgg aggggaggtc 480

cactgaacaa atccaggaga ggaatcattg tgttttccac agaagagaaa gtacactgga 540 

ctttctgtgc aacctgttac tacattttca caganactca tatttgtgca ntgtaactca 600 

atttgaaacc cagcaaaatt aggctcccgt gtctccataa aaggccacca tgatggtaac 660 

cgttggactt caccttgtgt ttnggacana ngctgattgg attttaccca tcatcacanc 720 

cgtgtcttac attctcnttt cctgggcttt ggacccctgn tanaaaaan 769 

<210> 434 
<211> 764 
<212> DNA 
<213> Homo sapiens 

<400> 434 

ctanccttcc taaannctng gctactcgnt ctttctnnan ganncnnntg cgatncgaat 60 

tcggcacgag caccttgcct ggccaagggg ctagacctcc caggctaagc ctcagattca 120 

gtgcaggaca caagctcatg cccccgtctt gccagtgaca cttgaagcct cccgacttcc 180 

acagagtgct tcaggacaca ttttgagtgg tattttcttt tctttttttc ttcttttttt 240 

tttttgagat ggagtctcgt tctgttgccc aggctggagt gcagtggcct gatctcggct 300 

cactgcaacc tctgcctccc aggttcaagc gattcttctg cctcagcctc cagagtagct 360 

gggactatag acatgcacca ccacgcccgg ctaattttgt atttttggtc gagacggggt 420 

tttgccatgt tagtcangct ggtcttgaac tnctgacctc aagtgatcca ccactcggcc 480 

tccaaagtgt tgagatgaca ggcacgagcc accagcccaa cctgagtggt attttcttta 540 

gggaccangt agactttaaa acgagggtaa gagaaaaagc ccagtggtct tttctgangg 600 

taaataaatt tctgcccagg aaacnttncc aagccccaac cagcaagcca acccttaaaa 660 

aaaaaatcac ttcgtgttcc ccaangggan ctttnttaaa gctttggggg cttccaggna 720 

aaatcatttc cagtnnaant ttggaagaat tcannagnat ttnt 764 

<210> 435 
<211> 755 
<212> DNA 

<213> Homo sapiens 



<400> 435 



gnimnntttc taatgtggnn nngnnngnta annttctaaa ncttgggntc tcgttctttc 60

tncagatncc ntcgattcga attcggcacg agggatcctt tccagacaga agaccccttc 120

aaatctgacc catttaaagg agctgacccc ttcaaaggcg acccgttcca gaatgacccc 180

tttgcagaac agcagacaac ttcaacagat ccatttggag gggacccttt caaagaaagt 240

gacccattcc gtggctctgc cactgacgac ttcttcaaga aacagacaaa gaatgaccca 300

tttacctcgg atccattcac gaaaaaccct tccttacctt cgaagctcga cccctttgaa 360

tccagtgatc ccttttcatc ctccagtgtc tcctcaaaag gatcagatcc ctttggaacc 420

ttagatccct tcggaagtgg gtccttcaat agtgctgaag gctttgccga cttcagccag 480

atgtccaagg gtgcctgggg aagagccact gcgcatgtta tctttggtgt tactccagtg 540

ttgaacanag agctggtcag aggcagtgca tcgcanagag acattaataa gggaatcctt 600

tgaatcccta ancagcanca gctttnctga nggggccnat gatgccagtg acctnttcan 660

ggnaagtctg ggacattggg accaccctgg ggggaagaac ttgtgggatg tggcttttct 720

tttatgaata aagtactttg agttggttgn aatcn 755



<210> 436 
<211> 760 
<212> DNA 

<213> Homo sapiens 
<400> 436 

aaggctggnn nnngnnntgc nnnncttcnt attantctgg gggctcgtnc tctctcnann 60 

nagnnaggcg ntgngaattc ggcacgagct caagaaaagg agaaagtttt tttgtatgaa 120

attggaggaa atattgggga acgctgcctt gatgatgaca cttacatgaa ggatttatat 180 

cagcttaacc caaatgctga gtgggttata aagtcaaagc cattgtagaa gacttaacaa 240

gctgcagata accatgtgga cttctgtcat aattcttgct gagtcaagag tgtaaataaa 300 

agaaatggca ggactcatat tattcagttg tcccaagtat ttaaaaatga ctctcttaag 360 

ccttaaaaag tcatagattt gtgctgctgc cagaattata ttaattatta ttaatggtat 420 

tattagaaaa aaaatttctg gagtgagagt naagangctt aattagtttg tgggcagttt 480 

tcatatgctc tgtgaaatgt gtccagatgt gacataagtt ttttttttta atatggngga 540 

aatgncttct ctttcccatt cttttctcct aaaaatcata tatactggga atatatgcct 600 

ctnttacctc tattaccctc ctcacattta ccctttccca gttnggtttt gctttttnac 660

caaaaagatt ccaatnccna ggtattggca agttntnaaa accgcccntt aaacatccct 720

aatttcncag nattccnnnc ttgccaaatn ttngtntcnn 760 

<210> 437 
<211> 748 
<212> DNA 

<213> Homo sapiens 
<400> 437 

ggnnnnngnn ngntnncgtt ccctattant caggngctcg ntctntctcn annnancnng 60 

gcgtgtncga attcggcacg aggattttcg aaactcttca gctacttgcc cttttttatc 120 

tgaaaccatc ataccttctg aaagaaaaaa gcatatcttc attgacataa cagaagtgag 180 

atggcccagt cttgatacag atggtccatg atatatatgg agagtggcat tgtgaagata 240 

acatctttag atggtcatgc atacctctgc ctgcccagat ctcagcatga atttacagta 300

cattttttgt gtaaagttag ccagaagtca gactcatctg cagtgttgtc agaaacaaat 360 

aataaagccc caaaagataa actagttgaa aaaactggca aaatctgtat acgtggaaat 420 

ttaccaggac agagactgaa gaataaagaa aatgagtttc attgccagat catgaaatcc 480 

aaagaaactt taaagaagat gagttgtgta aatggaactg aagggagggg aagaactgcc 540

ttcgcctggt acaaagcaca catgtgtata cacatgggtc aagcagtgct ggtctgtggc 600 

tgnctgtcca gangaatgga aatatccttg gctttagcac ttcattttca taataaaatc 660 

agcaattntg tctaaaaaaa aaaannnana aaaaactnga gcctntanaa ctntagtgag 720 

tcgtattacg tagatncnna catgataa 748 

<210> 438 
<211> 823 
<212> DNA 
<213> Homo sapiens 



<400> 438 

taatccttnn tattgntcgg gtactngntc tntctcnaag annntntcgt tncgcccagg 60 

tagctgagac tacccacacc ttggtcccag ctacttggga ggctgaggtg ggaaaatcac 120 

tttgcccagg aattcaaggc cgcagtgagc tatgattgca ccactgcact ccaggcaaca 180 

gagtgagacc ctgtcttaaa aaaagaaggg agaaagtgtc agatggtgat gaggtctggg 240 

ggggaaatag agaatgggga tcaggagtgt ggatggtggt attccctcac caagaggtga 300 

catgttgagc agggaacttg ggaggtgagg gtgtgacccg tgtggaaatc agggaaaagc 360 

attncagcct gagggacagc caatgcanag gccgtgaggt ggccagtgcc actgagcagt 420 

gagcttggga tagggggcan gtgangaggc tggagagcgg ggtcagacaa accaatatgc 480

ttatttaaaa caaggttgtt ncagcaccct tgccttaaag ccttgagcct gnaanctnga 540 

aaaatttggg cacnttcaaa agcanggang gaaaccaaaa gaagattggg agggaaaagc 600 

ccttncnttc ccttancagg aaatgaagtt nccacccttn aaaacaggnc caggaccttt 660 

ttgggaccct tttggccttt tggttcctta gaatcctctt ggtngcttnn gaatnaaaag 720 

gnaaaagggg cctttaaggg gggatcccat tntttccaaa attcaaaggg ggctttccct 780 

gggcttaccc aaaatttctt ggncttaant aaaaaaattt ntt 823 

<210> 439 
<211> 767 
<212> DNA 
<213> Homo sapiens 



<400> 439 

gnnnnngntt ctaatgctgg nnnnnnnngg taccctttcc aaaacctggg ctctcgntct 60 

ttctncangn agccnngcga ttcgtctgtc tggtgatttt tattttaagt gaacctttgg 120 

atctatcttt aactctcttt attgtgagtc taaattccaa ttctgcagca gatcagtaaa 180 

ctcacagtat ttttcctgtg gaaatctatt caataaggaa accaagacag gatantaaaa 240

tttaaaaaaa ancaactttg aattcccctg cctaggtctt ccagttgttt tccagcgcat 300 

acctcaggta tgactttgct agccggggac aaaattagca ccttccgatt ctctagtcca 360 

aatgaacttt ggctaaataa aaaattatta tactacataa taaagttnca gatagcagga 420 

aatgcaagag ctaggagatt cctagattat atctggccaa gccaaatacc ttaaacatcc 480 

acctggaaat cctctacccc ctcttctgag ataatttgcc cagccctttc ttcccacaca 540 

ctcactcaat gtcaccccct tctaatcccc aaaactgttt ttgtggcctt ggtagcctat 600 

agtagtttct cacatctttt cccctanact tttctgtttt cagtttcaga ccaaaaaaac 660 

tcttcaactt ttttccagtg gggtcttcct taccagtaac tttaccactt gnaatcttat 720 

ttcattgaaa aaaccttaaa tgggntggga aaaggcttgc cnncann 767 

<210> 440 
<211> 752 
<212> DNA 
<213> Homo sapiens 
<400> 440

nagnnnnntt tctaatgctt ggnnnnnnnn tcnatgcttc caaaagcngg gngctcgttc 60 

tttctccaag atncnngcgn tncgaattcg gcacgaggat ggatgagact gttgctgagt 120 

tcatcaagag gaccatcttg aaaatcccca tgaatgaact gacaacaatc ctgaaggcct 180 

gggatttttt gtctgaaaat caactgcaga ctgtaaattt ccgacagaga aaggaatctg 240 

tagttcagca cttgatccat ctgtgtgagg aaaagcgtgc aagtatcagt gatgctgccc 300 

tgttagacat catttatatg caattcatca gcaccagaaa gtttgggatg tttttcagat 360 

gagtaaagga ccaggtgaag atgttgacct ttttgatatg aaacaattta aaaatcgttc 420 

aagaaaattc ttcagagagc attaaaaaat gtgacagtca gcttcagaga aactgaggag 480 

aatgcagtct ggattcgaat tcctggggaa cacagtacac aaagccaaac cagtacaaac 540

ctcctacgtg gtgtctactc ccagactncg tacgccttca cgtnctcctn catgctgang 600 

cgcaatacac cgcttcttgg gtcangaatt agaagctact gggaaaatct accttccgac 660 

agaagagatc attttagatn taccgaatga anaaagcttg cattagtgac attgaaaggg 720

aaataaaaat tcctacagtc naaaaaaaaa at 752 

<210> 441 
<211> 775 
<212> DNA 
<213> Homo sapiens 

<400> 441 

gnagccngat tccaaaacct gggnnccgat ccaatgcttn ccaattactt gggagctcnn 60 

actngcncna ncaanctngc cntgcgaatt cggcacgaga agnaggcgga gcttgcagtg 120 

agctgagatc gcgccactgc actccagcct gggcaacaga gtgagactct gtctcaaaaa 180 

aaaaaaaaaa aaatggaacg cagggcaaga actcgtnttt ggaaggagat gggggaaagg 240

ancggtatta tacctatgtt gnatttgcag gcaaatgaga tgganccctc tctgtaaaga 300 

agagtcattt gtgcaagtag acggggtctg tgggtgcang ccctggaggg gcacacaatt 360 

gcctgnangc ttctgtgana tcgggagang gaggagaagc agtctcttga caaaataaag 420 

tatttttatt cattngtatt tattaaatga aaaaacaatc ccatggtgtc ccttgtgtgt 480 

ggtggaacct aatgactgtt gaaataaagt ctgngttttc ccttcaaaaa aaaaacncnn 540 

anaanaaaaa ctcgagccct ntaaaacctn tngngagtcc gnattaccnt anatcccnga 600 

cnttgataag gatccattga tnaantttgn cccaacccca actnngaatg ccnngaaaaa 660 

aaattgcttt atttgggaaa tttgcnaatn ctttgcttta ntttgnaccc antttancnt 720

cannnnccaa gttacnancn ncaattgcnt tcatttangg ttcaaggttc aaggg 775 

<210> 442 
<211> 804 
<212> DNA 
<213> Homo sapiens 

<400> 442 

gagnnngntt ctatacctgg gnncgatcca aancttncct attaccttgg atcttnngct 60 

atctcnaann aaaangcttn cgaattcggc acgaggccac ctgcactgag gtctgggccc 120 

ggggacaggg tgctttagcc aggcttgtct gcgcctcagg gaagggtgag cagcccaggg 180

accagatgca agttggtggg cccctccacc cntccnacgc cactccccag tgtgctgggt 240 

cctaaccagt cgtcctatgg gagcagtcag ccttcctctc ctcctcaggg cagctctccc 300 

acctgctgnt ccccgcacac agaacctcat tgctctgagc agttgcttat tacccagttg 360 

ttgaaaaact agcatgtgan ggccgggcgc ggtggctcac gcctgtaatc ccaacgcttt 420 

gggangccaa ngcgggtgga tcatgangtc aggagatcaa gaccatcctg gcttaacacn 480 

gtgaaaccct gtctctacta aaaatataaa aaagtancca ggcgtggtgg tggcccctgt 540 

agtnccaann tacttgggaa gctnangcan gaanaatggc ntgaacccaa gaaggaagaa 600 

cnttgcantg aancttaaaa ttgcgcccac tggaatttca aaccttgggc cnanaanaat 660

tgaagaatcc cgtcttaaga aaaaaggaaa aaantttncc nttntnaaag gcccggccac 720 

aantnggctt taacgccctg gtaaatnccc aancactttt tggggaaggc ccaaaggcaa 780 

ggccnggatt caattttnna aggg 804 

<210> 443 
<211> 786 
<212> DNA 
<213> Homo sapiens 

<400> 443 

gnagccggat cttattattg gcnncgnttt aatgctggct aatntntcgt aatncttggt 60 

nnccccaann annnaggngg ggngaattcg gcacgagcac catttttatt ttgatgctta 120 

cactcattta ttctgttttt gtaaaacagt ttcaagaatt taaaaatcct tccagttaat 180 

agagcttttg ttattatatt ataattttgt aaacccactt tgtttttccc actttaaagc 240 

cacagggtcg actcatggat gatacctcta ttgctgctgc atgatgttca agaccggccc 300 

ttggctgttg ttacagagat gttgggcaga gctatgcagg tgtttcattg ngaactctag 360 

ctttgatcat ggtaaaaagt taaccctttc tattttttaa tggatgttat accaactatt 420 

cagaggactc atacttcaaa aatattagga aaatctgtct tatagttctc taataaatat 480 

ctgaaatctc aagtacgaca tgaaagaatg tcagaccatt gntattggtg aaagtcattt 540

gatgaatggn aaattctatg aaaagtaagt ggatttgcat ggattaatat cagggaaaat 600 

ttaagccttc ccaagtgtga ctgggccaaa gagagccaga tgcccccagt gcctgtgccc 660 

ataaagttcc cgaatccccc aatggggtct nttttcaaaa acttggncca gacccggaaa 720 

ataaaancat tcntcataaa ttcaannggg gncctcanga aacacnttcc cccancaacc 780 

cttngg 786 

<210> 444 
<211> 760 
<212> DNA 

<213> Homo sapiens 
<400> 444 

gnagnccggt tcnnangcnt nggctnnatc caatgctggc taaagttcna ananctggca 60 

acnccaggan ncangcgttg cgaattcggc acgaggagga attacaggta gcaaattatg 120 

gagttggagg acagtatgaa ccccattttg actttgcacg gaaagatgag ccagatgctt 180 

tcaaagagct ggggacagga aatagaattg ctacatggct gtttnatatg agtgatgtgt 240 

ctgcaggagg agccactgtt tttcctgaag ttggagctag tgtttggccc aaaaaaggaa 300 

ctgctgtttt ctggtataat ctgttgccag tgggagaagg agattatagt acacggcatg 360 

cagcctgtcc agtgctagtt gcaacaaatg ggtatccaat aaatggctcc atgaacgtgg 420 

acaagaattc gaagaccttg tacgttgtca gaattggaat gacaaacagg cttccctttt 480 

tctcctatng gtgnactctt atgtgctgat atnccatttc ctagtcttaa ctttcaggag 540 

tttacaatng ctaacactnc atgatngatt cantcatgaa cctcatccat gttcatctgn 600 

ggcaattgct taccttgggg gntcttttaa aaagtaccac gaaatcatca tattgcatta 660 

aaacccttaa aagttctggt gggnatcaca gaagacaagg ccnaanttna aagnggagga 720 

attttattat ttaaaagaac cttttgggtn ggatnaaaan 760 



<210> 445 

<211> 761 

<212> DNA 

<213> Homo sapiens 






<400> 445 

tggtgccggt tcttantctg ngctctcgtc ttccttctta tacctgggca ncncttggcg 60 

gccccnaggn tcccangnag ccnngcngng ncngattcgg cacgagattc caaaggtttc 120 

aaagaacttg gtcataaata tgataatgag aagacaaagt atttatatta aaacagttta 180 

gtagccttca gttttgtgaa aatagttttc agcacagaaa ctgacttctt tagacaaagt 240

tttaaccaat gatggtgttt gcttctagga tatacacttt aaaagaactc actgtcccag 300

tggtggtcat tgatggcctt tagtaaattg gagctgctta atcatattga tatctaattt 360 

cttttaacca caatgaattg tccttaatta ccaacagtga agcactacag gaggcaactg 420 

tggcattgct tccttaacca gctcatggtg tgtgaatgtt ataaaattgt cactcagata 480 

tattttttaa atgtaatgtt atataagatg atcatgtgat gtgtccaaac tatggtgaaa 540 

agtgccagtg gtagtaactg tgtaaagttt ctaattcaca acnttaattc ctttaaaatn 600 

cacanccttc tgcctctgna tttggaagtt gtcagtncaa ctcatcaaag aaaactgcct 660 

aatntnaaaa tcatattntg ggaataattt ccctcttttg tagtctgccc aagatcctta 720 

aagattggat ttttattact atttaaacca gtggattaat n 761 

<210> 446 
<211> 770 
<212> DNA 
<213> Homo sapiens 

<400> 446 

tgggnnngnn ccnaangcng gggannnggt ccccgttcca anactggaan ncttggcann 60 

cgaactcgct cnannagnaa ggccgggnga attcggcacg aggccccgct ccatgagcag 120 

tgactcccca gctcctcctg gcaccagtcc ccagggctct cctgttggta gttcctgctt 180 

ttcttcttgg aaattcctcg tggacctcga gatctttacc ctaaaatagt tctgttgaat 240

ttcaccctgg caatgtaaat tgatagctta tcttcacaga tgccagacaa tggacaactc 300 

accatcagtc ctctgctcac ctgagacaaa tgcatgtctg attgcttcct ctgccctatt 360 

ggntatgtga aaatgcagat tcactgagcc agactaaggc atcagtgact ggtcctctac 420 

ctgcctctca catggagatt gggtattcag tgaaaggctg atcaaagacc caaaggaatg 480 

caacagttta tctcttatct acctatgacc tgcganctgc caccaccccc agntggngcg 540 

cctttccaga cagaaccagt gtacatctta cacgtattaa atngatgtcc cnggggctcc 600 

cnaanangna tcaaacaagc ngggcctcga ccaccttggg cacatatccc nanggacatc 660 

annctggagg ctngngncac tggcattggc cctnaccctn ggcaaaataa accttctaaa 720 

attggnaaaa aanaaanaan aaaaacctng nnccctntna naacnntacg 770 

<210> 447 
<211> 757 
<212> DNA 

<213> Homo sapiens 
<400> 447 

tggtatnntt tnaangctgg nngnnnggcn ccgttccaat gnctggganc nttggcaatc 60 

gctctttccg nangatccca tcgattcggt ctgatgcagg agaattgcta aaacccagga 120 

gggagaggtt ncattgagcc gagattgcgc cactgcactc tagcctgggc gacagagcaa 180 

gactccgtct cgaaagaaag aaagagaaag gaaattcccc agggaagtac ctcggcttat 240 

ttcataaaca ggtactgaag gaagcagagg catgtggagg acttccccac ctcgtgcagc 300 

tatttgggcc gtggcatctg aaatttctta tttcagagtc acccctttga tgaccttggc 360 

agtgaactgc agtcatctgt ttaggccttt ccatggccca cgtcaatgcc ggtatttctg 420 

tttgttgcac atttgatttc cttgttgttg gcatttagaa ggccccccgt ttcccagatc 480 

acaccacggg catggaccac agagattgca tcttgtgagt ctgtagaaat ggtcaaggcc 540 

ttgtcctctc ttaagtccag agctcangtt aatgcaaaat tttnccggnc atctgtgctg 600 
aaatcccttt ggggaagctc ctggctggtt tcctgtaggt aggacagcta cacgtnctgc 660
cctttattgg cttcttttca tgaagctcct gccatntacn aaacatgtct cccttcttga 720 
atcacatctc tggtattgna actctanaat cgcccgg 757 



<210> 448 
<211> 770 
<212> DNA 
<213> Homo sapiens 



<400> 448 

gggtgnnnng tttctaatgc ttggtngnnc nggnccnacn tttctaatgt tcggaanggc 60 

ttggctactc gntctttctn cangnagccc ntcggtncga attcggcacg aggtgtcttc 120 

atcttaccca gtggaaccta agaaattaaa ttctccagaa gaaactgctt ttcagacacc 180 

aaaatctagc cagatgcctc ggccttcagt gccaccatta gttaaaacat cactgttttc 240 

ttcaaaatta tctacacctg atgttgtgag cccatttggg accccatttg gctctagtgt 300 

aatgaatcgg atggctggaa tttttgatgt aaacacctgc tatgggtcac cgcaaagtcc 360 

tcagctaata agaagggggc caagattgtg gacatcagct tctgatcagc aaatgactga 420 

attttctaat ccttctccat ctacctctat tagtgctgag ggtaagacaa tgagacaacc 480 

cagtgtgatt tattcatgga ttcagaataa acgtgaacag attaagaatt tcttgtcaaa 540 

acgggtgctg ataatgtatt ttttcagtaa gcacccagag gcctncattc aggctgtttt 600 

ttcagatgcc caaatgcata tttgggcatt agaaaggtct gtcgcactta gtagcagcat 660

cattttacag aggatagatt tggagttgtc cagacgacac taccagctat ccttaatact 720 

ttgttgacac ttgcaagang cagtcngaca agtactttaa cttcctcatg 770 



<210> 449 

<211> 792 

<212> DNA 

<213> Homo sapiens 



<400> 449 

ngagaaangt ttctaatgct ggnnnngnna gntcancctt tctaatgttc taatacttgg 60 

ganntcgaac tntcncnaca cagnnangcn ntgcgaattc ggcacgaggn cnnctcnatn 120 

atnacttgnt cncancggnc tggcatcnac ncgncacacc tacntnagcg cnttgtagcg 180 

caatatncac ctnntnaaac ccnnnagtcc cagggctctg ccnnnnnact gntcaactga 240

cnaacnacnn nctancncaa cntnnnntta ngccnctgnc tgnctctatg gcacctnncc 300 

tnccntcncn cntnacccnc tacgctcagg gctatataca atgggaacct tnccaacagt 360 

aanccntgga tctnaggnat ggcccttgnc tggcggatca cagccttnna gcntatcagn 420 

atcttgagga agacaccatt ccgtcccnga ttntgaccaa ncnctcggat gtgnctatgg 480 

gctcnattga ggnacaacaa ctnncactgc nnataggcca tcctcnnnan nctacacatg 540 

ngactttncn nnncatntna aatgnnnana tgtctctcnc aagcatcacc cnctgtccct 600 

ncgncntcnt ggaagacctt ctgnncaact ganctccttc ntgnnncnnn ngattnttnc 660 

nnncnnaata tncntncccc aatgnccttg tnnngnattt atnangggnt ttccaatttg 720 

ggntaattca ntncccnccg nannctannn ncccatnaac cntcngngcc ttcttgnaac 780 

cttttnncct gg 792 



<210> 450 

<211> 848 

<212> DNA 

<213> Homo sapiens 



<400> 450 
gnatgncccg
annctgggng 
caagttgntc 
gaatcccttt 
gactgaaaat 
agcaaacgga 
gaccctgaga 
aaatacgntt 
acactggatg 
tncnccanng 
nnnnnngggn 
nnggnnctnn 
annnngggnn 



nnnnannnnc 



nnganccnnn 
aaacccnnnn 



nnnncnannn 



atttccttaa 
ntcgcaactc 
tctgttctag 
gtcctttttt 
acagaactca 
gatcganccg 
gaaagaactn 
cctgacactt 



aaaaaaanaa 



tgatggggnn nnnnngagcg anncttccga aanttccaat 
nctcnanaca gnaaggncgn gggctttgct ctctccattc 
aaagcagatg tagtagacat ctactgttgt tgcctgaaca 
tgntaaaagt actcatccct aatattcatt gtnctggaag 
caccatgatc ggccgggaca atcagattat ttcattccnc 
aaaagtggaa anatgagcnc ttctttggng ttggcatatg 
tnattntttc tcttggactg caataaagta tagctgccta 
ggaggnttgt ccacaatcgg ngaaataaag gcgagaccgn 
grmnccngnn gaanacccac tnnnccannn nccnnnccnn 
tanccgnnan naggccnnng cnntngcnnc nnngccnnnn 
gnnnnncnnn nnnnnnnncn nnnnannnnn nnncnncnng 



cccnncnncc cnncnccnnn nggnaanncc nnnnnnnarm 
ccnnnnnnnn cannncnncn cnnnnnggnn nnnnncnnnn 
acnnnngngn nnnnccnnnn nnnnnncngg nnnnncnnnn 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



nnnnnnnnnn 



ncnngngnnn 



840 



nnnncccc 



848 



<210> 451 
<211> 765 
<212> DNA 

<213> Homo sapiens 
<400> 451 

gnnnnnnntt tcctaaatgc ttgggnnnnn nnngagngnn nttncnnagt ttcctaanta 60 

gcttnggcna ctcgttctnt ctncangcag nnnntgcgtn gncgaattcg gcacgagcat 120

tcctcctttg ttaacgaagc aacatttaca caagatggac attacattat tagtgcatcc 180 

tctgatggca ctgtaaagat ctggaatatg aagaccacag aatgttcaaa tacctttaaa 240

tccctgggca gcaccgcagg gacagatatt accgtcaaca gtgtgattct acttcctaaa 300 

aaccctgagc actttgtggt gtgcaacaga tcaaacacgg tggtcatcat gaacatgcag 360 

gggcagattg cagaagcttc agttctggta aaagagaagg tggggacttt gtttgctgtg 420 

ccctctctcc cgtggtgaat ggatctactg tgtaggggag gactttgtgc tctactggtt 480 

cagtcagtca ctggcaaact ggagagaact ttgacagtgc acganaagga tgtgattggt 540 

attgcacatc accctcatca gaacctgatt gctcctacag tgaagatgga ctcctaaagc 600 

tctggaaacc ataattcaac ttttcttttt taaatcaact cgaaagcatg tncttaaatg 660 

aacatattca tgtaangggc tttttttttt tgncactttt ctaagcaaat agatggctga 720 

attagtcacn gaataaattt gngaaaatca tggttaaatn ccaac 765 

<210> 452 
<211> 765 
<212> DNA 

<213> Homo sapiens 
<400> 452 

nnngnnnnnn ntttcctaaa tgcttggggn nnnnnngngn nnnnnttctn atgttcctan 60 

ngcnnnggng ctcgttctnt ctncacgnng ccngtgcggt gncggtctga ttgaaagctg 120 

ttcaggttta tcatgcaaat cctcgcctct ggctacggct ggctgaatgc tgcattgctg 180 

ccaataaggg gacttctgaa caagaaacta aaggccttcc cagcaaaaaa ggaattgtnc 240 

agtctattgt tggtcaagct atcatcgtaa aatagttttg gcatcacagt ctatacagaa 300 

tactgtttat aatgatgggc agtcttcggc cattcctgta ccagtatgga gtttgcagcc 360 

atatgtctca gaaatgcctt gttgctgctc ctgaagaaca gcaagatcca aagcaggaaa 420 

atggggctaa aaatagtaat caattaggtg ggaacacaga gagcacgaaa gcagtgaaac 480 

ttgcagcagt naaagccatg atggagatna attcattcca gcttcacctt cttctccatt 540

gagaaaacag gaattagaaa acttaaagtg ctccatactt gcttgcagtg cctacgtggc 600

tctggctttt gggtgatacc tcatggcttt gaatcatgcn gatnaacttc ttcagcagcc 660 

caagctgcag gatctcttaa gttttgggac atttatatgc tgcagaaccc ttatcttctt 720 

cgacngaatn tctgtgccnt tctcacttga ccccgagaat gtnct 765 

<210> 453 
<211> 833 
<212> DNA 
<213> Homo sapiens 

<400> 453 

gngtnnnnnt ttctaangtt cntaatannn tnggctactc gttctttctc caggnatccc 60 

ntgcgattcg aattcggcac gagagaaacg ttctcaggtt gaccagctgc tgaatatttc 120

tttaagggag gaagaactta gtaagtcatt gcagtgcatg gataacaatc ttctgcaagc 180 

ccgtgcagcc cttcagacag cttatgtgga agtcaagagg ctacttatgc tcaagcagca 240 

gatactatgg agatgaatgc actgaggacc catagaatac agattctaca ggggattaca 300 

agaaacatat gaaccttctt gagcacccca ggttttggca ttagaaaatg ggtacccctt 360 

ggttcaaaaa tgaacaaaga aagccttaga tttggatggg ggaacctgat ctgtccagtc 420

tagaaggatt ccagtgggga aagtgtttcc atttccttng tcccctggct tggccaagga 480 

aagcgaaagc cctttcttga anagcaaccg tggatcattn gaccaggaac ttccttctgg 540 

ggtattaagc ttctttcaan tggaagggaa aggttccang gccaaaggaa aaatggaagc 600 

ccccaaccng atgggtttca ccctaantaa cctcaattgg aagggcttgg accaagaacc 660 

cnggaaaggc nanccattgc acccttaaaa ncaaggaaag tggaccacct ttggggcttg 720 

ncnttccntt ccgaaccagg ttgaaaangg gcttgaaaaa tggttgctta cccaaaaggg 780 

cgnacnttaa tggcaccaat tattcctntg gaccnttttt aatanccttt ngn 833 

<210> 454 
<211> 737 
<212> DNA 

<213> Homo sapiens 

<400> 454 

gnggnnnntt ctaagttcct aatnctgggc tactngttct ttctgcaggn atcccatcga 60 

ttcggcaaaa tcaatgtgga ctgaacataa atcacctgat ggaaggactt actactacaa 120 

cactgaaacc aaacagtcta cctgggagaa accagatgat cttaaaacac ctgctgagca 180 

actcttatct aaatgcccct ggaaggaatn caaatcagat tctggaagcc ttactattat 240 

aattctcaaa acaaaagaat ctcgcttggg ccaacctaaa gaacttgagg atcttgaagc 300 

aatgatcaaa gcttgaagaa agcagtaagc aagaagagtg caccacaaca tcaacagccc 360 

cagtccctac aacagaaatt ccgaccacaa tgagcaccat ggctgctgcc cgaagcagca 420 

gctgctgttg ttgcagcagc agcagcggca gcagcagcag cagctgcagc caatgctaat 480 

gcttccactt ctgcttctaa tactgtcagt ggaactgttc cagttgttcc tgacctgaag 540 

ttacttccat tggtgctact gntgtagata atgagaatac agtaactatt tcaactgagg 600 

aacaagcaca acttactagt acccctgcta ttcaggatca aagtgtggaa agtatncagt 660 

aatctggaga agaaacatnt taaccaggaa actgtanctg attttacttc caaaaaagaa 720 

gaagaggaga gccacct 737 

<210> 455 
<211> 718 
<212> DNA 

<213> Homo sapiens 
<400> 455

ggnnnnnntt tnnncngttn cntaaaannc tgggctactc gttctttctn cangnagccc 60 

ntgcgatncg aattcggcac gaggatgagg agtgtttaat cattgataca gaatgtaaaa 120 

ataatagtga tggaaagaca gctgttgtgg gttctaactt aagttccaga ccagctagtc 180 

caaattcttc ctcaggacag gcttctgtag gaaaccagac taatactgct tgtagtcctg 240 

aagagtcatg tgttttaaaa aaacctatca aacgagtata taaaaaattg atccagttgg 300 

agagatttta aaaatgcagg atgagctctt aaagccaatt tccagaaaag taccagaatt 360 

gcccttaatg aatttagaaa attctaaaca gccttctgtt tctgagcaat tgtctggtcc 420 

ttcagactcc tctagttggc ccgaaatctg gatggccttc tgcatttcag aagccaaaag 480 

gacgattgcc atatgaactt caggactatg ttgaagatac atcggaatac ctagctcctc 540 

aggaaggaaa ttttggttat aagttattta gcctgcaaga cctgttgttc tcgtcgctgc 600 

agtgtncaga ggatagagnc agaccacgtt ctaaaacnga gaaatcagaa gacatttnca 660 

gttatgtctc caaaagtgag tntcagctgt atgagttgac tctgctgaaa gtgacttg 718 

<210> 456 
<211> 739 
<212> DNA 

<213> Homo sapiens 
<400> 456 

gtggnnnntt ctnngtttcc aatangntgg gtctcgttct ttctnnacga tcnnntgcga 60 

ttcgcttggg aggctgagtc aggagaaatt gcttgagccc aggagatgga ggttgcagtg 120 

agccaagatc atgccactgc actccagact gggcaacaga gggagactcc gtctcaaaaa 180 

ctaaaaaaaa aaatncattt agtataccgg ggggtggggg ggagaaataa tgttatttcc 240 

tatgcgaaat gacgtgtatc cctgtaccca tgggtaaatg taaatatact gtgtctcttt 300 

tgggagagcc ttttagtaga ggagtcttat atgaagtctc tcataagtag ttcacttgag 360 

ttttgcagtt tgaaatctta aaggagcttt aattgacatt tattatacca attaagcttg 420 

gaatggggca atggatgcat ttccaaaacg tgtgaaagcc taacagctta tattgctgaa 480 

tgagaatctc ctgggtgtaa tttancactt agggaactgc gtgaacactc ccagccatta 540 

tgatgctggt accagcttta ntgtntaaat gccatganta ttctttctgn tctgttttgt 600 

gctctcttgg tncatttatt ttacccttta cngaataatt tcttgtaaaa tccntaaaaa 660 

tntttggcat ttaaaagtcc nntcttggan tnaananann nnnaanaaaa ancttncccc 720 

tttanaactt tngngggct 739 

<210> 457
<211> 743 
<212> DNA 
<213> Homo sapiens 

<400> 457 

gtgnnntnnt tctnnngttt ccaattantc tgggngctcg ttctttctcn anncnnnnan 60 

tggttgncga attcggcacg aggnnanagg gnagctacat gnntnaccnt nttngnnctc 120 

tcagccangc tcnnctnnnn ctggtcctac tgctacatag aacacttgtt ntncnnggna 180 

actnntntat gtnnccnnga ntctctgnna ctngtttaaa tgctanttga taacaggcta 240 

tgcaaggnct gnaagtggan agcgtcatca ttcatcatnc ntnttanctn gantnnntgt 300 

atcctacatg ctttgattgg taaatgngcn tcagactggg actctcaata aatgnatata 360 

gangancttg ctgtggaaan ctgtcctctc ntatctntnc atgngnaant tccactncag 420 

tntgaactcc aaatgcnntn atnggnganc cctncttgta tagtggtgtc cattccaanc 480 

tgcnagggnc tagaaaccgt cggctntngg aaacnatggt gnnagttgan ctggtacang 540 

cngttntcac ctgcanctac cataaaatgg gnttacccaa gctttatcat ggaatggnta 600 

taaaaaacgc attnattgng cctttntaan cccattatnt gttnaatttn acttatggtt 660 
ccccccattn aaattatnca attgggnann gangcttcna gtcnccatnt ttnaatggnn 720

ttnncaaaaa aacgnttttt ttt 743 

<210> 458 
<211> 906 
<212> DNA 

<213> Homo sapiens 
<400> 458 

gnngnnnrmn nttnctaatg cttgggnncn cgtttctann nnnnnnnnaa nntttcctaa 60 

ttggttaggn gctcgnnctn tctccacnna gnnnngcggt gcgaattcgg cacgaggctg 120 

aatcaaggat cacaaactnc acatttngca cnttggtctn cacatncntg gttngggcag 180 

tcncagtnaa catggctntg gaaactnatn ttngnctngc ntcaaccatc tcgttcccng 240 

gggacccann ntccnnnatc ncgnnttncc tcgnnnatng gagngctnct tngnccannn 300 

atgggctccc nanaatangn ntnncnnngn nnatncancn ncngncaann ggntcnncnt 360

nnnnnngccc tnntncctna tggnnngctn catgncccat nnnnnnggnn ancaataann 420 

naaanggtct nttcccncga nncccnnnnn ccnctaacan ngnacctcgc aaagggcccc 480 

aggcnttnnc tngnaaacca nnttngccaa nggtanttca aaggnngcct tngggacctc 540

ccnannnngc cntggnnnta ccccgggnaa anggtngnaa acccnnccnn ngntgccnnn 600 

cccggncnng gaaanaaatt tccnnggnac ccagnntncc nccgnaannn anantannnc 660 

ccancccnaa cnttnngccc ncanccnttn gnnnntgnan tcnncnnncc ctttnnnntn 720 

nccaanncgg ccnggnnacn nncttnnacc tntttncncn naanngacnc caanntcctn 780 

nannaaaggg nggnnnnnnn nnncnttncc nnnggnagcc cnnnnnncct nncntnncnn 840

aaaaattcnn cnntgnancn cccctnnnnt nangngnccc natnnnnnnn nnngnaaanc 900 

nnaccc 906 

<210> 459 
<211> 765 
<212> DNA 

<213> Homo sapiens
<400> 459 

gnngnnnngg nttcctaang ctggggnccg ntctnnnnnn nnnnnnnngt tcctaaanac 60 

ctaggngctc gntctngctc cacgcagncn gggcgtgggc gaattcggca cgagcttctg 120 

ttgattggtt tgtttaaagt acctaagtac taccctttga ctccctacca aaagttcttt 180 

tgttttttaa acaactttta tttgtgactt actttcttga gaagngttct taatgaattg 240

cataaaatag tggtagcagc ttatttctta agtncttnat tattggggct ttaccattca 300 

ggtcttatct ttaaccctta tttactcagt tttccatctg aatgatccta tctctaaatn 360 

aaggatttaa taaatgctgc aaattgtcca ctttgcaaat ngtccaaaag ctttagtttt 420 

ggaccttgng aacttttttt ttaataacac attatttggg cccggtcgtg gtggctcaag 480

cctgtaatcg cagcactttg gaatgcctag gcagacagat cacttaangc ctggagttcg 540

agaccagcct ggccaatgtg gtgaagacct ccggttctat tactaaaaat nctaaaaaat 600 

tancaaggca tggnggtgca cgcctgnaat ctcagctact tgagangcaa atcnggagaa 660 

atgcttgacc ngggangcan anatganccn anattgcacc actgcattcc acctggggan 720 

nanantgaga anctggctca aaacccaaaa acccaaaaaa aaaan 765



<210> 460 
<211> 677 
<212> DNA 
<213> Homo sapiens 
<400> 460

gtttncgctg ggagccacca acatagcaga ttaccatgtg aagttgccac tgctgcatct 60

cctgaaacct ggctgatggg agaggtctca ttttgtgtct gagaatgtcc aggttgtctg 120

cagaccacag cactgatttc ccattagcag ttattatttc ctggccattt cttcctgaag 180

gttttgtggt taaactccct gtcctcaata ttttatcagc agtagggctg tcattcttct 240

ggttatcaac ctctacatta tgaagtaagg ttcaaccctt ctgcttttct caggccccca 300

aaacggttcc tatccaatcg aacacaaaaa cgggtattga gaaggaattg gcagggctca 360

gtggctgttt ccgttgctcc tacctcatgg agactcttac tcatgctgga tttattgaga 420

gaacttctaa ctgaccactc acccccaccc actcttatgc agtctgttca ttcctgaaaa 480

caccactttc atccctcctg cacacaaccc atgagggatt gctacttcct ataagattcc 540

tcagtgagcc ttatagagtt gctgcgagaa ttacatttgg tcatgatgtc aagtgtctgg 600

tatgtagctn atgcttattg aacacatagt aatttattgg aataattgnc atgatcactg 660

gatgagaata tagcccn 677

<210> 461 
<211> 787 
<212> DNA 
<213> Homo sapiens 

<400> 461 

gnnnnnnnag ggnnnngngg ggcctcncaa agcccgngcn acaggtcccc gttccaaagc 60 

ntgggnganc gcnncgcccc ancagnaagg cgggggaang cggcacgagg acatcatcnn 120 

cttattctag taagagaaag tacacagatt caactttaga gaggacnggg gggnnnncng 180

gagcnaaatc aaggaaggan tatcacgngg ccncccnnga atataannnn gaagctgnga 240

acagnaccat cagnaacann nnatggacag ctctgatggg gnnnatacca cggcactctn 300

cnnaccnnng gnggaagcna tccggagnna tgactgangn gnaaagnggn nnactgnnag 360 

aanccnggng ngctaggann ctgggagagn cactttcang aagnnaccng gcgangagnc 420 

atcanaagaa cccgganaag ngagaagacn ggaaaaagnn cncancgnac ngagcccagn 480 

nannnncnct gagccanggg ctncgaaang ccccaccnga agcnccatca canggnacaa 540 

ggnnngggaa aaggaancna cnnngcngac angnccnccn aanagngcca aancacngcn 600

nngcccncnc gcccaaagaa nacnggacng cnggcncnna ncanaaggag cncnanggcc 660 

cnnggnaang aaactncnag nagcccaanc ccaaaggccc cnanggannn ccnncaaggg 720 

gaaaacanna nncacccaag gggcctgggc naanaaggcn ncccacncng gcccnccnnc 780 

nnnaccg 787 

<210> 462 
<211> 747 
<212> DNA 
<213> Homo sapiens 

<400> 462 



cttaattgnc 


ttgggcnnct 


cgctctntct 


60 


acgagcctca 


gccccacacc 


agctctattt 


120 


tgtgcttcat 


gggatttcga 


ttcgaagatc 


180 


tacaacaaaa 


tactaggtaa 


gtcactgcag 


240 


tgggtttgtg 


gagaatcaga 


gcatcttgac 


300 


ggccagggat 


cctgtggaac 


ctcttctagt 


360 


tctagtgaca 


tctgatgctt 


gctgtgaact 


420 


tctttaattg 


ccctgtattc 


cgaagggtaa 


480 


tgaatccccc 


ttttttcttt 


tctnctctct 


540 


tctaaatata 


ctgaaatgat 


ttanatatgt 


600 
gtcaccaatt aatgatcttt tattcaatct aagaaatggn ttaagttttt ctctttagct 660

ctatggcatt tcactcaagt gggacagggg aaaaagtaan tgccatnggc tccaaagaat 720

tnntttatgt tttagctatt taaaaaa 747 

<210> 463 
<211> 750 
<212> DNA 

<213> Homo sapiens 
<400> 463 

tncctttcta angcnnntng nnaanngtcn ccgttctaan tncttgggca gnncgctctn 60 

tctncannca gncnntgcgt tgcgaattcg gcacgaggcg agatgaagct acactgtgag 120 

gtggaggtga tcagccggca cttgcccgcc ttggggctta agaaccgggg caagggcgtc 180 

cgagccgtgt tgagcctctg tcagcagact tccaggagtc agccgccggt ccgagccttc 240 

ctgctcatct ccaccctgaa ggacaagcgc gggacccgct atgagctaag ggagaacatt 300 

gagcaattct tcaccaaatt tgtagatgag gggaaagcca ctgttcggtt aaaggagcct 360

cctgtggata tctgtctaag taaggattcc atatggctct catatcattc cattccatct 420 

ctgccaagat ttggataccg caaaaatttg tgttngngga agattctgnc tgaactcttt 480

cattcaagga actactacca tgaatctgca ttctgntgcc cacactgagg ncttagtaga 540

taattgggtg gtctgaaaca cctattatct cttatntctg gtctctangc tggnatgtta 600

attcctctga aatgntaaaa gtaatgggtg anaccngaaa aagaaatttc aatnacagat 660 

caanntgggg ngcatgtatn attttcaagc gtcaaaatgg aataagggaa gantnctgga 720 

tacctgcttg gaaaaggaag natgtgtatn 750 

<210> 464 
<211> 748 
<212> DNA 

<213> Homo sapiens 
<400> 464 

gnngtgtctt tgnaaagcct ttggggaann gncnccttct aatgcttggc tatcgntctt 60 

tacgcagnnc ccatcgattc gaattcggca cgaggccggc cggcgacgct ggcgacgctt 120 

tcgcccctga ggtagtttgg cgaccgcgaa gaaggaaaaa gggcgggcgg gcggctgtcc 180 

tctcaccgtc ctcaccccgc gaggcccggc ccgctcctcc gtcgtggatt tcgcggcgat 240

ccccccggca gctctttgca aagctgcttg aaacttctcc caaactcggc atggatacga 300 

ctgcggcggc ggcgctgcct gcttttgtgg cgctcttgct cctctctcct tggcctctcc 360 

tgggatcggc ccaaggccag ttctccgcag gttggntgct tctttcgttc tctcctctgg 420 

gggctctgaa gtttcaccag gtggacgctg gggagcgggc tcccgagcac ttgtctacct 480 

nccgccagtc ctgacaactt ttctggccaa cctacccagc ttcgcttggc tggcgagcgc 540 

atctgctgct ggggttcgcg gtgcaaatgg agacgcagtg gtggccagag ggtgatggag 600

aagacgggaa aagcgacagc cacgctnctg gcttgaagcc gcaggacgca aataacttac 660 

tttggacctg acagttctac gttgntgtgg angccctgtt tcctggaaat aaaactcaaa 720 

atggtggttt tttggaaaaa aaaaaaat 748 

<210> 465 
<211> 863 
<212> DNA 

<213> Homo sapiens 
<400> 465 

gggnnnnnnn aanggnnnnn ggnnnnngtc ccgttccaan gaccnngaga tcgnngncgc 60

tccanaagaa aggcggtgng aattcggcac gagacctgta ccgcctggcc actggctgtc 120

accggcgtga tgagctgccg gtgtttgaac gcngcctatg cngggacttt cccggcanan 180

nggcnngaan atggccncca tncaggaagc cgcccagaac ctcctnggnn acacnacttn 240

agngccttcn agtccgntgg nacccggncc aagccccggc aancnctgcc ccgggtcncc 300

gttcccaagg ccaaccagcc ctgggnaccc ccgggqaqcc gaaacnctgg ggctnggana 360

ccgngantga gagncncact tttcnntgta nacacgggcc cagganacan ctntgctcgt 420

ggccccgggg naaannnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 480

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 540

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 600

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 660

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 720

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 780

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 840

nnnnnnnnnn nnnnnnnnnn ncc 863



<210> 466 

<211> 713 

<212> DNA 

<213> Homo sapiens 



<400> 466 

ngtctttcga gcntggngnt cgttctngct cnannanatt ggttgnggga attcggcacg 60 

agcctcagcc ccacaccagc tctatttcag gggtgagagt cagagagcac tgcaatatgt 120 

gcntcatggg atttcgattc gaagatccta gaccagggag acactgtgag ccagggatac 180 

aacaaaatac taggtaagtc actgcagacc gacctccctg cagtttggga aagaagctgg 240 

gtttgtggag aatcagagca tcttgacatg actgctgacc taaagatccc tggcattggc 300 

cagggatcct gtggaacctc ttctagttca ggggtgtgag cattagactg ccagttgtct 360

agtgacatct gatgcttgct gtgaactttt aagatccccg aatcctgagc acctcaatct 420

ttaattgccc tgtattccga agggtaatat aatttatctg gatggaaatt ttaaagatga 480 

atcccccttt tttcttttct tctctctttt ctttccttct ccctttcttc tttgccttct 540 

aaatatactg aaatgattta gatatgtgtc aacaattaat gatcttttat caatctaaga 600 

aaatggttta attttttctc tttactctat ggcanttcac tcaantggac aggggaaaaa 660 

agtaattgcc atgggcttcc aaaagaattg ntttatgntt tagctatttn aaa 713 



<210> 467 

<211> 732 

<212> DNA 

<213> Homo sapiens 



<400> 467 

gnnnggtnnt ctaatncttg nnnnnnnntc ncccttctaa gccntggnct cgnctnnccn 60 

acnancnggc ttncgaattc ggcacgaggc gagatgaact acactgtgag gtggaggtga 120 

tcagccggca cttgcccgcc ttggggctta ngaaccgggg caagggcgtc cgagccgtgt 180 

tgagcctctg tcagcagact tccaggagtc agccgccggt ccgagccttc ctgctcatct 240 

ccaccctgaa ggacaagcgc gggacccgct atgagctaag ggagaacatt gagcaattct 300 

tcaccaaatt tgtagatgag gggaaagcca ctgttcggtt aaaggagcct cctgtggata 360 

tctgtctaag taaggattcc atatggctct catatcattc cattccatct ctgccaagat 420 

ttggataccg caaaaatttg tgtttgtgga agattctgtc tgaactcttt cattcaagga 480 

actactacca tgaatctgca ttctgntgcc cacactgtgg tcttagtaga taatttgggt 540 

ggtctgaagc acctattatc tcttatttct ggtctctagg ctggtatgtt aatcctctga 600 

tatgttaaaa gtaatgggtg agaccngaaa aagaaatttc aatacngatc aantttgggg 660 
tgcatgttga atttgcaacc tcaaattgga gtaagggaan attctggata cttgctggaa 720

aggaggaatg tn 732 

<210> 468 
<211> 748 
<212> DNA 

<213> Homo sapiens 
<400> 468 

gnnagnnttc taatngcttg tnnnnnniina gacgttctaa nnctttggcn atcgttnttt 60 

ctncagnann ccntcgattc gaattcggca cgaggccggc cggcgacgct ggcgacgctt 120

tcgcccctga ggtagtttgg cgaccgcgaa gaaggaaaaa gggcgggcgg gcggctgtcc 180

tctcaccgtc ctcaccccgc gaggcccggc ccgctcctnc gtcgtggatt tcgcggcgat 240 

ccccccggca gctctttgca aagctgcttg aaacttctcc caaactcggc atggatacga 300 

ctgcggcggc ggcgctgcct gcttttgtgg cgctcttgct cctctctcct tggcctctcc 360

tgggatcggc ccaaggccag ttctccgcag gttggttgct tctttcgttc tctcctctgg 420

gggctctgaa gtttcaccag gtggacgctg gggagcgggc tcccgagcac ttgtctacct 480 

tccgccagtc ctgacaactt ttctggccaa cctacccagc ttcgcttggc tggcgagcgc 540 

atctgctgct ggggttcgcg gtgcagatgg agacgcantg gtggccagag ggtgatggag 600 

aagacgggaa aaagcgacag ccaagctcct ggctgaaacc gcaaggacgc aaaataactt 660

actttgnacc tgacagtttc tnacgtttgt tgtggangcc ctgtttcctg ggaaataaac 720

tcaaattggt ggtttcttgg aaaaaaaa 748

<210> 469 
<211> 776 
<212> DNA 

<213> Homo sapiens 
<400> 469 

gggngntcta atgcttgnnn tgattctccg tctataacng gntaatnctt ggncctacna 60 

aaaggctang ngaattcggc acgagacctg taccgcctgg ccactggctg tcaccggcgt 120 

gatgagctgc cggtgtttga acgcaaccta tgctggactc tcccggcaga ctgcctggat 180 

atggtcgcca tgcaggaagc cgcccagcac ctcctcggca cacacgactt cagcgccttc 240 

cantccgctg gcagcccggt gccgagcccc gtgcgaacgc tgcgccgggt ctccgtttcc 300 

ccaggccaag ccagcccctt ggtcaccccc gaggagagca ggaagctgcg gttctggaac 360 

ctggagtttg agagccagtc tttcctgtat agacaggtac ngaggatgac ngctgtgctg 420 

gtggccgtgg ggcttnaann tnannnnnnn nnnccnnnac caantctncn nannannnnn 480 

ccnacnnnta aaantnncnn ncnnnnncan nnnnnnnnnc cnnnnanncc nnnncttnnn 540 

naancnnnnn nnnnnnnanc nnnncancna nnnnnnnnna nnncnnncnn nnnnnnnncn 600 

nnnnnnnnnn nannccnnnn nnnnantnnn nnannnnnnn nnnnnnnnnn aaannnnnnn 660 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nncncnnnnn nnnnnnncnn nnnnnnnnnn 720 

ccnnnnnnnn nnnnnccnnn nnnnnntnnn nnnnnnnntn nnnnnnnnnn nnnnnc 776 

<210> 470 
<211> 765 
<212> DNA 

<213> Homo sapiens 
<400> 470 

tatgnntttn ctaaaatncn tgggcaanac gtcctcnctt tcctaanagn ttnggcanaa 60 

cccttggcaa nacgccngtn acccanacnc agnnnggccg tggcgggcga gcgggcaaca 120

gctcttgagg agtgagactg cnggagatnt gggccgtgcc aaagagatgg atgagactgg 180

tgctgagttc atcaagagga ccatcttgaa aatccccatg aatgaactga caacaatcct 240

gaaggcctgg gattttttgt ctgaaaatca actgcagact gtaaatttcc gacagagaaa 300

ggaatctgta gttcagcact tgatccatct gtgtgaggaa aagcgtgcaa gtatcagtga 360

tgctgccctg ttagacatca tttatatgca atttcatcag caccagaaag tttgggatgt 420

ttntcagatg agtaaaggac caggtgaaga tgttgacctt tttgatatga aacantttaa 480

aaattcgttc aagaaaattc ttcanagagc attaaaaaat gtgacagtca gcttcagaga 540

aactgangag aatgcannct ggattccaat tgcccgggga acacagtaca caaagcccaa 600

ccagtcaaac ctacctacgn gggggactac tccagactcc cgnacncctt cacgtcctcc 660

tccatgctga ggcgcaatca ccgcttctgg gncaagaagt tanaaacnct gggaaaaact 720

acctncgaca agaaggggan catttanatt tacccnaaat gaana 765



<210> 471 
<211> 820 
<212> DNA 
<213> Homo sapiens 

<400> 471 

cnnnnngggn nnngnggcgn cntccnaaan ccggggcgac agngccnnng ttccaacaga 60 

ccnggngngc cgncngngcc ccanacagca ngggnggggc nnnggggnnn cnncgncnnn 120

cnnancnaca aagaactcaa caagaaaaaa acnaacccca caagcgggca aaggacgnga 180

acagacantn cccaaaagaa gacatacaag caaccnaaaa taatcnaaaa taagnnncaa 240

aaagaaaaaa ngcnagacag agnngngana gnactnagna aaaagngana tctagcggcn 300 

annagnangn nngnnnacgg ncngnnncna agaaanagnc nctggnnccc aagcnggagn 360 

acagcggcgc aagcnnggcn cactgcaacc gcgaacnccc gggctcaagc gaaccnccag 420 

cctcagcctc ccaagnagcn grmaaaggca ngcaccacca cacccgacna aaatancngc 480 

nancaanaac ananaanggc nccccngngc nnanncagga aanaaacacn cnnangcnnc 540 

ngaaaanaan naancncncn cnnnacaaaa aaacnnnagc cnnagaacaa nnnnggaggc 600 

ggaanacggn nnancccgac anganaanga nacnanngan gganganngg gaccaaaccn 660

cancccggga anggcnnngn aaaaaaaang ccnnnaaann gggggaaaan ncggngnang 720 

ccnaaagggc cnnaaanggg gaaacccnan naaaangccg ggcanannan aaccnagcnn 780 

nancnanccn nccaangggg nannnccncn nncnaggccg 820 

<210> 472 
<211> 738 
<212> DNA 
<213> Homo sapiens 

<400> 472 

gnngtgtctc taatgcttgg ctactngttc tttccgccga acncttgcta atgcttggcn 60 

ntcgttcttt ctccacnnac nnngcnntnc gaattcggca cgaggtcaca ganatnaaag 120

tccaatcata ggggctggnc cnacntctnt gctnntccct gcangantca tangatcagn 180 

nanaccgtgc gnntttgnaa gcntttcaaa tgtgntacca tcgngttact tncnnnggca 240 

cctgntgann tnggttgnac tnnncnggat nctccaaanc caccnnnncn atgggntnng 300

tgngcatgng ntggnncann nacagannna ganactttaa ngaannngnt tntgcaaccn 360 

tnggnnctag caancntgan antnccaggg nnggccacna agctgaaaat nnatgttana 420 

ncnnatgntg naatctctag natgacttcc ncannnancn aaactnangc anggctgcna 480 

tgttagaanc tanaggccna atttcttntc natgnaacca ntntatgctt ttaagaccnt 540 

caactgtnnc natgaagccc atntacatna ttncggtaat anggctatnc ttaaannnaa 600 

ctgctgaaaa tnatgatnca nctacgaaat cctnncancc ncatntggct naatcattac 660 

caaccatttg acaccnncat ngnctaccca cntgcattnc catgaccnan tccantgcca 720 
cccgcncaga tntacctt 738

<210> 473 
<211> 752 
<212> DNA 
<213> Homo sapiens 

<400> 473 

tatgnntncc taanagagtt ntgnnacacg gcccgccttc tnaaancttc ctaatncttg 60 

ggcgctcgtt ctntctncac ncagnnnntg cggtncgaat tcggcacgag gtccttttga 120 

accaccccaa agaactcaac atggcaaagc aaatggtaaa agcttcccga ctgttctact 180 

ttgggtccgc gcgaagccca ctcacgtgtg atctgtgttg cccctgggag gcccggggcg 240 

accggaaaag ggctctctca agttctgaaa agagaatctg ccaccagatc gaatttcgac 300 

ccctgagctt gttcggacgt atggtccaaa ttcagattaa ggtggtcacc caacccgaga 360 

tgtcaggaaa ggccttctgc agagaaaatg tccccccacc cgccatctgc agccaggtgt 420 

gtgccacacg gcagccttcc cgaaacatag tatggatttt aaaaatgtgt ntatttttgg 480 

ttctcaacca ctttataacg tattttttaa tttattttgt aatgtcttgt tttgaagtat 540 

tgctgctatc cttggtatcc ttcccactgg ttttatcact ganttatttt gngaaagttg 600 

ncactaatgt tctatgtcaa aatcaaaagt atttaatgaa atactanntc tatttaatgt 660 

ggntatggaa ccagctggaa acacaaaaca aacagtgatt gacancaagc tgggcccaag 720 

agncaggtca ttttgnacat atgccaataa ac 752 

<210> 474 
<211> 752 
<212> DNA 
<213> Homo sapiens 

<400> 474 

ttgcanacnn aatanttgct gtaaaagtcc cnnctttttn ccctttctaa tgnttgngcg 60 

ctcgntctnt ctccacnagn nnntgcgttn cgaattcggn tctnagccca tgccgggatc 120 

ttcccacacc cgtcctcaca gatccagccc cagcccctgt cttcccaggc catctctcag 180 

cagcacctgc aggatgcggg cacccgggag tggagccctc agaacgcatc catgtcggag 240 

tctctctcca tcccagcttc cctgaacgac gcggctttgg ctcagatgaa cagtgaggtg 300

cagctcctga ctgaaaaggc cctgatggag cttgggggtg ggaagccgct tccgcacccc 360 

cgggcgtggt tcgtctcctt ggatggcagg tccaacgctc acgttagaca ttcatacatt 420 

gatctccaaa gagctggaag gaacggaagt aatgatgcca gtttggactc tggcgtagat 480

atgaatgaac caaaatcanc ccggaaggga angggagatg ctttgtctct gcagcagaac 540 

tacccgnccg tccaagagca ccancagaaa gancctcanc cccagacagc acggnctaca 600 

cgcanctcgt gnacctggat gacntggaac anaatggtan cnaatgtggg accacngnct 660 

tgtancccna ggacaaggcc ctncnangct tgntggangg gtcnantcng anaaatggng 720 

gccactgccc aacccgcang aaganaacaa nn 752 

<210> 475 
<211> 742 
<212> DNA 
<213> Homo sapiens 

<400> 475 

gntttctntt aatncttttn naaangcgnn ntttaccntt ctangnntgn gnctcgttct 60 

ttcccacnna nnnnncggtn cgaattcggc ncgaggtgaa acagaaagtg gagatgcttt 120 

ccttgacctg aagaagcctc ctgcctccaa atgcccccat cgctatacaa aagaagaact 180

cttggatata aaagaactcc cccattccaa acagagcctt catgcctttc tgaaaaatat 240

gacagtgatg gtgtctggga ccctgagaag tggcatgcct ctctctaccc agcttcaggg 300

cggagctcac cagtggaaag tctgaagaaa gagttggata cagaccggcc ttccctggtg 360

cgcaggatag tagatccacg agagcgtgtg aaagaagatg acttanatgt tgttctcagc 420

cctcagagac ngagctttgg agggggctgc cacgtgacag ccgctgtcag ctcccggcgc 480

tcangaagtc cattagagaa agatagtgat gggcttcgtc tgcttggtgg acgtaggatt 540

ggcagtggga ggataatctc tgcccggacc tttgagaagg atcaccgctt aacgataagg 600

acctgcggga cttgagagac agagaccnan anaaggactt caaggacaac gtttcangan 660

anaanttttg gagaaagtaa ncntgtcttt tggtgancgt anaaanaaat gattcttacn 720

cnnaanaaga acccgaatgg tt 742



<210> 476
<211> 1122
<212> DNA
<213> Homo sapiens

<400> 476

gnnngggnnn ttctaaaagc tgggnnnnnn nnngaggnnc ttctaatnct tctaatggtt 60

ggctctcgtt ctttctncac gcagcnnngc gnnncgaatt cggcacgagc ctgcagccac 120

taatgcattg tgtatgataa caaaaactct ggtatgacac attttctgng atcattgnta 180

attagtgaca tagtaacatc tgtagcagct ggttagtaaa cctcatgtgg gggtggggtg 240

ggggtgtatn cctngnggga nggnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 300

nnnnnnnnnn annnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 360

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 420

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 480

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 540

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 600

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 660

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 720

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 780

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 840

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 900

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 960

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1020

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1080

nnnnnnnnnn annnnnnnnn nnnnnnnnnn nnnnnnnnnn nc 1122



<210> 477 
<211> 747 
<212> DNA 
<213> Homo sapiens 

<400> 477 

gnngtgcctt tgaaannccn tttgnnnnng nggcccttct aatgctnttn cgntcgnggg 60 

gtcgaactcg ccccacncng cnaggcgggg gctncaagcg attctaaacc acctatgagt 120 

atttctttta gggctcactt aaatacatgt ntgngnntac tgggggctag ccngaataat 180 

tttagatctg atcaggtngn ngctnaaatt ngaaaaanac cnnntngatg cttaaagaat 240 

tngcntccat ttttgagtct aaatctttta aaatntactg ngatccacat ctagngaaat 300 

gtcngtgtca anatattctn gatnatcgct naaatccnca ttaatactcn ttnggggtnn 360 

nnnatagngg aacttcntag nnntncnaaa agcacatngn cttcctgnct ccgctgctcc 420 

cacagnnggt nttgnaactg ggnaaatcag nnnnnngata gcgngngnnt ntnaganaaa 480 
ntngatncac acatncttnn nnctcagncn ncacatngat tgaacactct ggccaagatg 540

ctgnggngga tgangttgga gttcgannga agaagccngc gctggcctgg cttgnaagac 600

ccnngncttt cccntnccct cnctngaaag ctgcccngac ngaggccnaa ngnaaatggn 660

tganngnncn gtcrmgcccn cttcngncnc ttngaaccnn nnagnggnnc tnnnngnacc 720

cnnngnnntn cgngnaaccg nnccngc 747



<210> 478 
<211> 746 
<212> DNA 

<213> Homo sapiens 
<400> 478 

gnnnnnngcg cgnccttcta atgcttcnta attnnctngg atactcgttc tttctncagg 60 

natcnnntgc gnttcgcaag gagnagagtg atagnaattg gcagtgaaat atacgaacca 120 

ccctcctgcc ctctgggttc acaatacgtg tacacttgac tgtgaagtgg ctgtgagagt 180 

gggtggagag ttcttctttg accctcagcc tgcggatgcc tctagaaacc tcgtgttgat 240

tgcaggagga gtcggaatta accctctgct ttccatcctg cggcacgcag cagatctcct 300

cagagagcag gcaaacaaaa gaaatggata tgagatagga acaataaaac tattctacag 360

tgcaaaaaat accagcgaac tcctgtttaa gaaaaatatc cttgatttag taaatgaatt 420

tcctgagaag attgcatgca gtttgcatgt tacaaaacag actacacaaa tcaatgcgga 480

actcaagcca tacatcacgg aaggaagaat aacggagaag gagataagag atcatatttc 540 

aaaagagact ttgttctata tttgtggccc acctccaatg acagactttt tctccaagca 600 

actggaaaac aaccatgtac ccaaagaaca catttgcttt gagaagtggt ggtaggaggc 660 

aagaccaaag gcaggaaaaa attaangagg tgagatctac tcaaggagag ctcaaaaaaa 720 

aaaaaaaaaa actngggccc tttaga 746 

<210> 479 
<211> 750 
<212> DNA 

<213> Homo sapiens
<400> 479 

gnnnnnnnnn nngngnnnnn ttctannntt cntattnnct nggagctcgt tctttctnca 60 

ggatcccntg cgattcgaat tcggcacgag ggtagactgg ctagggatcc tggacccagg 120 

gttccacgta gcaacacctg ctgagttctc tgggttttct tcctgcctca tgtagcccag 180 

acttggagct gaagaagctg gaaacatgga aacaccaaca gctacagacc aaaaaaagtc 240 

ccaacaaagg cctgtcagtc tgccagcctg ttctgtggat ttccaactca agattgcagc 300 

atcaactcac acctgaagtt ctggcttccc tacaaacttt gaacttgcca gtccccacaa 360 

tggcataagc caattcctta aaatgaatgt ctagttctag ataatgtgtg tattctactg 420 

gttctgtttc tctggagaag cctactaata gatcatttgt cttagtcaat tcaagctact 480 

ggtacagatt accatagact gggtggttaa aactaccaat cttattactc acagtttttg 540 

gagtctggaa agtctgagat cagggttcca gcaggattga gttctttggt gaacatnctc 600 

tttctggnct acagaatact gggttacttt aagtnggaaa aagtagggtg aagctgggtc 660 

ntttggcctc ttcttttaag ggggactaat tcatgaaggg ttccaccctt attgacctat 720 

tttaccttnc caaanggntt ccattttccn 750 

<210> 480 
<211> 714 
<212> DNA 

<213> Homo sapiens 
<400> 480

gnnnnnngnn nngnggnnnt tcnaccgttc ttntgaccta gnctcgntnt nnccnacnna 60

gctaggcttg ngaattcggc acgagataac acacatcaca gtatgctctc agaaatttct 120

ttatttgaac cctataccaa tatctgntga tcaatgacca tttttgctca gcatggagaa 180

acagtgccct gcatgaaggg tagtgagaat aaaaaggatc ttaccacctt tatcatgagg 240

gtggctttgc tctctccatt ccaagttgtt ctctgttcta gaaagcagat gtagtagaca 300

tctactgttt ttgcctaaac agaatccctt tttccttttt ttgttaaaag tactcatccc 360

taatattaca ttgttctgga aggactgaaa ataacagaac tcagcaccat gatcggaccg 420

ggacaatcag attatttcat tcctcancaa acggagatcg atccgaaaag tggaaatatg 480

agctcttctt tggtgttggc atatggaccc tgagagaaag aactttaatt ttttctcttg 540

gactgcaata aagtatagct gcctaaaata cgtttcctga ccttggangt ttgnccacaa 600

tcggtgaaat aaangcaaga cgtacacttg gatgaaaaaa aaannnnnnn naaaaaaaac 660

tcgaccttta nactatnnga gtcgatacnt aatcngactg atagatcatt gnta 714



<210> 481 
<211> 742 
<212> DNA 

<213> Homo sapiens 
<400> 481 

agccnttcta aangccnttt gnctnngnnc cccnttccta anncntggct aatncttggc 60

nactcgttct ttctncacgc acccatcgnn ncgaattcgg cacgaggcat gaaaggagtc 120 

ggaagcggaa gcggtagccc ggacggtgct gtggtgcaag ggcttgtgga aaaattggag 180 

aaaaccaagt ccctggccca gcagttgaca agggaggcca ctcaagcgga aattgaagca 240 

gataggtctt atcagcacag tctccgcctc ctggattcag tgtctcggct tcagggagtc 300 

agtgatcagt cctttcaggt ggaagaagca aagaggatca aacaaaaagc ggattcactc 360 

tcaagcctgg taaccaggca tatggatgag ttcaagcgta cacagaagaa tctgggaaac 420 

tggaaagaag aagcacagca gctcttacag aatggaaaaa gtgggagaga gaaatcagat 480 

cagctgcttt cccgtgccaa tcttgctaaa agcagancac aagaagcact gagtatgggc 540 

aatgccactt tttatgaagt tgagagcatc cttaaaaacc tcagagagtt tgacctgcag 600 

gtggacaaca gaaaacagaa ctgaagaacc atgaagagac tctnctacat caccagaagg 660 

ttcagancca atgacaagac ccancaagca naagagccct ggggagccct ctgctgatcc 720 

caaanggcaa aaaatggggc cn 742 

<210> 482 
<211> 752 
<212> DNA 

<213> Homo sapiens 
<400> 482 

gnnngnggag nnttctaatg ctttgntcta gagtcncnnt tctaanggct tggnaatnct 60 

ngctcttgtt ctttntgcag gatcccatcg attcgaattc ggcacgaggc caagcctcgg 120 

cctccactgc acctgctgcg gagtggcacc tttgcctgca aggccctcta ccccatggcc 180 

cagtgtcatc tcagcagggt ctttggccac tcaggaggcc cttgtggtgg gttgctcagt 240 

ctgtccttcc ctcatgagaa gctactgctt atgtccacag accaggagga gctgtcacgc 300 

tggtaccaca gtctgacttg ggctatcagc agccagaaaa actagaggaa tcttatagat 360 

tccagaactc aggatacctc agggataggt cacagccaag agtacaaagg aatcttcagt 420 

actgaacaaa acagaaccct tcatgatttg acaaaggtca ctttctgttt gcctggacca 480 

agctactcca gatcatctga ccaactctta aaaatcacgg ccaggcacag tggctcatgc 540

ctgtaatccc agcactttgg gaagcaaang tggcaggatc attccagccc aggagtttca 600 

agancagcct ggcaacacag tgagtgagac cctgtctcta tttaagaaaa aaattattaa 660 
gaaattttat taaaaaagga agaatcagga aaccaaagtc aaccccaact taaccctcaa 720

tgaaccagcc ctaacacaga tgangggatt tg 752 

<210> 483 
<211> 849 
<212> DNA 
<213> Homo sapiens 

<400> 483 

gnnnnnattn ccctttnaaa tnccncngaa ancccttgga agcactaccn ctcngacccc 60 

tttggaacgn cgactnctnn atatatcnng gatataatag gtgataagtt ctgncaatta 120 

gtaacatcng gaaaaaacag ctnngncctg ggngaaaaag gatgccaaaa tngcctggaa 180 

aagagcagng gagaggagtc cgggagatgn gngatgcatc gggacgcanc atngntnaac 240 

attcactggg tctgccaaaa atgtggattt gngggctgct tagatngtta caaggcaaaa 300 

ggaaaggaaa gagttctaga gataaaagaa ctatatgctt ggatgaagtg tgtgaaggga 360 

cagcctcatg atcaccaaca tttaatgccc aacccaaaat tataccnggt tctgntttga 420 

cagacttcta gatgccatgc acactcttag ggaaaaaata ttgggattaa ancccatngg 480 

cattggacta acaaacagga atttacaagg tnggaaantt ttncnaccaa tgaaaggggg 540

gatcncaagg ttttccagaa nggntcntaa tcncaggnaa taaaaattnc tctngggcaa 600

gccctgagtc ttaancagca aaaanactcc tcccgaancc tgnagaaaaa agggggggca 660

gccaggcccn naaanggaan gtnaggcccn agatnaacaa ngtnacctcc ncccagnaaa 720 

ccccannccc caactggnac cngggnaacc cacaacnttt gcngaagncc aaaaaagncc 780 

nnnagangga aaaaaaaaaa naananaaaa aacctnnnag cccctaagaa accttagggg 840 

nggcccncc 849 

<210> 484 

<211> 1098 

<212> DNA 

<213> Homo sapiens 

<400> 484 

gnnnnnnnnt ttnnnnnttt ttgnaaaanc ccccttttgc naaatngncc ctttttntgg 60 

cangggatcc ccatntttat ntcggacatt ttcgggccac cggaaggggc cgggggcccc 120

cgggccncca ggnccgggna aaggcccccc ttgggcggcc cccggncggc cccaatgggt 180

tccaaaaagg gaaaaaaaaa aaagggggaa cctgggaagt tggcccanga aaangnaaaa 240

aaaggnaagn aaaccttccg ccaatgggaa tggggaaaaa taattttttc ttgaaaaacc 300 

caaaaaagga atggttattt ttcaaattta aaaaaggaac nttgggaaga aagaattggc 360 

ttcccacncg cagaaagggc attactggct atgtcaagta aaagaagtcc ttcaaagctt 420 

agttgatgat ggtatggttg actgtgagag gatcggaact tctaattatt attgggcttt 480 

tccaagtaaa gctcttcatg caagggaaac ataagttgga ggttctggaa tctcaagttg 540 

tctgagggaa gtcaaaagca tgcaagccta cagaaaagca tttgagaaag ctaaaattgg 600 

ccgatgttga aacggaagag cgaaccaagg ctntgcaaaa agagcttttc tttcactttc 660 

gagaccaaag gggaaccagc tnnaagggcn agaaaagttn gaaaaaaatt ccaaaggaac 720 

tggtggaatc ccccaaaagg tttggttggg gaaagaaaaa ttcccgcccc aangccaaaa 780 

tttaaaaggt ttngccccca aagggaaaag ncttgncctt taacccagga attggggacc 840 

ctgggantta aaaccnataa ttttcccgcc naattnnaaa aaattcnttt nggggncccc 900 

naaaanggna aaaaaatttt nggggggttt tggnaaggna aaaatttnaa atttggattt 960 

ngaaactttt ttngggaatt ccccagaaag aacttttgac cttccnttng acctnaaaaa 1020 

ttttcccttg ggggggtgna anggatgttc ccaagctttg tggnatattg gtaaaatttt 1080

naaccttttn tncttacc 1098 
<210> 485
<211> 798 
<212> DNA 
<213> Homo sapiens 

<400> 485 

gnnnnnnant nnnntttnaa atccttnntg aatcctttga antaccatcc cnttttncga 60 

attnggcacg aggaaaggtg gcgcgcttct cacggctgag ttgctgcgcc ttgcagacgg 120 

aagctcccca caggcagagc tgcttggatg tgtgagtcat gaaccagaga agccccgctc 180 

catgagcagt gactccccan gccctgtgac ctccctcctn cttgcagctc ctcctggcac 240 

cagtccccag ggctctcctg ttggtagttc ctgcttttct tcttggaaat tcctcgtgga 300 

cctcgagatc tttaccctaa aatagttctg ttgaatttca ccctggcaat gtaaattgat 360 

agcttatctt cacagatgcc agacaatgga caactcacca tcagtcctct gctcacctga 420 

gacaaatgca tgtctgattg cttcctctgc cctattgntt atgtgaaaat gcagattcac 480 

tgagccagac taaggcatca gtgactgttc ctctacctgc ctctcacatg gagattgtgt 540 

attcagtgaa aggctgatca aagacccaaa ggaatgcaac agtttatctc ttatctacct 600 

atgacctgcg aactggccaa caacccagtt gttgncgcct tttcagacag aaccagtgtc 660 

atcttacacg tattnaaatg gatgtcctgg ngtctnccta atatgtattc aaaagcaagc 720 

tggggcctng accacccttn ggcacatatt cctcanggac atcattcctg angctgtgtc 780 

actggcatgt ccttaanc 798 

<210> 486 
<211> 785 
<212> DNA 
<213> Homo sapiens 

<400> 486 

gnnnnnnttt gaaanccctt tcnaatnctt ggcattgntc tctttgcagg atccctcgat 60 

tcgctgacaa cttgattggg ttctccttca ggtttgaagc gccctcgaga agtgtctaaa 120 

ggagacagtt gatagccaaa caacagtttt ggattcactg actgattatg aaagaagcag 180 

tagactggta tcaagaatca gtcagcaagg aggccctcac cagacgccag tgccatgttc 240 

ttggacttct cagcctccat attcatgaac taagtttttg gaatccttag gcttccacgt 300 

gtggaaagcc tgagctaacc tactggagga tgagccatca cctggagcag attcaggcca 360 

tcctagttga agcctcccta ggccaagcaa ccgtccaact accagacatt gaccattcag 420 

ccttgaacat tcagcacaaa gacaaaacag accagaccag aagagtccca cagaataggg 480 

gaaactattc agagaaaact taagccacta agttttatgg tgttttgttc tgtagcagaa 540 

gcataggcat actgacaata caaaccgaaa tccttctaac gtagtggacc ttttcangcc 600 

agcatttttt ccttgaaaac ctggagcatg tatccatctt atagcagaga tcactttcac 660 

aatggttggg ctcttggatt tgaattgatg atgtaatgag ccctctttnc ngattgnaac 720 

ttaattactc tgggnatttg ntggattccc aaccttctaa tatttacttt tcctctttan 780 

taanc 785 

<210> 487 
<211> 797 
<212> DNA 
<213> Homo sapiens 

<400> 487 

ttgtnnnncc cttttnaaat ncctttggct anttgntctn tttgctngat cccatcgatt 60 

cgaattcggc acgagnnngg actaccttnc aaaaccnggt ngggaagcnt gttacagaan 120 

tgatntctan tcccctgnat tctggatgct gcagaccaac acctgccnac aanacncana 180 
cacacacann caancantat catgtaagac agnncgntna ntnnnnnatt ntnatncttn 240

nncatttacn cantnttgta nantggntca tgngtctata natnnttgta antattntnt 300 

gananangac ganantctga atcttaagca tatgctccat cnttnnatat gctntggtgg 360 

agaggctngc cntnattcat nttnncatgg agncaagttt aatgcctcta gantacattc 420 

tgggcttcaa gcatncttat ttnnaactcc ctgagtgatg ggtggataaa tcnaacattg 480

nctnagtggn ntcaagacaa ctttgntggt ggttttgntc acaatcatga aaatggttnn 540 

gccagataaa tattttgata ttagntttcn tttttnatat anngcggtag gtttgaattg 600 

nacnttnaaa tgnntngggt tgtnaagaca ntggnttnca atnnaattta tnacatgaat 660 

tggngnctcc cctttggnga aaccttaaag aanttntgna tacttcttca taaaagggtg 720 

tgngatttng naantttcgg gggttttnaa tttttnntga agcttatttc ntganaatnt 780 

acttggntta ccaagcc 797 

<210> 488 
<211> 762 
<212> DNA 
<213> Homo sapiens 

<400> 488 

caaatcnntt gctctngttc tttttgcagg atcccatcga ttcgcgacag ctctccaata 60 

ctcaggttaa tgctgaaaaa tcatccaaga cagttattgc aagagtttaa tttttgaaaa 120 

ctggctactg ctctgtgttt acagacgtgt gcagttgtag gcatgtagct acaggacatt 180 

tntannggcc caggatcgtt ttttcccagg gcaagcagaa gagaaaatgt tgtatatgtc 240 

ttttacccgg cacattcccc ttgcctaaat acaagggctg gagtctgcac gggacctatt 300 

agagtatttt ccacaatgat gatgatttca gcagggatga cgtcatcatc acattcaggg 360 

ctattttttc cccacaaacc caagggcagg ggccactctt agctaaatcc ctccccgtga 420 

ctgcaataga accctctggg gagctcagga aggggtgtgc tgagttctat aatataagct 480 

gccatatatt ttgtagacaa gtatggctcc tccgtatctc cctcttccct aggagaggag 540 

tgtgaagcaa ggagcttaga taagacaccc cctcaaaccc attccctctt caggagacct 600 

acccttcaca ggcacangtc ccccaaatga gaagtctgnt acccctcatt tcttnatctt 660 

tttacttaaa ctcaagaggc agtgacaggn agtcaggggc aagacattac atttttcata 720 

ctttcccaca tctgaaaaga tgacagggga aactgcaaag cc 762 

<210> 489 
<211> 822 
<212> DNA 
<213> Homo sapiens 

<400> 489 

ttnnnnnnct nnnggnnttt cnaatncttg tttctcgncc tttctgcagg atcccatcga 60 

ttcgaattcg gcacgaggat tttcgaaact cttcagctac ttgccctttt ttatctgaaa 120 

ccatcatacc ttctgaaaga aaaaagcata tcttcattga cataacagaa gtgagatggc 180 

ccagtcttga tacagatggt accatcntnt atatggagag tggcattgtg aagataacat 240 

ctttagatgg tcatgcatac ctctgcctgc ccagatctca gcatgaattt acagtacatt 300 

ttttgtgtaa agttagccag aagtcagact catctgcagt gttgtcagaa acaaataata 360 

aagccccaaa agataaacta gttgaaaaaa ctggcaaaat ctgtatacgt ggaaatttac 420 

cangacagag actgaagaat aaagaaaatg agtttcattg ccagatcatg aaatccaaag 480 

aaactttaaa gaagatgagt tgtgtaaatg gaactgaagg gagggaagag ctgccttcgc 540 

ctggtacaaa gcacacatgt gtatacacat gggtcaagca gtgctggtct gtggctgcct 600 

gtccagagga atgggaaata ttcctttgtc tttagcactt catttttcta aataaaaatc 660 

anccaatatg tctaaaaaaa aantttnttn ataataaacc tngaagccct nttanaacct 720 

tntnntggag gtcctnnttt accntatgat tcccggaact tggataagga atcccntttg 780 
gattgganat tttgggccna aaacccncna nncttggaat cc 822

<210> 490 
<211> 789 
<212> DNA 

<213> Homo sapiens 
<400> 490 

ntgtaancct tttcaaatcc cttggctact tgntctttct gcaggatccc atcgattcga 60 

attcggcacg aggccggacn gtgactctgg nnacgcttgc gnccntnacg tagntngnng 120 

accntgcang anggaanaan ggctggccnn cngntgtacn ctnaccgtcc taaccccgcg 180 

aggtccaggn ccgctccttt cggngnggat tctcgcggaa natccctccg gcagctcttt 240 

gcaaagctgn ttagaaactt ctcccaaact cggcntggat acgactgcta tagggctcgc 300 

tgctgctttt gtggagctct tgctcctcta tccttggcct ctcctgggat acggcccaag 360 

gccaagtntt cacgcangtt ggtacgctta tttcgttctg gactctgggg gctntgaann 420 

ttcaccacgt ggactgctgg ggancgggnt nccgancact ngnntacctt acnccanaat 480 

ctgacaactt ttctggacaa cctacccanc ttcaattggc tngngagcnc ntcngntgct 540 

ggggnntncn gtgcaaatgg agncncaatt ggtgggcaaa tngttgatgg ncaaaacggg 600 

aaaaagcaac nnncaangct tttggctnaa agccgatang acncaaatta nttnctttgg 660 

accttganaa tttcctcaan nnttttnagn anncnctttt ttncttggan aaanacttaa 720 

aagtgaacga ttnttgggaa anaaacaaac tataataact naaagctttt ntaaaaaaaa 780 

annaatnnt 789 

<210> 491 
<211> 790 
<212> DNA 
<213> Homo sapiens 

<400> 491 

tccaaaatnc ccttggantn attccccctt ncaatacctt tccttngnac actcccngtt 60 

tngntngatc ccatcgattc gaattcggca cgaggnaaca aagaaggaat gtcttcctca 120 

tgtttnggtc tatagaagac gttaaagaaa acttccagaa agtgggtttg aggcatgagc 180 

caccacgcct ggccaaagga tttaatgaat taatggatgt acagtgctgg ggctgttatt 240 

ctagggcctg cattgagact cacattttgc catcaaaagc cttttaagag gtggaggttg 300 

cggtgagctg acatggtgcc actgcactcc ggcctgagtg acagagtgag actctgtctc 360 

acaaaaaaaa taatgccctt taaataatga ataatagtga tagaaaatgt catttcttgg 420 

acaaatgaaa aattgaaatt aatgtatata attagatatt attagctact cttaggtagc 480 

ttcatttgtt gaaagtttga caagtgaatg aagttcacat ctggaaatcg ttgaacattt 540 

ttcgttcatg gaactcaatg gctacgttag tcgtttatgc ttttcactgt tgtggtaggg 600 

gctttggaaa gtnaatgcca tcaacaatgg atacagaang acctggattt ggaataaggg 660 

caaaaattta ttttgatggg gctgaattgc tctgccaggg agcattttgg gtattgagat 720 

gaaaatggcc tctctttgag actgagctgc cacctggcaa attattgnct gcttaanggt 780 

tctctttatn 790 

<210> 492 
<211> 804 
<212> DNA 
<213> Homo sapiens 

<400> 492 

tcnaaatccc ttttgnnagn ttcncncttt gtttcccttt nctnggctnc ttgttctttt 60 
tgcaggaatc ccatcgattc gaattcggca cgaggtcctt ttgaaccacc ccaaagaact 120

caacatggca aagcaaatgg taaaagcttc ccgactgttc tactttgggt ccgcgcgaag 180

cccactcacg tgtgatctgt gttgcccctg ggaggcccgg ggcgaccgga aaagggctct 240

ctcaagttct gaaaagagaa tctgccacca gatcgaattt cgacccctga gcttgttcgg 300

acgtatggtc caaattcaga ttaaggtggt cacccaaccc gagatgtcag gaaaggcctt 360

ctgcagagaa aatgtccccc cacccgccat ctgcagccag gtgtgtgcca cacggcagcc 420

ttcccgaaac atagtatgga ttttaaaaat gtgtttattt ttgtttctca accactttat 480

aacgtatttt ttaatttatt ttgtaatgtc ttgttttgaa gtattgctgc tatccttgnt 540

atccttccca ctgtttttat cactgattta ttttgtgaaa agttgtacac taatgttcta 600

tgtcaaaatc aaaaagtatt taatgaaata ctagttctat ttaatgtggg ntatggaacc 660

ancttggaaa cacaaaacaa acaggggatt gtacaagcan gcttggggcc caagnaaggt 720

caaggttcat ttggttacca tatgccnata aaacctcanc gaanttttaa aaaaaaaann 780

nnnnnnaaaa aancttggng ggct 804



<210> 493 
<211> 800 
<212> DNA 

<213> Homo sapiens 
<400> 493 

ggnncnnttt nccccccttt tgaaaacccc ttttgggnga ancccncttc tttnaaatcn 60 

cttggctact cgctctttnt gcaggatccc atcgattcga attcggcacg agtatataac 120 

aacttttgct ttcaaagttg ggtgggacta gaacacacaa tggaaggatg gagtcaggag 180 

acctggattc ttgtgcccgc tctggctttt acagtctgcc taactctatg cagtcacttc 240 

ctgccagcct gtttccttac ctacaagagg gagagacact ccctggccag cctagttctc 300 

agggtgaacg aaaggtcatt atcactgcat cctctagtca tttgcttctt cgctaattaa 360 

cacatcttga gcacctgcga tgttccagga acaggagatg gcagcgtgca agataaaagt 420 

ccctgacttc tagagactgc atgttagtgg caatcggcgt ctacccggcc ttcaataaac 480 

tactgaatga aggaaaattc tacctagcac cagacacaat tactgggttt ctaaaatgga 540 

attattcccc cggccccctg catccagcag cctgctgcag ggaagctcct ccgaagctgt 600 

aggcaggagc gggacaaatg cttgctatca gcttcacaga atgttaccta agtactattc 660 

ctacacagcg ccttacagaa caaacagtaa aaaccaaatg gnaagcatgc acnggcttaa 720 

aaactcaaac ttcctaacta ctcagtaatt anganggtca ttttacccca aaatagaatt 780 

ttcnatttat ccaataanaa 800 

<210> 494 
<211> 757 
<212> DNA 
<213> Homo sapiens 

<400> 494 

nggnttcnnt ctaactnaaa cngttnggna actcncctct ntctgtngat cccatcgatt 60 

cgctaacaag cgattctaaa ccacctatga gtatttcttt tagggctcac ttaaatacat 120 

gtttgtatat actgtattct agccagaata attttagatc tgatcaggta gtagctaaaa 180 

ttagaaaaaa acaaaataga tgcttaaaga atttgcatcc atttttgagt ctaaatcttt 240 

taaaatatac tgagatccac atctagtgaa atgtcagtgt caaaatatta tagattatag 300 

ctaaaatcca gattaatact catttggggt tttttatagt ggaacttcat agtaatacaa 360 

aaagcagatt gtcttcctgt ctccgctgct cccacagtag gtattgaaac tggtaaaatc 420 

agttttttga tagtgtgtgt atataagaaa aaatagatac acacattctt ttttctcagt 480 

caacacattg attgaacact ctggcaaaga tgctgtggtg gatgangttg gagttcgaaa 540 

agaagaagca agcgctggcc tgccttgaaa gaacccgaaa gtctttccca ttcacttctc 600 
tagaaagctg ccaagacaga ngcagaaagg aaatggatga tagttctgtc aagcacactt 660

ctgntctcnt agaacttaga aatggttcta agagaacaga agttatngag aacagttcnt 720

gtggaattca acatcttggg tgggacncat tggcttt 757



<210> 495 
<211> 756 
<212> DNA 

<213> Homo sapiens 
<400> 495 

ggnnnnnntc ttttcnaatg cttggctctc gttctttntg caggatccct cgattcgcaa 60 

gagagagtga tagaattggc agtgaaatat acgaaccacc ctcctgccct ctgggttcac 120 

aatacgtgta cacttgactg tgaagtggct gtgagagtgg gtggagagtt cttctttgac 180 

cctcagcctg cggatgcctc tagaaacctc gtgttgattg caggaggagt cggaattaac 240 

cctctgcttt ccatcctgcg gcacgcagca gatctcctca gagagcaggc aaacaaaaga 300

aatggatatg agataggaac aataaaacta ttctacagtg caaaaaatac cagcgaactc 360

ctgtttaaga aaaatatcct tgatttagta aatgaatttc ctgagaagat tgcatgcagt 420

ttgcatgtta caaaacagac tacacaaatc aatgcggaac tcaagccata catnacggaa 480

ggaagaataa cggagaagga gataagagat catatttcaa aagagacttt gttctatatt 540

tgtggccacc ttcaatgaca gactttttct ccaagcaact ggaaaacaac catgtcccaa 600 

agaacacatt tgctttgaga agtggtggta ggaggcagac aaaggcagaa aaaattaaga 660 

ggtgagatct actcaggaga gctcaaaann aaaaaaaaaa aaactnggac ctntagaact 720 

atagtgagtc gtnttccgta gatccagaca tgataa 756 

<210> 496 
<211> 744 
<212> DNA 
<213> Homo sapiens 

<400> 496 

ctttnaatcc cttgcactcg tcttntgnag gaccttatcg attcgaattc ggcacgagat 60 

aacacacatc acagtatgct ctcagaaatt tctttatttg aaccctatac caatatctgt 120

tgatcaatga ccatttttgc tcagcatgga gaaacagtgc cctgcatgaa gggtagtgag 180

aataaaaagg atcttaccac ctttatcatg agggtggctt tgctctctcc attccaagtt 240

gttctctgtt ctagaaagca gatgtagtag acatctactg tttttgccta aacagaatcc 300

ctttttcctt tttttgttaa aagtactcat ccctaatatt acattgttct ggaaggactg 360

aaaataacag aactcagcac catgatcgga ccgggacaat cagattattt cattcctcag 420

caaacggaga tcgatccgaa aagtggaaat atgagctctt ctttggtgtt ggcatatgga 480

ccctgagaga aagaacttta attttttctc ttggactgca ataaagtata gctgcctaaa 540

ataccgtttc ctgacacttg gaggtttgcc acaatcggtg aaataaaggc aagacgtaac 600

actggatgaa aaaaaaaaan nnnnnnaaaa aaactcgagc ctntagaact atgtgatcga 660

ttcgtagatc cagaatgata gatcattgtg agtttggaca accacactng atgcagtgaa 720

aaaatcttat tgngaattgn gatn 744 

<210> 497 
<211> 772 
<212> DNA 

<213> Homo sapiens 
<400> 497 

gnttgnngtn taantttnta aggatccctt tntntgaanc cctttctgca ggatcccatc 60 
gattcgaatt cggcacgagg caggagnaat cacttgaacc ctggaggttn cggttgcagt 120

gagcacagat catgccactg cactccagcc tgggcaacaa aacgagactt cgtctcaaaa 180 

aaaaaaaaca tagaatttgg atcctttggt cgggttctcc caaattcttt tgaggtgtcc 240 

atggtcaact gcttcagctt tgttttggca accccctgcc cgaagtcgca tataggctgt 300

tcttcacctt gtttccaagg ctgaggaaca gaaagtagcc tctgttttga ggaggtggaa 360

gttaagtata catttatttt ttactgtgac ttgttcagga ccacatttta caaaatgcct 420

tgtttccttc attgtttctg gaaaggaaag ttctattaat attgntttac tttgaatata 480

gaatagtttt tttaattagg gcttattttg aaaaattctg agtttaattc aaatgtatgc 540 

caataccttc caaagtaagg taatattcag agacagttgt tggtgatcag atggcttaga 600 

gaaaatttct ggaatattca cattcgaaga tccttattat gaatgtcttt gacttaaatc 660 

taaccaaaaa ctgcacatta ttctttgnac attttcatta tatagngtta acaagcttan 720 

ttgcaaacca ataaatactt aagctattta aaaaaaaaaa aaaaaaactc nc 772

<210> 498 
<211> 773 
<212> DNA 
<213> Homo sapiens 

<400> 498 

nttnagcnta nnagccgttg tantgaagcc cntttgctac ttgctctttt tgcaggatcc 60 

catcgattcg aattcggcac gaggacccag gtagaccagc tcaagagttc atgttctttg 120 

tcatcctcct gtgagctctc tgtaagtctc tntcttgccc atcaccacat ccctagtact 180

gggtatcagt ctggccactt ggctttctgg tttgccccaa tgtggtctat tcttgatgca 240

gctaccaaag taatgttnta aaaccattat accaagttac tatccttgtc aaaaccccca 300

gtaactgcca atctcactta gaataaaatc cggactcctg tgaagcacag nataaactgg 360 

cactgcctat gcagcaacct catctttacc gtttctgcct tgctcactcc cttcagcgcc 420 

ggtattcttc ctgatgcccc tagtacacaa caactccttc ctgctccaag agtaggaaaa 480 

tnactgtctc tctgccagtg agattcctct tctggtatta cctntgcttc attgctgaat 540 

cttctgcaat atcatcttct aaaaagagcc tttnaaaatc accttttcta ttatgcccta 600 

ctcantttcc agtccctgaa tggccattcc ccactttcat agccacttaa ttgctatctg 660 

aaattacact taaaatggtc accttcatga tgggaaggca attaattgcc tttgtcactg 720 

gtatgtctag agaacaagca gnttggctca tagtaggcac tcaacaaaaa ttt 773 

<210> 499 
<211> 735 
<212> DNA 

<213> Homo sapiens 
<400> 499 

gcttcaatan ctttttctaa ngctcttttt gcaggattcc atcgattcga attcggcacg 60 

agagtaccca nanttgcnag gagtntnntn actgatntag ccaggtggca atnatgagtg 120 

aatggatnaa naaaggcccc ttagaatggc aagatnncat ttacnnagag gtccnagtgn 180 

canccagtga cangaatgag tttnaaggga tgggttttaa ctacagaccc agnctctgcc 240 

aatatngacc ttgtgaactt ccttgaagat ggcancatgt ctgagaccgg aattatggga 300 

catgctgtgc agactgttga aactntgaat gaaggggacc atagagtgag ggataagctg 360 

atgcattttg ttcacgtctg gagactgcaa agcatacagc ccacaggatc tggaagagag 420 

aaagaacagc ctanagnaaa tggctngaga ngaaccacat tcccatcact gaacagggan 480 

acgcttcaag gactctctgt gtggctgggg ncctgactat ngacccacca tatggtcana 540

naaattncac cagctctnat gagantattn tgtcgcgtgt tcaggatctt antgaaggac 600 

atcttacant ttnccaanna naagncatga aatgtgacat tctgcttgaa naagacnata 660 

ttttatcctc atnaatgttt aaatgtaaaa nnnnananaa aanactcgag ctntnaaatn 720 
tngtgagttn anang 735

<210> 500 
<211> 926 
<212> DNA 
<213> Homo sapiens 

<400> 500 

tttaagccct ttctactnct cttttgcagg attccatcgn ttcgaattcg gcacgaggat 60 

ctctatacta gtgaacagtg ccagttccac actttggact tagaactgtt ctctagttat 120 

tgtaacacag aatactgtca atccctaatt tacttaatgt tacttattgg aagtggggct 180 

gatgaaatac gcacaggagg gaaatctact gtgtttaggc acaggcagnc ccagtgtata 240 

aggagatcat attccaaang gttgtcagtt ggntgtttgc aacctggaat gtattttcct 300

ttagagacca ngttatccat ggtggttagg cccctagagc agctggaaaa agatgatcaa 360 

accaataggt tngctgacat cnaataatgt aataagtttg ctaaaggaat ctaccatcaa 420 

atrxtnatatt gnttccaggg aaggttgtnn nttaanntnc cntcttngtg ncatantgga 480

cnntcccntn ccagtcatnt ncntnannnc tngggcnngt ntngnnttng tntntttngn 540 

cnnctnanca atatttcata tcncccctng ctaaaattct ttnanannaa nttctcantt 600 

tctcccttta ctanaanttt ngtntttnnt ccntttanta tttnnnccta tntntntcgt 660 

tcnnanatnt cattnnntnn ttntnngctn ntnnatcacc cttanctcnn tctcanntat 720 

cntnntcnta ttatctctnt attnntcnct tntnatnatc nttccnnntt gtntanncna 780 

ttatntcttg ttnntntnct cncatctctn tcntnttctc ngctnannnn actccnnnnn 840

tcncnctcnt nnnnanatnc atatnctnct ttngntatat annnnnntnt ntacntanct 900 

cnnnatnnca tnncnatatn nttngt 926 

<210> 501 
<211> 706 
<212> DNA 
<213> Homo sapiens 

<400> 501 

naatncttgg ctcttgttct ttntgcagga tcccatcgat tcgaattcgg cacgagaatg 60 

caaagggctg cagttctcat tcaggctact ttcaggatgc acagaacata tattacattt 120 

cagacttgga aacatgcttc aattctaatt cagcaacatt atcgaacata tagagctgca 180 

aaattgcaaa gagaaaatta tatcagacaa tggcattctg ctgtggttat tcaggctgca 240 

tataaaggaa tgaaagcaag acaactttta agggaaaaac acaaagcttc tattgtaata 300 

caaggcacct acagaatgta taggcagtat tgtttctacc aaaagcttca gtgggctaca 360 

aaaatcatac aagaaaaata tagagcaaat aaaaagaaac agaaagtatt tcaacacaat 420 

gaacttaaga aagagacttg tgttcaggca ggttttcagg acatgaacat aaaaaaacag 480 

attcaggaac agcaccaggc tgccattatt attcagaagc attgtaaagc ctttaaaata 540 

aggaagcatt atctccacat tagagcacag tagtttctat tcaaagaaga tacagaaaac 600 

taactgcagt gcgtcccaag cagttatttg tatcagtctt attacagagc tttaagtcca 660 

aagatatcaa atatgcacgg gctgcacact aatcagtctt ctatca 706 

<210> 502 
<211> 784 
<212> DNA 
<213> Homo sapiens 

<400> 502 

ttnntttttt tggttaccct ttgctctngg nctttttgca ggatccctcg attcgaattc 60 
ggcacgagcc ttccacggtt atttcacaga tatggagagc tggaagcagg gagtgagtct 120

ctgagtgttg gaattgtaag ggatcagaag cagggatcag aagcagtggt gaagttcatc 180 

caccataaaa cacacaggtg actttgcctt gaatctgcag gactgaagcc aactcttggg 240 

cacagaccct tagtcccttc cttggccact ctaagtcaga tagtccagag ccaggccctt 300 

tgggatgtga caccgagata aatcagagaa aagctgtgaa gcttggggaa cagagggact 360 

tttggtgaag taggtggtct gcagtttcta tcttcttggg aaaagcaagc tggaaaagtg 420 

aacagtggtt ggtaggccat agtgctccca gctgggtgac ataatgacca cacagcacag 480

tgatgttatt agcaactgtg tggtggagta gttgtgggct ggacaaatca atcgtgtgga 540

aattgttagg agttttatta cattaaactt gttaacctaa aataccatca aaaaaaaaaa 600 

ntncnnannn nccncccacc nancntncna aaaaaancct cganccttta aaaacnnntn 660 

gnngaggccn tatttacgtt anattccaga cnttgaatan ggatnccatt tgnattgaaa 720 

ntttngggcc aaacccccaa ccttngaatt gccattngaa aaaaaaatgc cttttatttt 780 

gnnt 784 



<210> 503 
<211> 764 
<212> DNA 

<213> Homo sapiens 



<400> 503 

ttnntnttcc ttgaancctt tttctacann cncctttgca gatcccncgt tcgaattcgg 60 

cacgagagac aaagaaaagg tggcaatcat agaagagttt ntagtaggtt atgaaacctc 120 

tctaaaaagc tgccggttat ttaaccccaa tgatgatgga aaggaggaac caccaaccac 180 

attactttgg gtccagtact acttggcaca acattatgac aaaattggtc agccatctat 240 

tgctttggag tacataaata ctgctattga aagtacacct acattaatag aactctttct 300 

cgtgaaagct aaaatctata agcatgctgg aaatattaaa gaagctgcaa ggtggatgga 360 

tgaggcccag gccttggaca cagcagacag atttatcaac tccaaatgtg caaaatacat 420 

gctaaaagcc aacctgatta aagaagctga agaaatgtgc tcaaagttta caagggaagg 480 

aacatcagcg gtagagaatt tgaatgaaat gcagtgcatg tggttccaaa cagaatgtgc 540 

ccaggcttat aaagcaatga attaaatttg gtgaagcact taagaaatgt cattgagatt 600 

gagagacttt tataggaaat cactgatgac ccagtttgac tttcatacat actgtatgan 660 

ggaanattac ccttagnatc ttatggtggg actttattta aaaacttnca nnaatgttcn 720 

ttcgacagcc ttccatttta acttcnaagg cnncaangaa ttnt 764 



<210> 504 

<211> 795 

<212> DNA 

<213> Homo sapiens 



<400> 504 

ttgtacntct ttttnnaaac ccntngctac ttgttctctt tgcanggatc cctcgattcg 60 

ggaatctcct agaaagttgt gattttcgag ccatatcctt ctgtggtaga tcctaatgat 120 

cctcagatgt tggccttcaa ccccaggaaa aagaactatg atcgagtaat gaaagcactg 180 

gatagcataa cttctatcag agaaatgaca caagcaccat atctggaaat caagaagcaa 240 

atggataaac aggaccccct tgctcatccc ttactgcaat gggttatatc aagtaataga 300 

tcacatattg tgaaactgcc agttaacagg caattgaagt ttatgcatac tccacatcag 360 

ttccttcttc tcagcagtcc accagccaaa gaatccaatt ttagagctgc taaaaaactc 420 

tttggaagca cctttgcatt tcatggctca cacattgaaa actggcactc ctcctganga 480

atggtctggt ngttgcttct aatacacgat tgcagctnca tggngcaatg tatggaagtg 540 

gaatctatct tagtccaatg tcaagcntat cattttgntt actcagggat gaaccangaa 600 

acagaaaggt ntcagcccag gacgagccac cttcaagcng ttaanaagcc agcaattaca 660 
ttcacagtcn ccaggaaana aaaggncagn cctatccccc ctttncctgg caaaaggccc 720

gtnaacctta aanaaactgc ctttagccct ttatnntgga aagtggattc ncncttnatt 780

cttggacccc tgncn 795



<210> 505 
<211> 774 
<212> DNA 
<213> Homo sapiens 



<400> 505 

tnntntnntt nantngaacc ctttnctctt gctctttttg caggatccct cgattcgaat 60 

tcggcacgag cacaaggaga agaagttaat taacattgaa ngatgagaag acatcttgga 120 

agaacttgaa ttgggccttg gaagaagaac agccattcaa atagatagaa ttgtggtagc 180 

aaaggcatag aggtaggaaa gtatagatct ccagggacag tagtcatggg gttggggcac 240 

tgttggaatt taaggttgga aggatatatt ggagcccctt gaatacggta acaaggcaca 300 

ccttgggcag tggagagtta tcagagtgtt tgaaaaggag ggttattgag taaataaata 360 

gactggtact ttaggaattt taaaatgtgg atcattgtac tactaataac tatttatttt 420 

atatttacta tctactaagt aatttacatg tattttcttg tactgactgt aaaccttctg 480 

ggtgtgggtg ttttaagtgc cattttactg atnaagaaac tgaggcttaa atagttgaaa 540

taagtcaccc tgttagtgag tggccagaat gacaagtcag atctanggtt tgtctaactn 600

ccaaagatna tataaaaata atggatctct ccttttccct tatgcataaa atatggggag 660

cntttttaaa tcattaccca tncgattgnc caaaaaaata cctttnggga aaactgatta 720

ttantattcc anaataaatt tcaacggcct gcntngnctn ctttacaact ttnt 774 

<210> 506 
<211> 796 
<212> DNA 
<213> Homo sapiens 



<400> 506 

gccnccccnn tttngntctc aacttgtacc ctttttgcan nancncgnnc tncttgcagg 60 

ntcccatcga ttcgaattcg ccacgangtt atattaaatt attctttgtt tttctttttc 120 

ttttaataaa gcctgcaagt tactaaattg tagtttcata aattctgtag taaagtatca 180 

tcttggcagt gtgccaaagg tgaaaatgat gctttctcta acagagaaat tcttagtgac 240 

tccagtcgta gaaaaacgtc tttacaacct gaataagatt gaagaattgt gaacatacca 300 

tggcctattg gatgaatcat ttgccgtagg ctaaatcaga ctgtagggtt tgtgatggat 360 

ttatggagta tgtgggtata gaaatcatga atctagcatt tgttttcaga gattcaagca 420 

tagtcttaag ggtanatcag aaatgacaaa tgaattcaaa acctagcagg tgcattgtna 480 

atgtgtgccc agttntgttt tggaaatggc agttccttgg ggtcatgttt ctactggcaa 540 

aatttgcaat antgtnctat tgtntgtaat ttcaaaattt ataagattat cccccgttcg 600 

cccaagtaaa acctgtnctg cccaatanaa tcctggantc gnngagaaat cgcntccatt 660 

cgnngntcaa ctcgggatnc ntcgncttaa naaaatnttn tccnggancc ccntcatnan 720 

gaanaacacc anactattnn gggnacctgn aangctcaat ngcccnngcc ncnnangncn 780 

nttttccngg naannn 796 

<210> 507 
<211> 774 
<212> DNA 
<213> Homo sapiens 



<400> 507 
ctnntttntt ttngaancct tngctcttgt tctttttgcg gatcccatcg attcgtgaag 60

aggagacggt gacctgggct ccttatgtgc ctgaaagagt ttgagtttcc tgttaactcc 120 

aaatcaacag tattttcaac aagaaatgtg caattgaaat caagtgctgt ttaagtgcag 180

ctaggatttc cacaggaaga cacttgcagt gaacagagtt atggagcagc aaaaacacag 240 

atctatttgg aaaaagagaa aacatatgcg ttgtattttg cttcaattat aaaataccat 300 

cctctcaaag gtggttctaa attacaaagg actttgattt ctaggtagat tctgggtaga 360

gacttccttt catattgagg cattaatgac accttttaac ctgggaagca atatgactgg 420 

agttgtactt tgagaagatt aatcaggttt ggttgcagaa tgaaagagaa gatgaagtca 480 

agagattggt ttagaggctc tagcagaagc ttagtcatat ttcaaaatga tcaaatatca 540

agaaaaattc tgagctgcat aacttgtata aagtaatttt cagtgatttt ttcatggtta 600 

tgatnaaaga actggattta nccagaaacc tttacctgga ttcaagattt aatttttcct 660 

ttgagcctca tccttaaagg attttcggga aaacattaag gggagccaaa nccnattggn 720 

tggttgggcn tgccctnnaa ttgcctttgg acttttttaa ccgggctttt gnnn 774 

<210> 508 
<211> 724 
<212> DNA 
<213> Homo sapiens 



<400> 508 

cttgcctttg aaaancgttg gctactngtt ctttttgcag gatcccatcg attcgaattc 60 

ggcacgaggc ggcgctgacc cggccggccc cacacccgct cttcctcttc tttgccgcgg 120 

actccctttc ctgcctccaa gacctggtgt ctcccactgt gagcccagct gtcccacagg 180 

cagtccccat ggacctagac tcaccttccc cttgcctcta tgaacctctg ctgggcccag 240 

cccctgtccc agctcccgac ctgcacttcc tgctggactc aggcctccag ctccctgccc 300

agcgagcggc ctcagccacc gcctcccctt tcttccgggc cctgctgtca ggcagctttg 360 

cagaagccca gatggacctg gtgcccctgc gaggtctgtc gcctggtgca gcctggcctg 420 

tcctgcatca tttgcatggt tgtcgggggt gtggggctgn nntggggccc gtgcccacac 480

cangcnancc cctgtatggg atcanaggcn cgaagangca ntgnangctg ntggcanntn 540 

aantactgnc tgggctggaa nangaactnn taaaagtcnt ngcccnnatc caccttggna 600

cccnannttn nnccnntant cnnngggntn angtggtnnn nnctngggac agntcnntnt 660

ggnntgncna tngnncnnat gnanacttgg ggttcannaa ncntttccnn atgnaancng 720 

ngtc 724 



<210> 509 

<211> 803 

<212> DNA 

<213> Homo sapiens 



<400> 509 

tnnnnnttta tttcnttcgt tctngntttt attacatcag ctcttttctt tttgcggtcc 60 

ctcgttcgca attcagagac acacataaga aactggaaga agagaaaggc aaaaaggaaa 120 

aagaaagaca ggaaattgag aaagaacgga gagaaagaga gagggagcgt gaaagggaac 180 

gagaaaggcg agaacgggaa cgagaaaggg aaagagaacg tgaacgagaa aaggagaaag 240 

aacgggagcg ggaacgagaa cgggataggg accgtgaccg gacaaaagaa gagaccgaga 300 

tcgggatcga gagagagatc gtgaccggga tagagaaagg agctcagatc gtaataagga 360 

tcgcagtcga tcaagagaaa aaagcagaga tcgtgaaagg gaacgagagc gggaaagaga 420 

gagagagaga gaaccgagag cgagaacgag aacgggagcc gagagagaga gcgagagagg 480 

gaaccgggag cgagaaagag aaaaagacaa aaaacgggac ccgagaagaa gatgaagaag 540 

atgcatacga accgaaaaaa aaaaaaaaaa aactcgagcc tnttaactat agtgagtcgt 600 

attacgtaga tccagacatg ataagataca ttgntgagtt tggacaaccc ccacttgaat 660 
gcagtgaaaa aaatgctttn tttgtgaaat tttgngatgc tnttgctttt tttgtaacca 720

tttttagctt gcaataaaca agtttnccac caaccanttg cnttcatttt nttntttcan 780

gttcaagggg aagtttttgg aag 803



<210> 510 
<211> 789 
<212> DNA 
<213> Homo sapiens 

<400> 510 

gntttnnnnc nnttttaatn tacatacanc tacttgttct ttttgcaggg atcccatcga 60 

ttcgaattcg gcacgaggga acccccacca ttaagctaaa gtaaaaccct tttgagggaa 120 

gagggagact ggggagaagg gaaaagagag aaggcaggga gagtagggag agaaaacctt 180 

ccagcagccc agtaaactgc gggcgaagag atctacccgt ctccctccct cccacagtta 240 

ccattggcct tgtcatcgca agcatttgac aaagacttgc ttgtttgggc ctgtcacctc 300 

ctgaaaggct gctttagctg tggatgccct tgattaaggg agagagcgcc taggagctgc 360 

ctgccccanc tggggtgacg gctgtagggc tgggtctatg ttgcaagccc tatatcctan 420 

catgcagtgg aaagtgctta gctctctccc tcctgacctc tgggcagcca gtcatcaaag 480 

cagagagacg tggcggcatg tgggcagcat gcccaggttc cttgctgact cagcacttat 540

ttctgtagtt ttaaaaaaga atttaatgtt tttggttgta tttttttggg ggggtgaggg 600 

tgggcaaaaa catgggggta gttctgagtt gttagaaatg tttctgaatc aagtttgttt 660 

gaaaacacgt tgtgcctttg tacccattat aagatggtca taanacccaa gaactgataa 720 

gctttgggtt ttttttggtt tggtttggtt ttttgcttca ttttacccat tcatgcctag 780 

ggtttccat 789 

<210> 511 
<211> 776 
<212> DNA 
<213> Homo sapiens 

<400> 511 

catanagntc ttgccttttt gnaggacnct cgattcgaat tcggcacgag cccccatctt 60 

cactggttat tccacttatt taaaatgtcc agaataagca aatctccata tagaggaagt 120 

agattagtgg ttgcttcggg atgggaggaa tgggaagatt gaggtctttc ttttgcagtg 180 

ataaaaatgt cctaaaattg actgtagcga tggtcacaca actctgaata tgcttaagac 240

cattgaatta cacactttac gttggtgaat tgtatggatg taaattatag ttcaataaca 300

tagttacaaa agataatcaa aagcatgaaa gcactgttga tgtggnttgg atctgtgtcc 360 

tcaccgagtc tnatgttgaa atgtaagccc cctggtggga ggcgatggga ttatggggca 420 

gantcctcac aaacgggtta gcccacccgc tcaggctgtt ctcctgatat tgagtcctca 480 

tcacatctgg ttgcttcaaa gtgtgtggng ccttccctct atctcctact gctctggcca 540 

tataagangt gcctgcttct ccttcgcctt ntacatgatt gtaaagtttc ctgagcctcc 600 

tagaacnaaa gctgctgngc tttctgtcca tctacangan cgtgagccca attaaacctc 660 

tttttttttt ttnngaggnn ntttnntnnc nntccnnnca ntttnanann cctngnanng 720 

gtttnnaaaa anaananngn naannnnnnn nncccccngc ccttttaaaa taaaaa 776 

<210> 512 
<211> 917 
<212> DNA 
<213> Homo sapiens 

<400> 512 
ttatttcata aactattgtt ctttttgcag gatccatcga ttcgaattcg gcacgagggc 60

tgcgaggttt tcggctttgg ctcctgatat gcagcgacag aattttcggc ccccaactcc 120

tccttaccct ggtccgggtg gaggaggttg gggtagcgga agcagcttcc ggggaacccc 180

£T99cgggggc ggaccacggc cgccctcccc tcgagacggg tacgggagtc cgcaccacac 240

gccgccgtac gggccccggt ctaggccgta cgggagcagt cactctccgc gacacggcgg 300

cagcttcccg gggggccggt tcgggtctcc gtcccctggc ggctaccctg gctcctactc 360

caggtccccc gcggggtccc agcagcaatt cggctactcc ccaaggcagg annanaanca 420

nccncanggt tntncaagga catntacacc atttggatca nggcgtntta naaaaaaaan 480

aatgttaatg anttggaaaa ntatttnaaa gcctttnaat gnttnnnnna atccttnggg 540

nttggcctta naaanccaan attntngtng gngggntntt aannccnnnc aantncnnnn 600

nnattncntt naaaacnttt nnnccanggn cnnaaaaaaa nggggnaann aaaaaacttt 660

tttnnttnaa nnantttttt tggaaaattt naaancntng gaaaancntt tnnntngttn 720

ntnangggaa annantnttt tgggnncnaa aaaacntttt naannnntnn nggttnnnan 780

nnnttaaaaa ntttnnnccc ccaannnnnt nnanngnanc ttttnnantt ngggantaaa 840

nttnnnnnna nggggnnttt tttnngnnna atttnnnnnn annnnnnnan nnangggnnt 900

ttngnnngna annntnn 917



<210> 513 
<211> 780 
<212> DNA 

<213> Homo sapiens 
<400> 513 

tnnnnnnttt aaatccatta gctacttgtt ctttttgcag gatcccatcg attcgtgcgg 60 

gagcacccga gcctgcggct ccagacggac gcccgcaagg tgaggtgcat cctgacaggt 120 

cacgagctgc cctgccgcct gccggagctc caggtctaca cccgcggcaa aaagtaccag 180 

cggctggtcc gcgcctcccc ggccttcgac tatgcagagt tcgagccgca catcgtgccc 240 

agcaccaana acccgtangt ggtccncggc ggcgcgggga ggcccagggc aatnngacag 300 

nccctccgnt tgactccgcc agtgctgcag nccctactct ttcanagttg ggagccctgg 360 

gacccaggca ccaattgttc ttgcaaactc accctgcggc acatcaacaa gtgcccanaa 420 

cacgtgctga ngcacaccca aggccggcgg taccagcgag cttttgtgta aatatgaaga 480

atgtctnaag caaggggtgg agtacatgcc tgctgcctgg tgcacccgan gangaagang 540 

gaaggacaaa tggacngtga acggccttcg cccgcgggaa agcttctggg agcccacatt 600 

caatgatgaa gggggagctg caagtgatga cagcatgaca gacctgtncc cctgactttt 660 

caccagaagg accttgaaca cngaggatgg ggatggactg atgatttttg acaacaaaga 720 

ggttgaaagg caaancccca aaaaaaaggc cttgtgaagg cagganaaan acaacctntc 780 

<210> 514 
<211> 793 
<212> DNA 

<213> Homo sapiens 
<400> 514 

tttnnnngnt ttannncatt ttgctactng ttctttttgc aggatcccat cgattcggaa 60 

ttatagtatt gacgtgaatc ccactgtggt atagattcca taatatgctt gaatattatg 120 

atatagccat ttaataacat tgatttcatt ctgtttaatg aatttggaaa tatgcactga 180 

aagaaatgta aaacatttag aatagctcgt gttatggaaa aaagtgcact gaatttatta 240 

nacaaactta cgaatgctta acttntttac acagcatagg tgaaatcata tttgggctat 300 

tgtatactat gaacaatttg taaatgtctt aatttgatgt aaataactct gaaacaagag 360 

aaaaggtttt taacttanag tagccctaaa atatggatgt gcttatataa tcgcttagtt 420 

ttggaactgt atctgagtaa cagaggacag ctgtttttta accctcttct gcaagtttgt 480 






tgacctacat gggctaatat ggatactaaa aatactacat tgatctaaga agaaactagc 540

cttgtggagt atatagatgc ttttcattat acacacaaaa atccctgagg gacattttga 600 

ggcatgaata taaaacattt ttatttcagt aactttnccc cctgtgtaaa gttactatgg 660 

tttgggggta caacttcatt ctatagaata ttaagtggga agtgggtgaa ttctactttt 720 

tatggttggg gtggaccaat ggctatcaag agtgacaaat naaggttaan ggatgattcc 780 

caaaaaaaaa aaa 793 

<210> 515 
<211> 770 
<212> DNA 

<213> Homo sapiens 
<400> 515 

cttattncat nnagctcttg ttctttttgc aggatcccat cgattcgaat tcggcacgag 60 

gttgtattgg aaagcagtag tgtggacgaa ttgcgagaga agcttagtga aatcagtggg 120

attcctttgg atgatattga atttgctaag ggtagaggaa catttccctg tgatatttct 180 

gtccttgata ttcatcaaga tttagactgg aatcctaaag tttctaccct gaatgtctgg 240

cctctttata tctgtgatga tggtgcggtc atatttatag ggataaaaca gaagaattaa 300 

tggaattgac agatgagcaa agaaatgaac tgatgaaaaa agaaagcagt cgactccaga 360 

agactggaca tcgtgtaaca tactcacctc gtaaagagaa agcactaaaa atatatctgg 420 

atggagcacc aaataaagat ctgactcaag actgactctg atagtgtagc attttccctg 480 

ggggagtttt ggttttaatt agatggttca ctaccactgg gtagtgccat tttggccgga 540 

catggttggg gtaacccagt gacaccacac tgattggact gccctacacc aatcagaact 600 

cagtgcccaa tgggccactg ttttgactcg gaatcatgtt gtgcactata gtcaaatgta 660 

ctgtaaagtg gaaanggatg tgccaaaaaa ttaaaaaaaa ccnccaaaaa agcttccaaa 720 

aaaaaacctt taaactatag tgagtcgtnt acntagatcc aacatgataa 770 

<210> 516 
<211> 825 
<212> DNA 

<213> Homo sapiens 
<400> 516 

tttccagttt tanttttttc ancttttnga tcnntttgca ggatccntct tttcgaattc 60 

ggcacgagat tctccctaaa ttgtngatcc cactgtttac naaactgttc tnttgtgctg 120 

gcntgctnan tgctntgtag nncctttctg nacnntaggc attgctcttg gagaacnnga 180 

tgtgctttnt ntnaaanggc anaccagngn tgnnctgnnt ttaatgatgc agancctnac 240 

tttatccaca cctggcccgt ttnacatttn agtaangnac gatatttggc tgatggctga 300 

acantttctg aaatacacnt ttagtgtatg gaantacaag accnntaaag gnctgccagg 360 

ttancatctc atctngcatt cnnntccttt ggcnanaaag gganatntca gaattatatt 420 

tcttgatggg gtcttttcaa tcantgtatc tgtcgaaann tcttaganaa anctatgtgn 480 

tcncggtgtt gtctaaaaan atnctttcaa anatgacccc tggaattncc tgananangc 540 

ttaaacgtga gaagacnggt nggcaaaaca ccctncnaag gttnttggna angcccnant 600 

ntgttttgtc tggcccatat aancttngcn ccattnaagc cncgggngag ctttgnatnt 660 

atattngngg ngttactttc tttgnncctt tgcggggaac ancttnnata atgcttntcn 720 

ncccnanntg gacntttgct ttttgnnncc nnaccccccc aaagggngcn cacctccant 780 

gaaaaagtct ttttnnaaaa gggctccttn ctnaaaaaaa nnnnt 825 

<210> 517 
<211> 1444 
<212> DNA 
<213> Homo sapiens



<400> 517 

ctctcncnnc nnnncnnntc tctnncnntn nnnnntnntn nnnctcnnnn cnnnatctnn 60

nnncnnctnn nnnnncntnn cntccntctc ttntntngct ctcntntctc ntncatcttn 120 

ccnctattnt cntnntnntc nntcntcnnn antnctnnnt tctncctnnc canctntcca 180 

tnntntactn tcnntnntct ggctnttnta tntggggggt ctatttnttn ncttaaatcg 240 

actngttcca agtctcntan cngcntctnt ctnnctntct ntgcnctncn ctggggcntt 300 

aattncccnn gctnttatan aagngngnaa ttaaggtntc nnntctanng ctntgcaagg 360 

ctaatgntta gatccngnta gaanncgnta catgttggga acngacanct tnctgcncaa 420 

agngggctna ggcanngnnn tntgcaaann ctcnnntntc nnancttgnn tcncgtagan 480 

cggnnncccc tgaattttnn ancnngganc nttaaatnnt ntngnggtac gannccncnn 540 

ncgnnnnnnc gnntannccn canngttaan tgcncccnna nnnantcaac tctntnntcc 600 

tnntnnaacn nnnttantct annatnntta cnnntnagnt tttcctcnct nacnnctctg 660 

tncttnttnn atcttntnct tctcncttna tttntatctc ntntntntnc tnccctnatc 720 

tatctnctac nctctnttcc ncttctccct nncntctctc atcatatccc acgcnactna 780 

ncccctctnn ctcttacctn nntnctctcn tcntatctcn nnaccctctt tctntntctt 840

atnncnccta tcctctactt attctcctcc tattntncca ctcacccttc ntntntctnc 900 

nctnntcttn tnctatttnt actntcncta ttcctncntc tctnntgnct cccaccccct 960 

cttcctctcn ctctcctnnn nnnactactc tcaccntctc nnctntcnct ctacnnntnn 1020 

ananntcctt antttcctnc tcatcacant actcttccct ctcatnntca nanctaantt 1080 

ntnctctcac tctaccactc tntnctccac tcatatnana cttctatant nctaatccta 1140 

tcttcttaaa cntctcctct tatcnctcta anctcctctt cntcgctanc tccnntncaa 1200 

ctcgnaaatc tctccaatnc tnccccactc taaaaatnnc ncntcngant cccacttttc 1260 

ngngcanaat nnaacncnan tccnctccct ttagctatct ctctanaaac cccntttctc 1320

aacaggnacc nccctntntc tcnaaatcct catnctncta ctttatatnt cnccaagcct 1380 

cncctntgta anagcatctc nctntccncc aatnnanatc tccctnctcc natanatntn 1440 

anat 1444 



<210> 518 
<211> 706 
<212> DNA 

<213> Homo sapiens 



<400> 518 

ctaatggctg gnngctcgtt ctttccgcaa cancccngcg antcgaattc ggcacgaggt 60 

ccgaagaaaa agactgtggt ggcggagatg ctctctccaa tggcatcaag aaacacagaa 120 

caagtttgcc ttctcctatg ttttccagaa atgacttcag tatctggagc atcctcagaa 180 

aatgtattgg aatggaacta tccaagatca cgatgccagt tatatttaat gagcctctga 240 

gcttcctaca gcgcctaact gaatacatgg agcatactta cctcatccac aaggccagtt 300 

cactctctga tcctgtggaa aggatgcagt gtgtagctgc gtttgctgta tctgctgttg 360 

cttctcagtg ggaacggact ggaaaacctt tcaacccact gctgggagag acttatgaat 420 

tagtgcgaga tgaccttgga tttagactca tctccgaaca ggtcagccat cacccaccaa 480 

tcagtgcatt tcatgctgaa ggattaaaca atgacttcat ctttcatggc tctatctatc 540

ccaaactgaa attctggggg aagagtgtag aagcagaacc caaaggaacc atcaccttgg 600 

agctccttga acacaatgag gcatatacat ggacaaatcc cacctgctgt gtgcataata 660 

tcattgtggg taaactgtgg atcgaacagt atggcaatgt ggaaat 706 

<210> 519 
<211> 734 
<212> DNA 
<213> Homo sapiens



<400> 519 



tngtaccaat tatctgctgg ctanntagcc taaanagntt ggtcngggcg aattcggcac 60

gagggnaaag cagnaagtaa tgagcttgtc cgtcagctgg tagctttcat tcgtnaaaga 120

gataaaagag tgcaggcgca tcgaaaactt gtggaagaac agaatgcaga gaaggcgagg 180

aaagccgaan agatgaggcg gcagcagaag ctaaagcagg ccaaactggt ggagcagtac 240

agagaacaga gctggatgac tatggccaat ttggagaaag agctccagga gatggaggca 300

cggtacgaga aggagtttgg agatggatcg gatgaaaatg aaatggaaga acatgaactc 360

aaagatgagg aggatggtaa agacagtgat gaggccnagg acgctgagct ctatgatgac 420

ctttactgtc cancatgtga caaatcnttc aagacanaaa atggccatga agaatcacga 480

gaagtcnaan aagcatcggg aaatggtggc cttgctaaaa caacagctng angangaacg 540

aagaaaattt ttcaagacct caaattgatt gaaaatccat tagatgacaa ttcttgagga 600

agaaatgnga aagatgcacc aaaaacaana agctttctac acantnaaat ccnannaact 660

ccatccntct anaactatnn gtgagtcctt nttacntcna tccagacatg antancnata 720

cnattgatgg aacc 734



<210> 520 
<211> 701 
<212> DNA 

<213> Homo sapiens 
<400> 520 

ctaatgctgg ctnttgttct ttttgcagga tcccatcgat tcgaattcgg cacgagccca 60 

catgtaccag gttgagtttg aagatggatc ccagatagca atgaagagag aggacatcta 120 

cactttagat gaagagttac ccaagagagt gaaagctcga ttttccacag cctctgacat 180 

gcgatttgaa gacacgtttt atggagcaga cattatccaa ggggagagaa agagacaaag 240 

agtgctgagc tccaggttta agaatgaata tgtggccgac cctgtatacc gcactttttt 300 

gaagagctct ttccagaaga agtgccagaa gagacagtag tctgcataca tcgctgcagg 360 

ccacagagca gcttgggttg gaagagagaa gatgaaggga catccttggg gctgtgccgt 420

gagttttgct ggcataggtg acagggtgtg tctctgacag tggtaaatcg ggtttccaga 480 

gtttggtcac caaaaataca aaatacaccc aatgaattgg acgcagcaat ctgaaatcat 540 

ctctagtctt gctttcactt gtgagcagtt gtcttctatg atcccaaaga agttttctaa 600 

gtgaaaggaa atactagtga atcacccaca aggaaaagcc actgccacag aggaggcggg 660 

tccccttgtg cggcttangg ccctgtcagg aaacacacgg g 701 

<210> 521 
<211> 784 
<212> DNA 

<213> Homo sapiens 
<400> 521 

naacacttng ctacnngttc tttttgcagg atcccatcga ttcgaattcg gcacgaggag 60 

atctctggga tgtcagtgag gctggttgaa gaccagaggt aaactgcaga ggtcaccacc 120 

cccaccatgt cccaggtgat gtccagccca ctgctggcag gaggccatgc tgtcagcttg 180 

gcgccttgtg atgagcccag gaggaccctg cacccagcac ccagccccag cctgccaccc 240 

cagtgttctt actacaccac ggaaggctgg ggagcccagg ccctgatggc ccccgtgccc 300 

tgcatggggc cccctggccg actccagcaa gccccacagg tggaggccaa agccacctgc 360 

ttcctgccgt cccctggtga gaaggccttg gggaccccag aggaccttga ctcctacatt 420 

gacttctcac tggagagcct caatcagatg atcctggaac tggaccccac cttccagctg 480 

cttcccccag ggactggggg ctcccaggct gagctggccc agagcaccat gtcaatgaga 540 
aagaaggagg aatctgaagc cttgggtaag gatttggggc acagtaccag gaggggggct 600

tggtgccaga cctcatgagg aagaaggatt ttcctatgta cagagaaggg gacccctgtc 660

ctgttgggan gtgctgtgca aacctaacca aagttactaa cccctctggt ttctgnggtt 720

acacaaangg ggataaatac aaagctttnc ctnaactagc caattctatt tgggtttcct 780

gagt 784



<210> 522 

<211> 719 

<212> DNA 

<213> Homo sapiens 

<400> 522 

ttctaatttn aatccttnaa atnggttctt tntgcaggat cccatcgatt cgaattcggc 60 

acgagagaac acaggtgtcg tgaaaactac ccctaaaagc caaaatggga aaggaaaaga 120 

ctcatatcaa cattgtcgtc attggacacg tagattcggg caagtccacc actactggcc 180 

atctgatcta taaatgcggt ggcatcgaca aaagaaccat tgaaaaattt gagaaggagg 240 

ctgctgagat gggaaagggc tccttcaagt atgcctgggt cttggataaa ctgaaagctg 300 

agcgtgaacg tggtatcacc attgatatct ccttgtggaa atttgagacc agcaagtact 360

atgtgactat cattgatgcc ccaggacaca gagactttat caaaaacatg attacaggga 420 

catctcaggc tgactgtgct gtcctgattg ttgctgctgg tgttggtgaa tttgaagctg 480

gtatctccaa gaatgggcag acccgagagc atgcccttct ggcttacaca ctgggtgtga 540

aacaactaat tgtcggtgtt aacaaaatgg attccactga gccaccctac agccagaaga 600 

gatatgagga aattgttaag gaagtcagca cttacattaa gaaaattggc tacaaccccg 660 

acacagtanc atttgtgcca atttctggtt tggaatggtg acaacatgct ggagccaat 719 

<210> 523
<211> 710
<212> DNA
<213> Homo sapiens

<400> 523

tnnncttcaa atcgntngct cttgttcttt ttgcaggatc ccatcgattc gaattcggca 60

cgagagatta tgagcatgta gaagatgaaa cttttcctcc tttcccacct ccagcctctc 120

cagagagaca agatggtgaa ggaactgagc ctgatgaaga gtcaggaaat ggagcacctg 180

ttcctgtacc tccaaagaga acagttaaaa gaaatatacc caagctggat gctcagagat 240

taatttcaga gagaggactt ccagccttaa ggcatgtatt tgataaggca aaattcaaag 300

gtaaaggtca tgaggctgaa gacttgaaga tgctaatcag acacatggag cactgggcac 360

ataggctatt ccctaaactg cagtttgagg attttattga cagagttgaa tacctgggaa 420

gtaaaaagga agttcagacc tgtttaaaac gaattcgact tgatctccct attttacatg 480

aagattttgt tagcaataat gatgaagttg cggagaataa tgaacatgat gtcacttcta 540

ctgaattaga tccctttctg acaaacttat ctgaaagtga gatgtttgct tctgagttaa 600

gtagaagcct aacagaagag caacaacaaa gaaattgaga gaaataaaca ctggccttgg 660

aaagaaggca ggcaaagctg ctgagtaata gtcagaccct aggaaatgat 710

<210> 524
<211> 730
<212> DNA
<213> Homo sapiens

<400> 524

ttnnnnnttt aancnttcaa atcnctaggc tacttgttct ttttgcagga tcccatcgat 60



tcgaattcgg cacgagccca cactcggaca ctgtggaatt ctaccagcgc ctgtcgaccg 120 

agacactctt cttcatcttc tactatctgg agggcactaa ggcacagtat ctggcagcca 180 

aggccctaaa gaagcagtca tggcgattcc acaccaagta catgatgtgg ttccagaggc 240 

acgaggagcc caagaccatc actgacgagt ttgagcaggg cacctacatc tactttgact 300 

acgagaagtg gggccagcgg aagaaggaag gcttcacctt tgagtaccgc tacctggagg 360 

accgggacct ccagtgacac cggcccctnc ctctacccac ccccttcccc cgcatgctga 420 

tccccctgcc caggtaaggg ccctgccctg gaagactgga gggaggcccc aagccacggg 480 

gcatccccct ctcccaggaa gcagggaggg ggccgggagg ttttcctctc aagccccacc 540 

ctgggggccc gggggcgagg gctgccccct cctcccctcc ccagtgaggg acattttttg 600 

gtaaaaccta ttttcatttt ggaaaatatt tatgaataaa tagttttata tgaaaaaaat 660 

tntngnnntt nnnatnnnan aataaaancn tcgnncctct taaaactata gtgaagtcgt 720 

attaccttag 730 

<210> 525 
<211> 711 
<212> DNA 
<213> Homo sapiens 

<400> 525

gcngntnttn antttcaaat cgctnggcta cttgttcttt ttgcaggatc ccatcgattc 60 

gaattcggca cgaggataaa tacctcagcc cctcgccttc ctcaacccac ctggcaagtc 120 

ttcttaggat ctgatcccag ttttctggaa gcaatcctac cccagcccaa gcttcccaga 180 

gtcgagcctt aatccttctc acttctcagt gtcagagcag aaatgaatcc tggggttgac 240 

tgtgtccatt cgggttatta gcagctaaga agcccagacg agtagtgtga gctgccttgg 300 

gagcctcagt gagggcactg ggactggcct cactctcttg cccccagcct agtgggcttt 360 

ctcctctgtc tctccggtgg ccccaggcaa tcgactgcat cacgcaggga cgtgagttgg 420 

agcggccacg tgcctgccca ccagaggtct acgccatcat gcggggctgc tggcagcggg 480 

agccccagca acgccacagc atcaaggatg tgcacgcccg gctgcaagcc ctggcccagg 540 

cacctnctgt ctacctggat gtcctgggct agggggccgg ccaggggctg ggagtggtta 600 

gcccggaata ctggggcctg ccttagcatc ccccatagct tccacagccc cagggtgatc 660 

tcaaagtatc taattcacct taacatgtgg gaagggacag gtggggcttg g 711 

<210> 526 
<211> 692 
<212> DNA 
<213> Homo sapiens 

<400> 526 

tacangctac ttgttctttt tgcaggatcc catcgattcg aattcggcac gagagaacag 60 

ggagaagaga ggaagaggga gctgcaggtg ccagaagaga acagggcgga ctctcaggac 120 

gaaaagagtc aaaccttttt gggaaaatca gaggaagtaa ctggaaagca agaagatcat 180 

ggtataaagg agaaaggggt cccagtcagc gggcaggagg cgaaagagcc agagagttgg 240 

gatgggggca ggctgggggc agtgggaaga gcgaggagca gggaagagga gaatgagcat 300 

catgggcctt caatgcccgc tctgatagcc cctgaggact ctcctcactg tgacctgttt 360 

ccaggtgcct catatctcgt gactcagatt cccgggactc agacagagtc cagggctgag 420 

gaactgtccc ccgcagctct gtctcccttg ctagagccca tcagatgctc tcaccagccc 480 

atttctctac tgggctcctt tttgactgag gagtcacctg acaaggaaaa acttctatca 540 

gtactttgat atgtcacagt ttcatgttta tccagttcaa tgtattttta aatttttcct 600 

tgagacttct ttgactgata gattattgtg aatgtgtttt taaatttcca aatgtttang 660 

gattttcata tctttcttat gctgatttcc aa 692 
<210> 527
<211> 769 
<212> DNA 
<213> Homo sapiens 

<400> 527 

gttctngttc tttttgcagg atccctcgat tcgaattcgg cacgaggcca agcctcggcc 60 

tccactgcac ctgctgcgga gtgggcacct ttgcctgcaa ggccttttnc ccantgncca 120 

atggtanttt aaccagggtt tttgncnntt aaggaggcct tngtggtggg tngttaatct 180 

ggccnttccn tattgaaaag ctcctgttat tgtccacaga ccagaaggac ttgtaacctt 240 

ggtcccacag tctgacttng gcttttcaag cacccagaaa acttagaggg aatcttatag 300 

attccagaac ttaaggatac ctcaagggat agggtcacag ccaagaagtn caaaggaatc 360 

ttcagtctgg aacaaaaaca gaaccctttc atgattgaca aangtcactt tctgtttgcc 420 

tggaccaagc tactncagat catctgacca actcttaaaa atcacggcca ggcacagtgg 480 

ctcatgcctg taatcccagc actttgggaa gcaaaagtgg caggatcatt ncagcccaag 540

agttcaagac cagcctgggc aacacagtga gtgagaccct gctctattta agaaaaatna 600 

ttaagaaatt tattaaaaaa gaagaatcag gaaaccaagt ncaacccaac ttaacctcaa 660 

tgaaccagcc cctaacacag atgangggat ttgggactga taagctctgt gctgngtcca 720

tggcccgtca nttatcaagg ttgcactttt aaatgnggta tttttatgn 769

<210> 528 
<211> 757 
<212> DNA 
<213> Homo sapiens 

<400> 528 

tnaatatcag ctcttgttct ttttgcagga tccctcgatt cgcangaggg tgttcgactg 60 

ctngagccna gcgaancgat gcctaaatca anggaacttg nttcttcaag ctcttctggc 120 

ngngattctg acagtgaggt tgacananag ntaancagga aaaacaagtn gctccagaaa 180 

ancctgtaca gaaacataag acaggtgana cttcgagagc cctgtcatct tctaaacaga 240 

gcagcatcng cagagatnat nacatgtntc atattgggaa aatgaggcac gttantgttc 300 

gcnattttaa aggcaaagtg ctaattgata ttanagaata ttgnatggat cctgaaggtg 360 

aaatgaaacc aggaagaaaa ggtatttctt taaatccana acantggagc cagctgaang 420 

aacagattct gacattgatg atgcagtaag aaactgtgaa attcgagcca tataaataaa 480 

acctgtactg tctagttgnt ntaatctgtc tttttacatt ggcttttgtt nnctnaatgt 540

tctccangct attgtatgtt tggattgcag angaatttgn angatgaata cttnntttta 600 

atgngcatta ttaaaaatat tgagtgaagc tnatngtcaa ctttattaag gattactttg 660 

ctgccaccac ctagtgtcaa ataaaatcaa gtaatacaat cttaataaac ntttaaacta 720 

taaaaactcg acccttagac ctatantnag tcggttn 757 

<210> 529 
<211> 821 
<212> DNA 
<213> Homo sapiens 



<400> 529 



tnannnannc annnnnnnnn nnnnntttga agccattgct acttgttctt tttgcaggat 60

cccatcgatt cgaattcggc acgagagcaa ttccactcct agctccaccc acaggaaatt 120

gaaagcaaag acgcaaacag atgcctgtgc accaaagttc acgggcaagc atccttcggc 180

cttaatgggc agcattccgt cgtcacaagc gggcattcat cctttcatca atagcgggca 240

gcattccgtc gtcacaagcg ggcagcattc ctttcgccac aagcgggcag catcttgtcc 300

gtcacaagcg ggcagcatcc ttcgccaaag cgggcaagca tccttcgtca tagcggcagc 360

atcctttgcc atagcgggca aggtggaaac cctgtccatc cactgaggcg tgcatagact 420 

aaacatggcc agtccaggca ctggaatcca ggcccgtaga acggcgccca cggtcaaaag 480 

gaatgagacc ctgatgcact gggcgacaca gacgggcgac acagacttgg agacatcatg 540 

ctaagtgaaa agccaggcac acggagcgga cggcgtgatc ctgctcacgt gatgtgtccc 600 

gaatgggcac gttcagaggg aagaagggag atggcgcttg ccggtgcccg gggacngggg 660 

ttgggagcga cggttgctgg tttggggttt ctttctgggg tgangaantg gttttgatat 720 

ttggnccgtt ggtgatgttt gcatacctct gaatatgctt aaganccaca gaattgacca 780 

ctttaaatgg atgaattgna tggtattggg aattacccaa n 821 

<210> 530 
<211> 765 
<212> DNA 
<213> Homo sapiens 

<400> 530 

gnntttnnnn nnnnnnnttt tatnnntaca gctacttgtt ctttttgcag gatcccatcg 60 

attcgaattc ggcacgagac taccccggct acggttcccc catgcctggc agcttggcca 120 

tgggcccggt cacgaacaaa acgggcctgg acgcctcgcc cttgcccgca gatacctcct 180 

actaccangg ggtgtactcc ggcccattat gaactccttt aagaaagacg acggcttcag 240

cccggtaact ctggcacccc ggatcgagga caagtgagag agcaagtggg ggtcgagact 300 

ttggggagac ggtgttgcag agacgcaagg gagaagaaat ccataacacc cccaccccaa 360 

cacccccaag acagcagtct tcttacccgc tgcagcccgt ccgtccaaac agagggccac 420 

acagataccc cacgttctat ataaggagga aaacgggaaa gaatataaag ttaaaaaaaa 480 

gcctccggtt tccactactg tgtagactcc tgcttcttca agcacctgca gattctgatt 540

ttttggtggt gtgtctcctn cattgctgtt gttgcaggga agtcttactt aaaaaaaaaa 600 

aaattttgtg agtgactcgg tgtaaaacca tgtagtttaa cagaaccaga nggttgacta 660 

ttgttaaaaa caggaaaaaa ataatgtaag gtctgttgta aatgaccaan aaaaaaaaaa 720 

aaactcngcc tntaaactnt tntgagtcgt nttcgtaaat ccaan 765 

<210> 531 
<211> 768 
<212> DNA 
<213> Homo sapiens 

<400> 531 

gnntttnnnn nnnnnnnttt taagntactg ctacttgttc tttttgcagg atcccatcga 60 

ttcgaattcg gcacgaggtt cttcaaagcc aaccaagaca ggcttagcag ttttagagct 120

tcagaacaaa ttgccaaaag ccagagttgt ttatgctagt gcaactgggt gcttctgaac 180 

cacgcaacat ggcctatatg aaccgcttgg catatggggt gaggggtact ccatttagag 240 

aattcaagtg attttattca agcagtagaa cggagaggag ttggtgccat ggaaatagtt 300 

gctatggata tgaagcttag aggaatgtac attgctcgac aactgagctt tactggagtg 360 

accttcaaan ttgaggaagt tcttctttct cagagctacg ttaaaatgta taacaaagct 420 

gtcaagctgt nggtcattgn cagagagccg gntcagcaag ctgcagatct gattgatgct 480 

gancaacgaa tgaagaagtn catgtggggt cagttctggc tgtcaccaga ggttcttcaa 540 

atacttatgc atagcatcca aagttaaaag ggttgtgcac tagctcgaga ggaaatcang 600 

aatggaaaat gtgtngtaat tggctgcagt ctcaggagaa gctnnaacat tagaactttn 660 

gaagaaggcn ggggagaatt gatganttgg ttcaactgcc aaagtgtgtg cantcactca 720 

ttggaaaaca tttnctgctc cagcngggaa aacttatggt tacttggn 768 

<210> 532 
<211> 761
<212> DNA 
<213> Homo sapiens 



<400> 532 

cgtnttttnn nnccnannga aagcccttgg ctacttgntc tttttgcagg atcccatcga 60 

ttcgaattcg gcacgaggat cagcccacct cggcctcaca aagtgntggg attacaggcg 120 

tgagccacct tgcccaccca catcatacag ttgaaatgaa actttgccac aaccagcctt 180 

tgctgtacac acacatatat cactgaacct ggttgaaata aagntttttt tctttttcct 240 

ctggtattct gggttctgaa gtctggtatt ctggtattct gggttcaaaa gtatgacttg 300 

agagtgttgc tctggtattc tgagagttgc tctgtattct gggttctgaa gattatttga 360 

aaaataactc ctactacatt gaaatgcaga cttaaaaatt taaacattgg attaggcagt 420 

caaaaaaacc aagcaagcat aaaaggtcaa taagttgtaa tcttgatagt aaaggtggaa 480 

aacttattat aaatggaaag aaagtttatt tccttttttg gttgatgggc agtatgccat 540 

attataccca aagttctttt aaaaaatatt tccatcacca tttttattta aaataaacat 600 

ttgagggaag taccaaggca gcttttttcc tcaaaagtac ctggtcctct ttgggaatag 660 

cacattttan gggcattggg taatcctgag attttactca ntaaatcctg atggtactgg 720 

gtgtaaaata tctttagtng gattgaaggc cttgnggggg a 761 

<210> 533 
<211> 735 
<212> DNA 
<213> Homo sapiens 

<400> 533 

taaacatcng gctacttgtt ctttttgcag ggatcccatc gattcgaatt cggcacgaga 60 

cactgtccca ctccatcacc caggctggag tccagtggtg tgatcatagc tcgctgcatc 120 

ctccagttcc tgggttcaag ccatccctcc tgcctcagcc tccccagtag ctggaactac 180 

aggtgtgtgc catcacacct ggctttacat ttttctgtgg ggtcttacta tgttgcccag 240 

gccggtctca aactcctgag ctcaagtgat cctctgcctc agcctccaga gtatctggga 300 

ttacatatgt cggctaccgt gtctggccgt tcacatcttt ggccactatt tgcttgtgaa 360 

aaggtataat gaggtggtac ttatcatttt tactgngtct catgttttgt atatttttgt 420 

ttcatcaact aagatgcact gtaacatctc tgaaatctgg atatattatc aatggtttat 480 

catagttttg ttagcaatac actgtctttt agtggtgcct aaaataatgg tatagttgtg 540 

aggtgatctt agatttgatg aagcacagta tgcaggtagg cctaatgggg gaagatggta 600 

atataaaagc aagaagtatt ttttttttgt aatgactgaa agctgtctgt ggatgaccta 660 

ccctttnctt taaacacgat tntntcactt ncaactncaa acttgctcaa ctaatncttt 720 

aaaaataact tgagc 735 

<210> 534 
<211> 735 
<212> DNA 
<213> Homo sapiens 

<400> 534 

natngnttgc tcctngttct ttttgcagga tcccatcgat tcgagacaac ccagaaacaa 60 

attcatacat ctatggtgac cacttttgac aaaggaatga agaacataca ctgggggaaa 120 

agataatgtc tttaataaat ggtgctggga aaactggntn tccantntgc agaagaatga 180 

aactagaccc ccatctctta gcatatacaa aaatcaaaat taattaaaaa gttaaatcta 240 

agacctcaaa ctatgaaaca gctaaaagaa aacatcgggg aatctctcca ggacattgga 300 

gtgggcaaag atttcttgtg taatacctga caaacaggca accaaagcaa aagtggacaa 360 
atgggatcac atcaagttaa aaatcttctg cattgcaaag gaaataacaa agtgaagaga 420

cacccataga atgtgagata atatttgcaa actatccatc tgtattaggc catttttgaa 480

gtctacaaag aaatacttga gactgagtaa tttataaaga agaggtttaa ttggctcacg 540

gttttgcagg ctgtcaggaa gcatggtgct aacatctgat cagcttgtag ggaggcatca 600

ggaagtttcc acccatggtg gangcaaaag gggaataagt ttctccatgg caggtgcagg 660

gcaaaaanan gggggaaggg aagtgccnca caaccagatc ttgtgagtnc tcagatttgn 720

ggngggngct tgngg 735



<210> 535 
<211> 735 
<212> DNA 

<213> Homo sapiens 
<400> 535 

tnaannanag ctacttgttc tttttgcagg atcccatcga ttcgaattcg gcacgaggtc 60 

catacatgga gctccctgga cccgtgtgct ctcgtgtgac tgaacgtttt gtgatgaaag 120 

gaggagaggc tgtctgcctt tatgaggagc cagtgtctga attgctgagg agatgtggga 180 

attgcacacg ggaaagctgt gtggtttcct tttacctttc agctgaccat gaactcctga 240 

gcccgaccaa ctaccacttc ctgtcctcac cgaaggaggc cgtggggctc tgcaaggcgc 300 

agatcactgc catcatctct cagcaaggtg acatatttgt ttttgacctg gagacctcag 360 

ctgtcgctcc ctttgtttgg ttggatgtag gaagcatccc agggagattt agtgacaatg 420 

gtttcctcat gactgagaag acacgaacta tattatttta cccttgggag cccaccagca 480

agaatgagtt ggagcaatct tttcatgtga cctccttaac agatatttac tgaaggaatc 540 

taggttgtat tttcagtgga caatgggaat aaagcatttc taaagcaccg actggagagg 600 

aaggcaacag aaacaaggag agaagcccga gagacatgtc tgcgtgctgc cacgcatctg 660 

ancgattgct cttgtgaaga gtttgtcact gaacattttc aggggaggct gtttacccag 720 

cnatgtnctn aacan 735 

<210> 536 
<211> 785 
<212> DNA 

<213> Homo sapiens 
<400> 536 

gccccccnnn nnnnnnnttt tcaaannccn ttnnnnnnnn nngnnnnttt tannnnnttn 60 

ttannnnaca gctcttgttc tttttgcagg atccctcgat tcgattcggc acgagctacc 120 

ttgggctggc cctctatnat gctntgaggg gagctgggac agatgatcnt nccctcntca 180 

gngtcatggn tnccangngt gagnttnatc tgccnnacat ngtgacggag tttaggaaga 240 

atgntgccnc ctctntttat tccatgatta aggganatcc atnnggggac tataagaaaa 300 

gcnnttttnc tgctntgngg ncaanangan tnacnngncc cgggnnanag ctcctatgct 360 

gtntgcctgc accaccccct gccttccttc atacctttcc ntggatatgn atgccagggc 420 

ttnncacatt gcctnattna tactnacntg ctnatgacca anacatncac gtgataacac 480 

aaacantggg tgcttgnttc tgatcnctag aggnganctn ttggnnngnt ggagnactna 540

antnttctna gtgtnacttn agttcaatgc ctggccatnt gcnatnacct tatatcntnc 600 

aaagaggcta ctgtgctttt ancctttttt aaaacctcca tctgtattac attgnnaacc 660 

angtttcttt aatnaggagc ttgacctcta nantgggaac tcttgggaat ggncttagtg 720

aagttcgcna ctaacttaac ctgaaaatta tnatgnnctg tttnacctat catgttnata 780 

actnt 785



<210> 537
<211> 967
<212> DNA



<213> Homo sapiens 



<400> 537 



agtanggcgn ttcctaatnn 
aataccgnnc gaggggggat 
atntctttac cccccnannn 
cacacacang caagcactnn 
gtagacatnt nagacacgtc 
ngcanacnca gacggcacnn 
nncgntgggc cngtagtagt 
acgnnctttn aactaaaaan 
antnncttga gngggatgaa 
aattganatt attncntant 
tnaaanttnn ctnttttnan 
ggtanggann naattgtncc 
atnngngngg nccntntatg 
gntaacanan gnacnatggn 
nnnactatgn naaatattct 
tntgcatagn nactatncnt 
gcttnnn 



gggacgngtt 



annnggctaa 
nacaatagta 
ncacanatgc 
catgcatggc 
atgtnacaca 
ngagacanac 
gtactgtggg 
cttggagcct 
ccctaananc 
ngnncacgaa 
aatnaaacat 
tgggnatagt 
nggcnngcca 
accnatccct 



cacatntaca 



annaangggg 
ntnngaagna 
naaaaagnag 
aacnnanatt 



gcgactttna 
nacnnggtnc 
agacncacac 
ccatgctcac 
tgnnacacan 
atgcnnaaac 
tgnnactggg 
tnanttnntn 
ctggccctnn 
gcttntggta 
tttancggtt 
tctcacaant 



aagangaggc 
caatncatgc 
atngcanncg 
acacntgnan 
gnnnaanaca 
aacatgctcn 
tgccatcnac 
tggtgantnc 
tnccnctttc 
ncangngncc 
ctnaggancc 
natnttnaag 
tcgnngttaa 
aggaactncc 
gnnccnnggt 
ntatgcttta 



tngcgtgntg 
ttaacaccgc 
nacacncaga 
nnaacatgcn 
ctgctttnca 
ctcacntnna 
nnngtatttt 
aatncctana 
aaggccnagn 
cgagnnctnt 
gngcctncng 
gggnnaagng
natattccaa 
tgnncgacta 
ncttnaagnt 
nnngatnggg 



780 
840 
900 
960 
967 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 



720 



<210> 538 
<211> 892 
<212> DNA 

<213> Homo sapiens 
<400> 538 

gctagttnga agaggtgttt ctaangnntn ggaatcgaca tctnnnnagg cngnccntgc 60 

gattcgcttt gctctctcca ttccaagttg ttctctgttc tagaaagcng atgnngggnt 120 

acatctactg tttttgccta aacagaatcc ctttntcctt tttttgttaa aaggctcatn 180 

cctaatatta cattgctctg gaacgantga caataccana actcagcacc ntgatcggac 240 

cgggacaatc agattatcta attcctcagc aaacggagat cgatccgaaa agtggaaata 300 

tganctcntn ctttgtgntg gcatatggac cctgagagaa agaaacttta atcttttact 360 

cttggactgc aatnaagtnt agctgcctaa aaatcnnttt cntgacactt ngnaggtttg 420

tccacaatcg ggngaaatta nngggtnnga cntaancact ggatgaaaaa aaatnccgnt 480

tanttntatt ncnnttccan ncttntnaaa tanananttt ntcanccttn nntaatacta 540 

ttanntatat ntnttnnncc cnnatnnncc ttcttnctcc tacnncnntn cnatntnnnn 600 

nnangntcnn cnannnnttc tnttatttct annatatntc ntancnttna ctaaaacctc 660 

cnctcgtnna nattncnnta taatattntc tctaganntt ntnntntntt gnnncttaaa 720 

anctcntcta tccctantat nantnattct taccatnaaa tacactanaa gtnntntcac 780 

gagacncgnt atgttantnc anactataat cgcttncatn tanntatatn taaaantgct 840 

atncagnnag nngntnttat atntttanct ngnnaggnta tcatenatan cc 892

<210> 539 
<211> 751 
<212> DNA 
<213> Homo sapiens 

<400> 539 

gnnnaggttn tagancagct cttgttcntt gngcaggatc cctcgattcg aattcggcac 60

gagagtgtca gttttcctaa tctcagtcca ggtaggaatt aagaaatatc tcaagtgttg 120

atgctatcca agcatgttgg ggtggaaggg aattggtgcc cagaaaatgg gactggagtg 180 

aggaatatct tttcttttga gagtaccccc agtttatttc tactgtgctt tattgctact 240 

gttctttatt gtgaatgttg taacatttta aaaatgtttt gccatagctt tttaggactt 300 

ggtgttaaag gagccagtgg tctctctggg tgggtactat aatgagttat tgtgacccac 360 

agctgtgtgg gaccacatca cttgttaata acacaacctt taaagtaacc catcttccag 420 

gggggttcct tcatgttgcc actccttttt aaggacaaac tcaggcaagg agcatgtttt 480 

tttgntattt acaaaatcta gcagactgtg ggtatccata ttttaattgt cgggtgacac 540 

atgttcttgg taactaaact caaatatgtc ttttctcata tatgttgctg atggttttaa 600 

taaatgtcaa agttctcctg ttaaaaaaaa aaaaaaaaaa actcgancct ntanactata 660 

gtgagtccnt attacgtaga tccagacatg atnagatcat tgatgaattt ggaccaaccc 720 

aactagaatg cagtgaaaaa aatgcttttn t 751 

<210> 540 
<211> 761 
<212> DNA 
<213> Homo sapiens 

<400> 540 

gntnggntcn agancagcta cttgttcttt tgcaggatcc ctcgattcga attcggcacg 60 

agcctgcagc cactaatgca ttgtgtatga taacaaaaac tctggtatga cacattttct 120 

gtgatcattg ttaattagtg acatagtaac atctgtagca gctggttagt aaacctcatg 180

tgggggtggg gtgggggtgt attccttggg ggatggtttg ggccgaatgg ggagtggaat 240 

atttgacatt tttcctgttt taaattctag gatagatttt aacatccttt gcggtcccag 300 

tccaaggtag gctggtgtca tagtcttctc actcctaatc catgaccact gtttttttcc 360 

tatttatatc accaggtagc ccactgagtt aatatttaag ttgtcaatag ataagtgtcc 420 

ctgttttgtg gcataatata actgaatttc atgagaagat ttattccacc aggggtattt 480 

cagctttgaa accaaatctg tgtatctaat actaaccaat ctgttggatg tgggttttaa 540 

aaaatgtttg ctaactaccc aagtnagatt tactggatta aatggccctt cgggtctgaa 600 

aaagcttttt taacttcttn gcttaaaatg ccgtttaatt ttgataagat ncttnaaatn 660 

gcctccaaaa gtgttananc caatcatttn aaataaaccn ggntgtatat tgcatnatgt 720 

gtacatgcnt atncccttct ggttaaaact naaaaaaaaa t 761 

<210> 541 
<211> 748 
<212> DNA 
<213> Homo sapiens 

<400> 541 

ggtttanttt aaatccntnc ncagctactt gttctttttg caggatccca tcgattcgaa 60

ttcggcacga gcggacccat cggagcgtaa cctggatctc cgcaggcctg gcggaggccg 120 

gccacctgga ggggcattgc ttggttcgcg tggtancaga ggagcttgag aatgttcgca 180 

tcttaccaca tacagttctt tacatggctg attcagaaac tttcattagt ctggaagagt 240

gtcgtggcca taagagagca aggaaaagaa ctagtatgga aacagcactt gcccttgaga 300 

agctattccc caaacaatgc caagtccttg ggattgtgac cccaggaatt gtagtgactc 360 

caatgggatc angtagcaat cgacctcagg aaatagaaat tggagaatct ggttttgctt 420 

tattattccc ttcaaattga aggaataaaa atncaaccct ttcattttat taaggatcca 480

aagaatttaa cattagaaag acatnaactt actgaagtag gtcttttaga taccctgaac 540 

ttcgtgtggt cttgnctttg gttataattg ctgtaaggtg ggagccagta attatctgca 600 

gcaagtagtc acncttttca gtgatatgaa tatcatcttt ggcttggang ccantngaca 660 

acctgncatt actgactttt tgaaaanaac cctctggata ttgatgcctc gggtgtggtt 720

ggactgncat ttagtggacc ccgaatcc 748

<210> 542 
<211> 784 
<212> DNA 

<213> Homo sapiens 
<400> 542 

gtnnnnntng tgtaatcgct tggctgcagg atccctcgat ggcgaattcg gcacgaggtg 60 

ttgctcaang agcagacccg actccntaag gtcatcattg aatgggcatn atangtttga 120 

anactgtcca ananantang ngtcaataca tcaacnnctt tanntgcttg atattgnnat 180 

tgaanaacac angnctcngn ctagttcgcc tganatgatg tttaagatac tccggaagga 240 

gacanantgt tntgantgcg gattaganac cacngaagnn acactnaagg ancancatct 300 

ccacctngna actgnattnn cngaccanaa aagngaactg gaccaaatgc tctcaaaggt 360 

gctggcagct taanagcgtg ttangactct gcacgaagan gacaggtnnt ntgagagcct 420 

ggnnannaca ctctcccaaa ctaaactgna nctttcaaca nangggancc ccannttggt 480 

ggagaaatca ggtganctgt tggcccttcc acaaagangc aaattctntg agggcnagac 540 

ttnanccttt ttgcngaacc agtncttgac tgactaaatg aaagcttttt aagccaggtg 600 

gcccancctt aangaagcna ctttttaatc cancggaacc ngcttgagan aaaaccnttt 660 

ttgacccaaa accnggagaa ccagctggcc taccaaaggg aaatgggccc ccatttgaac 720 

ttggggttnc ccangaacaa nccttgnccg ggncaaagcc cnttgttgga aaggacctca 780 

acct 784 

<210> 543 
<211> 764 
<212> DNA 

<213> Homo sapiens 
<400> 543 

ntantaaatc ccttgctctt gttctttntg caggatccca tcgattcgaa tncggcacga 60 

ggacccggcg gcgcggacag gcttgctgct tcctcctcct nngactcacc attncaganc 120 

agaanntgaa aaaatggnng anctcaccca ggtaanggat gatgaagtnt tnatggctnn 180 

tgcatactat gcannanttn tncttntgna aatgatgcnt atgagtactg taanngnntt 240 

ctatncattg ncaagaangg ntnttgncaa tncatangac tgtgtagcat tcggcanagg 300 

agaaaatgnc aagaactatc ttcgaacaga tgacanagtg taacgggtac gcagagncca 360 

cctgaatgac cttgaaaata tnattccatt ncttgnaatt ggcatnctgt attccttgag 420 

tggtcccgac ccctctacag cnntcctgta ctttagacta tntgtcggag cncggntcta 480 

ccacaccatg tgcatatttg acaccccttt cnnatccaaa tatagctatg actttttttn 540 

gtaggatatg gannactctt tccatggctt acacgntgcn gtaaagtaaa ttggccctgt 600 

gcagaaaaac attccactca gtnttccaan tggcttntta aggaattctn gaccttgcaa 660 

ttnatantgg agnnctttcc ttaagattta aaggtttgan ggngagccnn aggaattntn 720 

aaccnggggt aaaccctttt tggaattttn agcnttgnca anaa 764 

<210> 544 
<211> 755 
<212> DNA 

<213> Homo sapiens 
<400> 544 

gatgctggnt ncnnatgctt gnngatccct cgattcgaat tcggcacgag gaaatgtgta 60 

tttcagtgac aatttcgtgg tctttttaga ggtatattcc aaaatttcct tgtattttta 120

ggttatgcaa ctaataaaaa ctaccttaca ttaattaatt acagttttct acacatggta 180

atacaggata tgctactgat ttaggaagtt tttaagttca tggtattctc ttgattccaa 240

caaagtttga ttttctcttg tattacattt tttatttttc aaattggatg ataatttctt 300

ggaaacattt tttatgtttt agtaaacagt atttttttgn tgtttcaaac tgaagtttac 360

tgagagatcc atcaaattga acaatctgtt gtaatttaaa attttggcca cttttttcag 420

attttacatc attcttgctg aacttcaact tgaaattgtn ttttnttttc tttttggatg 480

tgaaggtgaa cattcctgat ttttgctgat gtgaaaaagc cttggtattt tacattttga 540

aaattcaaag aagcttaata taaaaggttg cattctctca ggaaaaagcc atcttcttgn 600

atatgtcnta aatgtatttt tgncctcata taccggaaag ttcttaattg gattttacca 660

gctgnaatgc tttganggtt ttaaaaataa taacattttt aataattttt taaaaggaca 720

aactttcata atnatcccgg ngntcctttn ccnnn 755



<210> 545 
<211> 767 
<212> DNA 

<213> Homo sapiens 
<400> 545 

agnttttnaa tcctttggcc antcgcnctt tntgcangat cccatcgatt cgaattcggc 60 

acgagaaaaa gtnaagcttt tcatgagcac anntnccttg cattgttnga tgttactgat 120

attcgtaaaa tgaatatttt ctgttttgtt ctgttnnatt tttttgagac aagtcttgct 180

ttgttgccca ggctggagtg caatggcatg atcttggctc actgnaaccc ctgccttgcg 240 

agttcaagtg attcttctgc ctnagnctcc tgagtagctg ggattacagg cgctcaccac 300 

cacacccagc taatttctgt cttttnagtn gacacagggt tttaccatgn tggccaggct 360 

ggtctcaaac tnctgacctg aaactnctca caccngtnat ctcagcactt tgggaggctg 420 

angtggaaag gatcacttga agccatgagt ttgagaccag cctgngcnac acagcngaga 480 

ccccngtgnt gtacaaaagc ttncnacatt tanctggctg aggagtnnct cacccntaac 540 

ttccancnan tcnnttaagc nnanncatnt tgaacacntg agcccannta nggtcgatgc 600 

tnntagtnaa ccgtgactgg accacttaca gtccaagccc gggtngcctt ataaaagaan 660 

cggaaaacat ttcnttaatt cgggttnnag cnttanctat ttcggaatnc cttgngtttt 720 

naaaaacttg aatctccaan aaacagggtt ttttcttttg gnccann 767 

<210> 546 
<211> 989 
<212> DNA 
<213> Homo sapiens 



<400> 546 



tncccttgtt gaaanccctt tgctcctttn tnctnccgtt tgncatncna ttcgctcagc 60

tgaggcaatt aaactggaaa agaaatagat tgaaaagata ctntngaaga agcagtacag 120

aagttggggg actgaaggag agggagccac tgcaggtgct agctgcttaa ggggatacca 180

gtccttttac agatataata gatacagctt ctgaggtgga gggtgatagg agtgtgtatg 240

agaaanttgc agnttnacaa ctgctcntgc ctcctnggca anaggannan cntttcnccn 300

nttncnnccc ttatngnaca cacattgncc tgattggncn tnccncngct agcttncagt 360

cttnantnta ctcannagnn nntnggggaa cncnctntcn nantatgntc ccttttcctc 420

tnncntnncc nnatancacc ccnctcnctt tcctttctaa acttncacan ntccctgana 480

atgncttccg aatggantct tngaatttct ncgcccctnc ntcntcataa tcnttttgct 540

nctccngctc nccctcattt tnctacgtnc cnccttctnn ttnactgnct ttaaatntta 600

ttancnncnt ntncnttncn atctncaant tttcnnnccn acnnnntttt nctnntnnca 660

aatcgcgnna aataagtntt gcncactcnn ntnctancnt attntccctc gcnnttntcn 720

tcatctcccg cnncactcac ntnnncnnnt caattnntnn nnacncncnc tgctctacnn 780

ncnatntctn tncctncaca ccctntancn tntcnctcan aatgcctttt ctnccttann 840

nctntcnttc ncnnatctan ccaantttnc tttnacatcc cctncnnntc tnncccgacn 900 

atatntnacc tcttnnatcn cagngcntan natcnccccn ttntcnctnt cnctctcann 960 

cttntnttna tcttcatnna tcanncncc 989 

<210> 547 
<211> 781 
<212> DNA 

<213> Homo sapiens 
<400> 547 

tgtnnctttn cnncctcnnc cgaaatcnct ttgnttctaa ctttcctaat tacctgggct 60 

acttgcacta tcccntcgat ncgcatagat ggccnngtta ctaanggtga ntttccagcg 120 

cggggggcac gtggagtcac tggaacattt gngcaatgct ggtgggaatg tcaacccgng 180 

cnggcctctg gaatangcct ggcnnntcct gcnagagtta ccntgtgacc cagcaattcc 240 

actcctagct ccacccacag gantngaaag cnaagacgca nacagatgcc tgngcnccaa 300

anttcacggc agcatcctnc gccatantgg cancatccgt cgtnacagcg gcatcatcct 360 

tcatcattac ggcancatcc gtcgtaacag cggctacatc acttcgccac agnggcagca 420 

tctgtngtca cagnggcngc anccttngcc aaagcggcag cntccttcgt catagcggna 480 

ncatnctttg ccatancngc naggtggaaa ccctgnccat ccactgaggc ntncatanac 540

tanncatggn cagtccaggg cactggaanc cangccgtng aacggcgccn acggtnanna 600

ggaatganac cntgatgcnc tggggccana catactggct anacanactt ggagacatca 660 

tgcttanttg nannnccant cacacttgcn nncggcgtna tcctgctcac gtgatncgac 720

ccgaatgggc acttcaaatg ggaanaaggg ngatggcact nccggtnncc tnganagggg 780 

n 781 

<210> 548 
<211> 735 
<212> DNA 

<213> Homo sapiens 
<400> 548 

tctaaacgct tggnncttgc tctttctnca ngnanccnnt gcgntncgaa ttcggcacga 60 

tctagatatt gcccaatcgc tgcccacagt gcacatacct ttccaccagt cacatgtgag 120 

agggcagatt ttccaaatgc tcatcaccac ttggcactgt gtggactata attttggcca 180 

gttaggaaat ggcatctcat tgttttcatc ttaatttgcg tcagcctgat tactcattga 240 

aacttgtgag gttgagaaac ttttcttaag cttattggcc attcaagttt cctcctttat 300 

gaaatggttg ttcatgtcat ttgctcattt ttatattaga ttgtttttct tttttccagc 360 

tgacttgtag gaactctaca tcttatcaat attaatcatt tatcgaaaac tatttgggtg 420 

ccattatctt ctcctagtca atgttttttg tttgtgatat cttttataat atataagttt 480 

ttaatgttgg cagaagtaaa gttaatcttt ttggctgtgt tgtgtgtctt gtttgatgta 540 

aagatagttt ctgtaatagt tttgcagttt gattgntcat ctttaggtct tcaattcaac 600 

ctgcacatcc atcccctcta tcctctttct tactctgttt ttctccatac cacttatcat 660 

ccaataatat ggtcatgccc tttattnacc ngntttgcat atataatttg gcttgtnccc 720 

ggttccttcc ctana 735



<210> 549
<211> 812
<212> DNA

<213> Homo sapiens
<400> 549

ttctaatact tggctctngt tctttcngca ggatcccatc gattcgaatt cggcacgagg 60

ggaaggagcg ggcgtgaggc cagctgaggc atggtgaccc ctgggaagga gcgggcgtga 120

ggccagctga ggcatggcga cccctgggaa ggancgggcg tgaggccagc ttgaggcatg 180

gtgacccctg ggaaggancg gncgtgaggc cagctgaggc atggtgaccc ctgggtacgg 240

gggacttggg ggccgacctt ggtttgccca gggcccctnc tgcaccacgg ccacatgcgg 300

aggacggcgt tgggatangc tccctgggtc cacagcttct gcccgtgtat tggggaaccc 360

tncttggtca aggcttcang ctcttggcag atggggcaag gaaccctgag gcttccgcgc 420

ccttccatgg nctctgatgt gggacacttg aacgangcac gattctgaag gactccatgg 480

atcttgggan gattangccc accttcngtt ggtggncnaa agccgtcctt ncggggcccg 540

gcttgtttaa cnggacaact tttcnggtcg ggcttgttgg gccccaatcn ttgggttggg 600

naanttcncc ttaaaccttg ggcccgncct tttaaccttt tttcccaatc ttttgacctt 660

tttccaaaaa ggggtncccc tgggcttttt ngggncaatt ggttccgggg gccaaaggtt 720

gggaaaaaat gccttncatt gggnaaaacc ctggatccct tgttaancct ttgggagntt 780

aaaatggaat gaattttccc cccgggcttt tt 812



<210> 550 
<211> 742 
<212> DNA 

<213> Homo sapiens 
<400> 550 

ggnnantcna tgctggtctt gtctctntct aaaagttggc nattcgaatt cggcacgagg 60 

ttctgtggct ggcatggtct gcctgctact ggagagatct cctgagantt cagttttgga 120 

ttggtgctgt catcttcctg ggaatgcttg anaaagctgt cttctntgcg gaatttcaga 180 

ntntccgntc caaaggagaa tntgtccagg gtgctttgat ccttgcaaag ctgctttcan 240

cagtgaaacg ctnactggct cgaaccctgg catcatagtc agtctgggat atggcatcgt 300 

caagccacgc cttggagtca ctcttcataa ggttgtagta ncaggagccc tctatctttt 360 

gtnctctgca tggaaggggt cctcagagta ctgggtattt tncttatccc ttgactctga 420

tagtaaacct ggccctntca gcagtttgac gcctgggtat ttatggatat taattagcct 480 

gactcaaaca atgaagcttt taaaacttcg gaggaacatt gtaaaactct ctttgtatcg 540 

gcatttcacc aacacgctta tttggcagtg gcagcatcca ttgggttaat catctggaca 600 

acccatgaag tcaanaatag tgacatgtca ntcggactgg ccggnagctn ttgggtagac 660 

catgccatnt ggcgccttgc tggtcttcca tgancccctt tggcaatcat gggtcttntg 720 

gcgaaccatt ttgcaaacaa ct 742 

<210> 551 
<211> 736 
<212> DNA 

<213> Homo sapiens 
<400> 551 

agtctaatgc tggtcttgtc ttttctaatg ctnggcgatt cgtcctggtg tcaaacacta 60 

taaacctttg accagctgag ctgtgactgg ctgtcacntn tctgagtcct gtgtgcacag 120 

tantntcctg ggtcaggtaa aatccaggtn ttcaagtttt aaggnttttt tgaanaattc 180 

gggcttnttt aanacgatcc ntgcccaant ccacaagctt gttgacagtg gnttacagtt 240 

ngngtggcaa agtccaagtt gttacactgn gctttaaaaa aaatcttatc tgcatgtatt 300 

gttaacttag agaccatgag atctatttat caggaccagg aagatncaca cttcaggtcc 360 

attgcaactg acttttttct tgtttttctt aaaaccctgg tggagcctgg gaagggggcc 420 

tccacaattc tgtggctttg atattagccc caattttaca agcacataca agccccataa 480 

ttgccgcagg aaaacacaag atggaaaatg caataaccca tgcactgaga cttagaaaat 540

catccttact aggcaaaatg tattatgatg caataagtgc cactgggnat tttnacgttg 600

ggactggnca ggaactgctg caaagaaaaa taacagctcc ttctccatta tttacattta 660 

agatgttggt ggggggaagg ttgggagaaa ttagttctga gggtatcata tgcctttttt 720 

aaagaaaatg ggaata 736 

<210> 552 
<211> 733 
<212> DNA 

<213> Homo sapiens 
<400> 552 

nagtttaann gtatgtcttg tcttttccaa gatcctatcc gattcgaatt cggcacgaga 60 

agtgtcagtt ttcctaatct cagtccaggt aggatttaaa aantntctca agtgttgatg 120 

ctntccaagc ntgttggggt ggaagggaat tggtgcccag aaaatgggac tggagtgagg 180 

aatatctttt cttttgagag tncccccagt taatttntnc tgtgcttnat tgctnctgtn 240 

ctttattgtg aatgttgtaa cattttaaaa atgttttgcc ntagcttttt aggacttggn 300 

gttaaaggag ccagtggtct ctctgggtgg gtnctataat gagttattgt gacccacagc 360 

ttgtgtggga ccacatcact tgttaataac acaaccttta aagtaaccca tcttccaggg 420 

gggttccttc atgttgccac tcctttttaa nggacaaact caggcaagga gcatgttttt 480 

tngtnattta caaaatctan cagactgtgg gtatccatat ttnaattgtc gggtgacaca 540 

tgttcttggt aactaaactc aaatatgtct ttctcatata tgtgctgatg gttttaataa 600 

atgtcaaagt tctcctgtta aaaaaaaaaa aaaaaaaaac tcgagccttt anaactntnt 660 

gagtcgtnta cntagatccn gacatgataa gatcatgatg agtttggaca accncactng 720 

aagcagtgaa aaa 733 

<210> 553 
<211> 870 
<212> DNA 
<213> Homo sapiens 

<400> 553 

nagttaanag taggtcttgt cttttgcaag atcntancga ttcgaattcg gcacgagtat 60 

ataacaactt ttgctttcaa agttgggtgg gactagancn cncantggaa ggntggagtc 120

agganacctg gattnttgng cccgntntgg nttttacagt ntgcctaant ttntgcagtn 180 

acttcntgcc ancctgtttc nttacntnca anagggaaag acantccttg gccagcctag 240 

ttttnagggt gaacgaaagg tcnttntcac tgcntcctct agtcatttgc ttcttcgnta 300 

attaacacat cttgagcacc tgcnatgttc caggaacagg agatggcanc gtgcaagata 360 

aagtccctga cttctagaga ctgcatgtta gtggcaatcg gcgtntaccc ggccttnaat 420 

aaactactga atgaaggaaa attctaccta caccagacac aattactggg gtttctaaaa 480 

tggaattatt cccccggccc cntgcatcca gcagcctgnt gcagggaaac tcctccnaaa 540 

ggcttgtaag gcaaggaanc cgggacaatg gcntggctat ttaagcttnc aacaagatgg 600 

ttacccctaa gtncctaatt ccctaacacc aagggggccc tttaccagga aaccaaaacc 660 

aggttaaaaa accccaaagt tgggnaaaaa gccatttgcc anccggggcc nttttaaaaa 720 

aaacctttna aaaacctttc ccttttaaaa ctttaccttc aagntaaaan tttaagggga 780

atgggnccaa nttttttaac canccccaaa aaaaanttng gnaatttttt ttcccnaaat 840 

tttttnaant tccccaaatt tnggaaaang 870

<210> 554
<211> 766
<212> DNA
<213> Homo sapiens

<400> 554

tatcaatgnt atgtntggtc tnttcgaaag anctagncgg ntcgaattcg gcacgagcca 60 

acacccagtt ctnactctgt catccaggct ggtgtgcagt ggtgcaatgt gggcttactg 120 

cagccttgac ctccaggaca agtgatctcc cacctnagcc tccggaatag ctgggactac 180 

agntcaacaa cgcccctctg aaagtaggac tcttggaaat gaaccttgtt gggagtaaag 240 

ctgaaccttc acctctcctt tccaggattc tactccattc atacggcctc acactgaatt 300 

aatgtttnta gcagccacat cacttngtta cccaattgat ctagtagtaa agtcttccca 360 

tctnttcatg taaaaaaaaa aannnaaaan gggnnaggaa ccntnangnt nnnaanaaaa 420 

aaaaaaanca ggnggngngc nttttttaac ctataacctg ntttnaggcc tttccccang 480 

tttntccnaa ncnnggttan taggggccna aagctaaccg natttttgnt cccntnaggt 540 

taggccngaa attaaccngg gtttaaagaa cncattgant aaagccttgc ctnggccaat 600 

tccgggaaaa gggaanagcc tccttgtttt acanattggg aaaaattggc cccaangggg 660 

gttaaccang tttgcccntt aataactnaa anggattttt gncaaaacct ggttccaagg 720 

ntttaanccc aanccttttn aaanntnggn cnctttggat gnaann 766 



<210> 555 
<211> 770 
<212> DNA 
<213> Homo sapiens 

<400> 555 

gttatccnat gngcgtntgt ngnnnncnct aanananttg gctngncgct gggccttgct 60 

tctctgagaa aactttggtc acacntccaa agccagggtg ggtgcctccn tgnaggaggg 120 

ggctttcctg gttggtggcn cagnaggagt ccaggctttg taccgtggac accatgggct 180 

atggcaacac cttcctaacc atccttccat gaggacctcg gnaganagtg gacatgaaac 240 

cctttgtgct ctgaancatt caacagaagc tttctggttc tgtgcctatt tctttggcac 300 

ttgancgtgt ttgcaggttc attacncaca tgatgaaagc tctggcccat agcactagaa 360 

ttcatgtttt nagggtttgt gagtgtgaca ggtgctatgg tttggatgtg gtttgtttcc 420 

accaaaactc ttgcttgaag tttaactgcc agcatggcaa ttgttggnag gtggggccta 480 

ccgggaggtg attgggtcat gggggcttga accctccgga atagattacn gctgcctcct 540 

ganaaagttc tacctgtcat gggggctgga tcagtcaaca ttgannantg gggttgttat 600 

aaagcaagac tnactcctta tgcaccgttt ntttgcatat gcccctctgg gggnancttc 660 

tttggctgct aacatttttg gacccaaccc aatgggcctt nacccagaaa nccggaacaa 720

aatgccnnnn gccattcctt tnngganctt tccaacttnc canaaataat 770 



<210> 556 
<211> 756 
<212> DNA 

<213> Homo sapiens 
<400> 556 

gtngtcnatg anatgtcttg cctnncgaag aacnaggcgn ntcggtagaa cagaaaatga 60 

gcatccgatt tcttcactaa aggagaccaa actggttcct tgcggcctag tnttnaagan 120

ctggancttg aaagtcctcc ttntaccaac tccacntcca ccccntnatt cccnttntcc 180 

caaagtncta ctgntgttgc ntgacanccc caaatntgtn ctgtcaacac aaacctgcct 240

ttggngtata aacagggcnt tacagaatgg tncaccctat atatttctgt tcagtatcca 300 

ttcactagtt cttcattaat aaatatcatc ttccccattc tgctgctgaa tgccacacat 360 

ccatccagtc tgagaaagtg agagaggcaa tcatgccaag aacaagccag caaagctctt 420

tcaccagatg tagactgtag ccctgctgcc ttccctccag cgagtctgcc agcatgcttc 480 

ttcatccttt taatatgtcc tttgcttcct acttccctgn cttccaacat actgtcactt 540 

actctggcag tcttctgctt ttcattaagc ctcaaaatct cctctgtcta cttggcacca 600

caagctatgt cctatatatg natttctgga cttggcangg atagttcaag gggtcttggc 660

aagtttttat ttaccttcat tatttaaaan gggccttttg gggatgttgg cctntttaag 720

gagccttttt ggggaaatca atacttctct taanaa 756



<210> 557 
<211> 742 
<212> DNA 
<213> Homo sapiens 

<400> 557 

tcgtcnaaan nnatgtcctg gctatccgca ggatccaggc ggntcgaatt cggcacgagt 60 

gatttttttg gttttttttt ttgntnttgn caaaagctta ntcntttcan ttaaaantgc 120 

cactantttg actttttaag taaaaantgt agggggtttt aaanctactt tcctnctncc 180 

aaaaantcag aaagtttcta nctttntaaa ttgggaaagc aagcantgtt ttaaaancac 240

tgaaggaatc tctttnttcg ngnccttttg ttaaactcgg tttaagctgt agaccttntt 300 

taaantaaaa tttaccacag aacaggaaat agaanctgtg gaagactcga aatacacctt 360 

tgtncttctc tgttcttcac ctgctctctc gctgtctcta cacacacaca cacaaacaca 420 

cacacaccta tatttgcatt aaaaatgggt agtaaaagca gtgaagggca aacagaaggt 480 

ccattncatc aagtaagagg ttgaatataa actggaccaa gtcttaattt tttatttcct 540 

tcattcggat ncgtttacta atttctttgc tagctttaag acttttaaaa cattctttgg 600 

ccctgggagg gagttgttta cccctaaact tggagaatcc tggccctaga ataaatgttc 660 

cttttaaacc cccanggccg gaaaattgaa tncngctgtg ccaaaaagga aaaaannnaa 720 

aaaaaaactc gnggcctnta na 742 

<210> 558 
<211> 730 
<212> DNA 

<213> Homo sapiens 
<400> 558 

gggtcnntaa tntnnagcnt gtnaaacccc tgagncttnc gggncgttca caaagaaaca 60 

tttaataggg acttncaanc aaataattnt cggtttntca ggtggcagca agacaagatg 120 

gtggatcccc atgccattac ctgctagact cagggttnat atactgtagt ggaaaggtga 180 

ttccgaagga atgttgtaag acaattgaag tgcagtanca tcaaagttat ttgacctaag 240 

ggcaggagtt ncagtaagta tccactttta tncaagaaac antagataaa ctggaaatct 300 

tggagccctt cctggaactg gggttaatga gaagtcaaca tggtggatta ncatggaaga 360 

tggagttgct tagtctccca ttcaagatgg agtttcttta gcctccattg atagggagtn 420 

tttaacaaaa ncangaaata agtctttgat ccattgaatc tctaagagtg agcccttgat 480 

gactcaggtt taaacagtnc tgagacaatt taggagatag ttttgaagnt caatttgaat 540 

tgtaaaaggt caggattttt taactttttc acatctttga anaaaagccc atagagcgca 600 

agttttcagc aaganctgga aancnatatt nctatggaat taaatagctc ctcagggcaa 660 

tcaattnggc ctggganaac ataatgcttc aanggctgan gnaatctgga atttctatgg 720 

gatttcttca 730 

<210> 559 
<211> 743 
<212> DNA 
<213> Homo sapiens 

<400> 559 

gttagtctat aangtnngnt atgtactngc cctttccgnn ggatcccntc gnttcgaatt 60

cggcacgaga ggaaacaccc ccttataaaa ccatcatntc aggctgggtg atctgacaga 120

gctagacact gtcaaacaaa caaacaaaca aacaaaaaaa ccccatcaca tctcatgaga 180 

cttatttact atcatgagag cagctcagga aacacccact cccgtgattc agttacatcc 240

cactgggtct gtcccacaaa ttgtgggagc tacaattcaa gatgaggttt gggtggggac 300 

acagccaaac cctatcacca tgtaaaataa tatctaattt gtagagatta aagaacaaga 360 

taacttaaat cttggatgta agttaagaga gtggtggtca gagttaaatc attttaaggt 420 

tcatttattg tctggacaag aataaaattt tgattatcag gaaatacaag taaaaccaca 480 

gggagacatt gnttatatcc aaattgtcaa aaattacaaa gtcttataat accaagtttt 540

gctganggtg tggagcaaca gaaacttttg ttcactggtg ggtatataaa ttgaataatt 600 

tcagcttgga cattacctag caaaattgaa ggctgtatac gtacatacct accaatctag 660 

caattcactt ctagatatta agtcttgaaa aactcacatg tttccagaga cgtgttaaaa 720 

ggtggttaaa tcattntgng aat 743 

<210> 560 
<211> 833 
<212> DNA 

<213> Homo sapiens 
<400> 560 

atccngttct ntannnngtc tngttctttc tncacgatcn nntgcgattc gaattcggca 60 

cgaggggtcc tggtgggagt tccatccagc agtgagtgca ttttttcccc agagcagtta 120 

agggtcttat taaaagccac cactttgctg aggcctgtac aggccttggg ggtttgggga 180 

agagaantaa ggcaggcact tgtcccttca gggagggact tgtcontact gggaggtttg 240

gggttgacct tggctccagc agagataccc agcctggcnt ggaagggcag gtcttgagct 300 

tacgcttgac tgcaagggca agctgcaggc ctcttctgcc ttcccctgca ttcaccaagg 360 

acaagtagga ccaagaagtc aagggaaaag tgccaagata gatctattcc catttctttc 420 

ttccacctgg agaattcctg agctatgctt caaacctctt ttgggccagg gaaagactgg 480 

gggacatttt ttagtcaagg atgctttaag aaagtaaatt cctgcttggg ggcccaggcc 540 

ttctttttca agggcttgct tgtgaatgcc caaccaaaaa aaaggggccc ccaaggccca 600 

atcccttact tcctnggtcc ccccaaaaag ggatnccaan ttggggaatt gggaaaactt 660 

gggcanncac ccnaanccca ctttggtagg anttnaccaa cccaaccaac ccaaaaccan 720 

cccacccaaa ttnaaaaaaa ggccaaaacc accaaccaac cnaaacccnn annnnnnnnn 780 

nannnnnnnn nnnaaaaaaa ctttgangcc ttttaaaaac tntttngngn ggn 833 

<210> 561 
<211> 773 
<212> DNA 
<213> Homo sapiens 

<400> 561 

tagtctaatg tnnnaaantn ngcnctngtt ctttctgcag gatcccatcg attcgaattc 60 

ggcacgagga agaggaggct gtgtatgagg aacctccaga gcaggagacc ttctacgagc 120 

agcccccact ggtgcagcag caaggtgctg gctctgagca cattgaccac cacattcagg 180 

gccaggggct cagtgggcaa gggctctgtg cccgtgccct gtacgactac caggcagccg 240

acgacacaga gatctccttt gaccccgaga acctcatcac gggcatcgag gtgatcgacg 300 

aagctggtgg cgtggctatg gccggatggc cattttggca tgttccctgc caactacgtt 360 

ggagctcatt gagtgangct ganggcacat cttgcccttc cctctnaaca tggcttcctt 420

attgctggaa gaagaagcct gggaattgac attcagcact cttncaggaa taggaccccc 480 

agtgangatg aagcctcagg gcttccttcc ggcttggcag actaacctgt caccccaaat 540 

gcagcaatgg cctggtgatt nccacacatn ctttcttgca ttcccccgac cttccagaca 600 

gctttggctc ttgcccctga caggatactt gagccnagcc cttgcctgtn ggccaaaccc 660

tgaattgggc cacttgccaa acttgcnggg gaaagggttc cttgaaacaa gggggccatt 720

tttggggaag gcttcttggc ttggcctttt ggcatttnaa tttggccttt ttt 773



<210> 562 
<211> 655 
<212> DNA 

<213> Homo sapiens 
<400> 562 

nnatanacat taangnnaga ngntgagnan ttnccntcgc tctntganna naaggcgncg 60 

cgaattcggc acgaggccac cggtctcttc ctaatctgca cattntattt tgggtatttc 120 

tgggcgggca gttcctttgc atgtttcggg agaggtttgt tgatttgggg cttatatgtc 180 

aggcctttgg tttgcgtctt attttagggg ttgtttgggg gcctgggtgg tcggcctcac 240 

atgggaaggg gatgggtagt ggatggggtt tctgtcgnat cttgnggccg gtgattttgc 300 

tnncgnnctg tttcacattc ttccccctcc acaagccaaa tcgttcattt ggntncactg 360 

tgtggactgt ctgagcttgc cctgccagaa aaatttgggg ctaggcaccc aggtgcanac 420

tttggaagaa gcantccacc tgtgggtacc gcatctcgtg ngtcccactg gcaggctgaa 480 

cctacttgaa catggaaaca gcatgcccat atggcaaagg ggccnnnacn nnngnnnaaa 540 

tnnannannn ncngacannc nncnnaatca ngannntcna cannnatcnn annnnancnn 600 

nncaantacn ncnaaaacac accnnccana annnnnaann nnnnnncann nnnac 655 

<210> 563 
<211> 738 
<212> DNA 

<213> Homo sapiens 
<400> 563 

tnntaatgct ggaattcctn atncttgggc tactcgttct ttctncagga tcccntgcga 60 

ttcgcagaaa agagtatagt aggggatgac caaggtcaaa gtgggtaaag aagactcatc 120 

atccactgag tttgtagaaa aacggagagc agctcttgaa aggtatcttc aaagaacagt 180 

aaaacatcca actttactac aggatcctga tttaaggcag ttcttggaaa gttcagagct 240 

gcctagagca gttaatacac aggctctgag tggagcagga atattgagga tggtgaacaa 300 

ggctgccgac gctgtcaaca aaatgacaat caagatgaat gaatcggatg catggtttga 360 

agaaaagcag cagcaatttg agaatctgga tcagcaactt aggaaacttc atgtcagtgt 420

tgaagccttg gtctgtcata gaaaagaact ttcagccaac acagctgcct ttgctaaaag 480 

tgctgccatg ttaggtaatt ctgaggatca tactgcttta tctagagctt tgtctcaact 540 

tgcagaggtt gaggagaaga tagacccagt tccatcaaga acaagctttt gctgactttt 600 

atatgttttc agaactactt aatgactaca ttcgcttatt gctgcagtga aaagngtgtt 660 

tgccatcgat gaatgctgca gaaatgggaa gatctcaaat tctttgctca aaaacgtgaa 720

cttaaccaaa atgatggt 738 

<210> 564 
<211> 798 
<212> DNA 

<213> Homo sapiens 
<400> 564 

nggggngtct aatgctgcnc nnatcnannc anggnnctcg ctctngctcn acnnanaagg 60 

cgntgngtgt gccaccacac ccagctcatt attattatta ttattattat tattttgaga 120 

cgaagtttca ctcttatccc ccaggctgga gtgcaatggt gcgatactgg ctcactgcaa 180

cctctgcctc ctgggttcaa gcggttctcc tgccttggca ggcacctgta gtgtcagcta 240

ctcgaagctg aggtgggaga atcgcttgaa cctggggggc ggagattgca atggtgtggt 300

ctcggctcac tgcactcgag cctggcgaca gagcaagact ctgtctcaaa aaaaaaaaaa 360

aaaaaaactc gagccntnna actattngng aggtcgtatt acgtagatcc agacattgat 420 

aagatccatt gatgaagttt gggccaaacc ncaacttgaa tgcnnngaaa aaaagcttaa 480 

ttgggaaaat ttgggaatgc ctatngcttt atttggaacc ctttntaagc tgcaantaaa 540 

acaagttaan caccncccaa ttggcntcca ttttaatgtt tncagggttn aggggggaag 600 

gttttgggaa ggttttttna aattcncggg ccnnggggnc ccaatgcttt ggggccccgg 660 

gtncccaann ttttgggncc cttttaangg gngggnttan attggccccc cttgggggna 720 

aaancgnggn anatacctng gtcccctgtg nanaaatngg nttcccntta caaaatttcc 780

cacnnanatt tnngnncc 798 

<210> 565 
<211> 744 
<212> DNA 
<213> Homo sapiens 



<400> 565 

ttntnngttt naatnntcnn ggnntcgntc tnnctcnaan nanaataggt ttggcgaatt 60 

cggcacgagc atgctggcca gcatccctgc ctgtgcaagc tctggatgag ctgtgtgccc 120

ctgccacnca cacccngcac tccctgccag cctggcctca gggcctctga tccatgtgca 180 

ctggagtggt gatgactgac agggccactg gggcatttnc acgttaacag cagctgccac 240 

tggcaaaaga agtgactcgc caatggtggc atctcagatg tgggcccagg agtctgggga 300 

gctactttga acagggctat ccattcattg tcccaccaaa ggctatggag cccacccacc 360 

atgtgctgga gtagtcaagg gaaataagac actctccttg tccttgttaa ctcaatcaac 420

aagcatttgc agagcaccgc ctatatgccg gcgctgtccg aagtgctgaa gatacagcaa 480 

tgagctaagt aagcactgac ttcgtagaaa accataacat cggccatctt tggaaaagag 540 

aaaaacaatg gagttactta tttaaaaaaa aaagaaagaa agttatctct tccanganag 600 

gctagaagta cttttctgct ttttggccag tgcccantgg aatgcctggt ttgggggaag 660 

aagaagggac tgggttaact gtggtgcttt tgttgtaaaa aggcanctgg cctttgtact 720 

tgaggagaaa natggagcct tggg 744 



<210> 566 
<211> 756 
<212> DNA 
<213> Homo sapiens 



<400> 566 

gnagtnntat tgatttntct ccgtgaatcg ttctnnctnn annanaagtg ngttnngccg 60 

ctggctatgt ggacgctggg gcagagccag gccggagtcg aatgatcagc caggaagagt 120 

ttgccaggca gctacagctc tctgatcctc agacggtggc tggtgccttt ggctacttcc 180 

agcaggatac caagggtttg gtggacttcc gagatgtggc ccttgcacta gcagctctgg 240 

atgggggcag gagcctggaa gagctaactc gtctggcctt tgaggtaatg gggggtggcg 300 

gtggtggggg gtgcttantg gctatgctca ccccgctnca ttangcctat tttggtctgc 360 

tgtttccaaa tgcttctana tctaggcatt tggtatccaa cctattgcca cantgcctan 420

aactncanac ccccngccnc tatgntnana cctacttggc acaagaacaa nngnanacnt 480 

tgnnnatatn ccanaangnn naanattaca nantnttata ataccaattn ntnttgangg 540 

tgttnnnnnc anaaacnttt gntnacngnn nnnnntatna atnnataatt nnnnntttgn 600 

nancannanc tatgnnnaat taaangnntn tntncnnnnc nnnacnnnna nnnnnnttan 660 

nnanttncnn ttnnnntnnn nnnnnnnnnt tnaanaannt nnnnnttnat nnnannnncn 720

nctnnaangt ntntttnnnn nnatnnnnnn nnnncg 756

<210> 567
<211> 746 
<212> DNA 

<213> Homo sapiens 
<400> 567 

gnntgtnntt nncnnannnn anganagagn tactcgctct ntctctacga tanantgngt 60 

tncgaattcg gcacgagatt tcctccagtc ctgggcccca tccttnaggg ccttcccagc 120 

cagccagcag gagaggcaag aactggggga acacaggaac ctaggggagg aggggagcgc 180 

tgggcatcct caggctggcg gccaagcctg cccctggagg cactagagga gggcatctgt 240 

ctgtgggagc ccagagctgc agggaggagg aggagggagg tatctggtgt gagcgttgcc 300 

cctgcgacat ttgggaccac acaggtgggc ttccttattc cctgacaaag cctctgtttc 360 

cagctcttcc gccctctctg gatgagggaa cagaagtgga ggaaacaaaa gaagcagcag 420 

cacgcacagt cctgtcgctg ggtgcggaga cagcctggca aagtcccact cagccatggc 480 

ctgatgcang ccccagccct nctttcttgg gtgtcaaatg actgtgtcct ggacatctga 540

tgcaccacct gccctgcctg ttgcaaacgt gatgctcccg gatggaatgg agaaactagg 600 

agactgggac aagcaaaang ctgcaaacaa cccagaaccc attcttagaa nactggagaa 660 

atgattgagg aatcattggc accgtggncc tgtgcttcat nacaaacacc tttnagaaca 720

acttgggatt gaaaaaccaa gacant 746

<210> 568 
<211> 738 
<212> DNA 

<213> Homo sapiens 
<400> 568 

gnnnntngtn gttcttanng ttnggatctc gttctttctn cacgatcncn tcgattcggt 60 

ctgggcagcc tacgctttcc ggataaaaat ggcagaatga aagaaattat gagtggaact 120

agagaatagg aaagacatga accaacgccc aaaatgagaa agaaggacat ataaagaaaa 180 

agacaaatac aagtgaaaaa aatagactaa tggattaacg tccctgtcgt gtgacatttt 240

ctgctatgga aatgatatta gacaaaaagc acttcaagtg gttttcttat ttgagttcaa 300 

aatgggtcat aacgcagcag agataacttg aaacatgaac agcgcatttg gcccaggaac 360 

tactaacgaa catacagggc agctgtgatt caagaagttt tgcaaagcag actagagcct 420 

tgaatatgag gaacacagtg gccagccatt ggatgcttca cttcttgaag catcttgaca 480 

gctttttgca ggtgaaatgc ttncacacca gcaggatgca gaaaaatgct ttccaagagt 540

ttgttgaatn cagaacatgg atgtttatgc tgcaggaatt aacaaattta tttctcgttg 600 

gcaaaaaagt gttgattgna atgggtccta tttgattaat aaagatgtgt ttgagcctaa 660 

aaaaaaaaan nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 720 

nnnnnnnnnn nnnnnnat 738 

<210> 569 
<211> 753 
<212> DNA 

<213> Homo sapiens 
<400> 569 

gtttntgant ntgattctta tgcttngnct aatgctttnt ctnnangatc ccnncgattc 60 

gctggaggag aggagctcag agttctacag agtctttnct gaacaatatc agaaagctgc 120 

tgaagaggtg gaagcaaagt tcaagcgata tgagtctcat ccagtctgtg ctgatctgca 180

ggccaaaatt cttcagtgtt accgtgagaa cacccaccag accctcaaat gctccgctct 240 

ggccacccag tatatgcact gtgtcaatca tgccaaacag agcatgcttg agaagggagg 300 
ataaaaactt tcagaatgag caaaacacca tcaacgttaa ttccagagat ggaacatttt 360

ttttcctagt gagaaaacaa cccatttgaa gagaagaccc taatgagaag accctaaaga 420

gagacatcaa gaatggattc agcagaatca tttcacgttt tgaacagcag cagtttgaan 480

ggccaaagcc tttgatcagg gatcccgtca ttaaaggaca ctcttgagta ttagtaaacc 540

ctcttatgat gattaaaaga gaagggcagc cctnttcacc tttttggtct ttctattcaa 600

cttgcctgac cataaaatgg ttctcttctg nacaaagccc catcatttgg tgaacctcac 660

ccttaacaaa gtaggattgg ggttgggggg cttaattaat tggaatgggg ccaaggagaa 720

gagcccgaaa ccttagatnc cangggnana agt 753



<210> 570 
<211> 832 
<212> DNA 
<213> Homo sapiens 

<400> 570 

tnatnaataa ggtttgantt cttatgcttn ccaanngctt ggacctannt anccangcgg 60 

tgcgaattcg gcacgagcca ggccccaata atctgggntt naaactttga ggaaatgcca 120 

gtgacttatt ccagagtgcc tcagttaggg gaacttctct gtaaagaacc ctgggtattg 180 

agcaaaaacc ttattatcgt taatgaccta taattggaag cttcctgcct ttttctttgg 240 

ttgctcctgt ggaaaatact gaaaagatta ctttgtttta ttttgttgtc tttttataaa 300 

aggggaggtg gagagacccc ttcagagcag ggattgtgcc gggagagtgc ctctgacttt 360 

gggacatttc atccacagaa atttncaagc caatggtttc ttttgggttt tgggttttta 420 

tgtttgnttt ttgggggttt ggaaaaacat gcatttttac cgtgcacgta aaattggtca 480 

nagaaaaagg gagcccagaa aangcagcan atgggccatg cccctttgct gggttttcct 540 

tttcttttgg gactgtnaag gggaaatggg tttttanaag gtgaaggttt ggtcctgttg 600 

gaaggaaaag aantgtctct gttngggggg acaanaaggn acccttgggg gaggtccatt 660 

cgcaatggtn cctaccaaaa cnnggntctt taanaacacc ngggcctttg ncccaggnaa 720 

aaaaccctgg gcccctttaa naaactttgg nangggaacc ccggaaaacc cccttgggcc 780 

ttnccaaatc ttttttccca aagncncccc cggggggccc aaaaaaaaac ct 832 

<210> 571 
<211> 748 
<212> DNA 
<213> Homo sapiens 

<400> 571 

agtnttaatn ntggacttct aanganttng gctnntcgnt tggaannnnn cagtnctcta 60 

nnagcccatc gatgcgaatt cggcacgagg ctaggattac aggtgtgagc caccatgccc 120 

agccacttat ctttaaagga ttaagtttat gtttcctact atgggaaacc atcccacccc 180 

aaacttgatg accgcattat gtgcttttat agaacatggc acttctccag gatagcattt 240 

attctgtttt gtaagtgtga atgtaattac cctacacaca gcatacacat aatcttcata 300 

ttctttgcct tgtcttgtga aggcaagggc catgtctatc ttattcgtca ttagattccc 360 

acatccaaca tagtcctggg gacagcacca atgcactttt ggtgcataag caaatagtgc 420 

atttatagct cttacctaca atatctgata gactaatcaa atatagtagg ttatctgggc 480 

ctttttgatt catgtctcta gcttaacttt catttttttc ttatttggta tctctcactt 540 

tgccttttga tatactctta cagtttcgct cactgagtaa aagaaaatnt aaacagcaag 600 

aagtaaactt gtgttttatg gatttngata acatcttcta aaagaccccc caagattgtt 660 

gatgtctaaa aaaattaaag ggccttcaac tcataataat acttaatagt tcttaaaata 720 

ttacaaactg attggaacat tgcctaac 748 



<210> 572 
<211> 755
<212> DNA 
<213> Homo sapiens 



<400> 572 

agtcttatta nnnngttcta atcctttctt aangagnnta ggctactcgt nctttctgca 60 

ggtatcccnt gcgatncgaa ttcggcacga ggctgagcac ctttggaaac aacatttaag 120 

ggaatgtgag cacaatgcat aatgtcttta aaaagcatgt tgtgatgtac acattttgta 180 

attacctttt ttgttgtttt gtagcaacca tttgtaaaac attccaaata attccacagt 240 

cctgaagcag caatcgaatc cctttctcac ttttggaagg tgacttttca ccttaatgca 300 

tattcccctc tccatagagg agaggaaaag gtgtaggcct gccttaccga gagccaaaca 360 

gagcccaggg agactccgct gtgggaaacc tcattgttct gtacaaagta ctagctaaac 420 

cagaaaggtg attccaggag gagttagcca aacaacanca aaaacaaaaa atgtgctgtt 480 

caagttttca gctttaagat atctttggat aatgttattt ctatctttat ttttttcatt 540 

anaagttacc anattaagat ggtaagacct ctgagaccaa aattttgtcc catctctacc 600 

ccctnacaac tgcttacaga atggatcatg tcccccttat gttgaggtga ccacttaatt 660 

gctttnctgc ctccttgaaa gaaagaaaag aaagaagact gtgtttttgc cactgattta 720 

accatgtgaa actcatctna ttaccctttt ctngg 755 



<210> 573 
<211> 743 
<212> DNA 
<213> Homo sapiens 



<400> 573 

cangtctaat gctggctctn atcggttctt nnnantnaag ntactcgttc tttctncang 60 

natcnnntgc gntncgctca cacagcatgt gtcagatcca tggggtagga gtcggccaga 120 

gacttggtaa cagacagatt gctggatccc acccctagac tctctgattc agttagtttg 180

gggtaaggcg caagactgaa tttttcacaa gtttcccagt ggtgctgata cttctggtcc 240

aggaacttag tggggagaga acgactaatc tagaccattt cacttcacat tctgagcttc 300 

ttgtcactgt cacactgcat ccttttaaca atgcattccc tatcctattg caatactgac 360 

atctcatcaa tattttaaaa catgcgtttt cagaaacaat attttatatc aaatactcac 420

ttttagtaat atttctgcaa ttttgcccta tggatctgag atctaacaaa tactattctg 480 

gacatgggct acaacagttg aggctggaag taaaaatgtt aaaccctgct gaccacgtta 540 

ttttaaagtg tattttagtt aagaataata tggcttagga gcagggctaa acagtagcag 600 

tcacatgggg aatgatactt tgcttttgca cataaaatgt cctgaaggga aaaaataaag 660 

cagaaaattn ncagatgaac tgaaaatctg tacaaatgtt gggctgaata ctgccagcgt 720 

tgangtgtag gaaaatgaac cnt 743 

<210> 574 
<211> 737 
<212> DNA 
<213> Homo sapiens 



<400> 574 

ccgtctaatg ctggnttcta atcgctttct taangctcnn gggctcgntc tcnctncacg 60 

cagcccggcg gtgcgaattc ggcacgaggg gattacaggc atgacccacc gcgcccagcc 120 

tgtnatttct tatactntgt attttggnct tgtattatgc ttctgatacg ctataattat 180 

ttatgtccat gtncntttct tcaatagact gtgaactctt cgaatgtngg actcctagag 240 

ctagatnctc nattattnnn tattaaattg aatgacttgn aactacagat cctttattta 300 

aacttcccaa atttctgctt tatctaggcn actctttaaa ttcttttatc tcatgtagat 360 
ttcanaggct gaaataattg agatttttag tttgaagaaa agagaactgn ggatttaatg 420

gcnttattat tatattttta atggctgttt gggagtnagg ttgcagacat tggtcacttt 480

cctcctaaat ncttaaatat ttcctaaaaa caggncattc tttnttttnt tatggagtct 540

ggctctggcn tccaggctgg antgccgngg cccatcttgg cttactgcag ctccccttcc 600

cgattcncgc tggtctcctg nctngctgct cgggaggctn aggccnggga atcgttgacc 660

ccggaggcgg aggttncnan agcctnnacg ggccctnggn ctcccggctg ggtacnngac 720

cggacctccg nctgnat 737 



<210> 575 

<211> 766 

<212> DNA 

<213> Homo sapiens 



<400> 575 

gnagttnaaa agcggntttt antcctctcn aatcngnttg ggctactngc tctttctgna 60

ggnatcccat cgattcgaat tcggcacgag ctttctccct ctgtgcctcc tgcttccttt 120

ctctctcctg cctctcctct gctccccatc ccactttctc atctgcctcc ttttctcact 180

tctgtcagtc tgtaagcttt gataacctgc ttaatactcc aaagtgtgag ttcctctgat 240

ctcttgattc cttagttcta atctcacgtt ttgtttttaa gagatggagt ctctcactct 300

gtggcccagg ctggagtgca gtggcatgat catagctcat tgcatccttg aaatcctggg 360

ctcaggtgat cctnccgcct gagcctcctg agtatctggg actacagatg cgtgccacca 420

agcctggcta attttgtctc atgtcttcta aaaattattt tgtgaagccc cttcacaaaa 480

aaccttaang gaaatctgat ggtgctcagg aatctaactc tccctaaacc atcctctttt 540

aactgcttct aaaatatctc tggtggcctt tcttagcctt tttctggttc attcaatgct 600

tcaaagcgct ttttgnttct aagttgagtn ctttgggggt ttgacaggta gtgacgtgta 660

gttttgacac tgttaacttg ttnaatacag tgaaaangtt tgtgaagtga aaaatgcttg 720

anaaagaatg gnaatgcctt tntacaaata aaagtnttgt taaaat 766

<210> 576 
<211> 761 
<212> DNA 

<213> Homo sapiens 



<400> 576 

ggggtnnnna gngnnttgan cccctttctt attatcaagg ngctngcnct nnctnnannn 60

ancacaggcg ntgngaattc ggcacgagaa gataacctct taatgcattc atgttgtata 120

tgaaggaaat gagagcaaag gtcgtagctg agtgcacgtt gaaagaaagc gcggccatca 180

accagatcct tgggcngagg tggcatgcac tgtccagtag tatttattgc tttagagatt 240

gcttgtcgta cctgtatgtc gtcccttttt aaatatgttt tcctttttct tgaaactgta 300

taaagttttt ttccccctta gcataagcat cttatatata acaactcatt tgtacaaggt 360

ttttaagttt atatataaaa tgtgtatata tatttttgnt tccccttttt gacttttttt 420

ttctgtatga aacccagatg tcaccaaatg gacattaata gttgcattaa ggatcagtag 480

cattaacaaa agttgcttta aaagccatta tgtaaaacaa gacttgaaaa tgagtgaggg 540

aattttagcg acactgtctg agcacagtgg gaaccatctt cgtttcccct ttgaactcca 600

antgggatgc cctaccctgg cgccccttag gaccccggac tggcccgngt acaaaacttt 660

accgtgccaa aattcttaag tgaatttacc tttctncctc tttttgaagc tngaaatttt 720

tggtcatcan gntttgcttg tgatngtaca tanggtngaa n 761 

<210> 577 
<211> 803 
<212> DNA 
<213> Homo sapiens



<400> 577 



gggtngttnn nnngtggnnt tnttnnnngt ttctaatnnt cgnggngntc ganctnnctc 60

nananagaat aggtttgnga attcggcacg aggtctcccg cccggcgccc ccagtgtttt 120

ctgagggcgg aaatggccaa ttcgggcctg cagttgctgg gcttctccat ggccctgctg 180

ggctgggtgg ggtctggtgg cctgcaccgn catcccgcag tggcagatga gctcctatgc 240

gggtgacaac atcatcacgg tccagccatg tacaangggc tgtggatgga ctgcgtcacg 300

cagcctctag aactatagtg agtcgtatta cgtagatcca gacatgataa gatcattgat 360

gagtttggac aaaccacaac tagaatgcag tgaaaaaaat gctttatttg tgaaatttgt 420

gatgctattg ctttatttgt aaccattata agctgcaata aacaaagtta acaacaacaa 480

ttgcattcat tttatgttca agttcagggg gaggtgttgg aggtttttta aatnnncggc 540

cncngcgcca atgcattggg ccccgtaccc acttttggtn cctttaantg aagggtttaa 600

tttgccccnc tntgccgtaa ttcatgggnc atanncttgn tttcctggng ttgaaaattg 660

gntaatcccc ttcnacaaat ttccnccaca atcatttacc aaaccccngg gaggcctttn 720

aaagnngtna aaanccctgg gggtggccct taatttaagt ggrmccttaa ctcncnttta 780

antgccnttg cccttcactg cct 803



<210> 578 
<211> 738 
<212> DNA 
<213> Homo sapiens 

<400> 578 

tcgtcccntn gatcggggta acgtccttnc ctatnaaant tctttcggga aagcagaaac 60 

caagctggca gaagcacaga tagaagagct ntcgtcagaa aacacaggag gaaggggagg 120 

agcgggctga gtcggagcag gaggcctacc tgcgtgagga ttgagggcct gagcacactg 180 

ccctgtctcc ccactcagtg gggaaagcag gggcagatgc caccctgccc agggttggca 240 

tgactgtctg tgcaccgaga agaggcggca gatcctgccc tggccaatca ggcgagacgc 300 

ctttgtgagc tgtgagtgcc tcctgtggtc tcaggcttgc gctggacctg gttcttagcc 360 

cttgggcact gcaccctgtt taacatttca ccccactctg tacagctgct cttacccatt 420 

ttttttacct cacacccaaa gcattttgcc tacctgggtc agagagagga gtcctttttg 480 

tcatgccctt aagttcagca actgtttaac ctgttttcag tcttatttac gtcgtcaaaa 540 

atgatttagt acttgttccc tctgttggga tgccagttgt ggcaagggga ggggaacctg 600 

tccagtttgt accatttctt tgnatgtatt tctgatgtgn tctcttgatc tgcccccact 660 

gtcctgtgaa ggacagctna ngncaaggag tgaaaaactt tacttcttaa aaaaaaaaan 720

nnnnnnnnnn nnnnnnaa 738 

<210> 579 
<211> 758 
<212> DNA 
<213> Homo sapiens 

<400> 579 

gnngtgncta nctaaatnnt tggntntaaa cgtncnttct gcatnatccc tnnttgacga 60 

attnggcacg agacagagtc ctgaaatatg caaatgaagt aaattctgat gctggcgcct 120 

tcaagaaaca gcctaaagga cctgcctgat gtgcaagagc tcatcactca agtgcggtca 180 

gagaagtgct ccctgcaggc cgaagccatc cttgatgcaa acgacgctca tcaaacagag 240 

accttctcct cccaagtcaa agggacaaat aagcctctgg gttgaacggt ttgagacatt 300 

ctgccttgga ccttcccttg tcaccaaaca agccaacctt gtgcacttcc accaggcttt 360 

cacccattcc ctgcaagcct tggtcttttg acctggccct caaccatgtg gctttccacc 420 
ccttgaggac aagttggaac agaagaccaa gagtggcctc actggataca tcaanggcat 480

ctttggattc aggagctaac caggctcttn ctcgggggcg ggggagattc tgactcttaa 540

tctggattgt gagaaaaatc cagcaagttc catgatattt aaatccaggt ctgcattggc 600

ccggggcaag agtttaacat cttcgggccc tgcatttcct acatcttggg gtctgtacac 660

gttcttaagc aagcgtgtca ngagagcacc ctgttggctt cttggtaaaa tgtgtgcaag 720

gtcatnctgt cttctgnacc ttctggggaa aagggncc 758



<210> 580 
<211> 816 
<212> DNA 
<213> Homo sapiens 

<400> 580 

tttctaaatn gcttgggttt cnaaatccct tggttgacgc cctcgcctaa nntggcgtgn 60 

nantgcccnc gattcgctgn caagtctgga antcatattg gagcctgngt ngactgaaaa 120 

ctcagcanga gttgatgtta aagtcttggg tctgaaattn gtngggcagg agattaggct 180 

ggaaactcag gcagaatttc tgtgttacaa tcttgaggca taattcttct ccaaaaaaat 240 

ctccattttt ttctcttaaa gccttggatg agccttggat gattggatga ggactaccca 300 

cattatctag ggtaatctcc tttgcttaaa gtaaactcac tgtgttaatc acatcaacaa 360 

aataccttca cagctacatg tagtgtttga ccaaacaact aggcaccata gcctagccac 420 

ataaaattac tatcattata ctttgtctta tcacatactt ctaccttgga agggatattt 480 

cccagttggt atagctacaa aacagaggca gatcatttag cctgcattng attngtantg 540 

aaaaataagc ctttggtgng tttaaccact gaaaatgttt gcggcctatt agtantngca 600 

caacttatcc tatnctggcc aaacatagaa tgctttcggt ttgcaaggta acangatccc 660 

ctttacagnt gtacnaaaaa tnancnntaa aaaaactnga gccctntaga acntnntagt 720 

ggagtcggan ttaacgttng ancccagacc ntggattang gatncattgg atggagtttg 780 

gacataccac cancttggaa tggcnantga aaaaaa 816 

<210> 581 
<211> 868 
<212> DNA 
<213> Homo sapiens 

<400> 581 

ccnnganncn nnccnnnnnc nnacaaaanc nnnnnnnann nnnnnnancn nnnnnnctct 60 

tcnaannctg ctnacgccca nagcatgacc cacgatcgaa tcggcacgag gttgcaagca 120 

gccttggaat agtaactctt ctcatttgtt tgggatctgg ccaccaagtn ccagaatgat 180 

acacggatca gngcanaagn tcatcaggct ctcggacctt agggctgntg gagaagcttc 240 

agcagcagaa ctgatggtga aggctcgtgt tctccatcct caactttctt tgcttcgatc 300 

atacacaaga atacattngg aagggcaaaa aaatgaacac tgtcgnncat tgcagcccgn 360

gtttngtgac acagatgcac agtctgcttg tgaagacctt ctctcaagtg gcatttggga 420 

gtccatgcca gancatggtg cttcatgaga gactgacagc tatcaggggt tgnggcactt 480 

agngaggact ctcctccccc agtgtgtgct gatgacacat cacacctgac aatagctnga 540

agnctnctct gacccntntt actctgtagc caacatacca catganttta aaacccnttc 600 

taaaatatcc aancaatggg gtcatacntg gcccaaatgc cagaantcna gagcctaata 660 

ggacttccaa tnattaactt tnccaaannc gaaaaaagna gggcnttccn nttatggcaa 720 

aaaaatnaan nnaaaaggan atntggnatn gttngccnaa aaaaaaagcc cnntnngaaa 780 

cctaatanga ggaggtccca cttaaccggn cgnancccca gaacantgga atacaggant 840 

accnatngga ntgaanattt ggggancc 868 



<210> 582 
<211> 745
<212> DNA 
<213> Homo sapiens 

<400> 582 

ttctgaatac cttnttacnc gccttcttca ggantttcaa gacctaattc ggcacgagac 60 

cctttctgcc ttctgtttgg gacccagctg gtgttctttg gtttgctttc ttcaggctct 120 

agggctgtgc tatccaatac agtaaccaca tgcggctgtt taaagttaag ccaattaaaa 180 

tcacataaga ttaaaaattc cttcctcagt tgcactaacc acgtttctag aggcgtcact 240 

gtatgtagtt catggctact gtactgacag cgagagcatg tccatctgtt ggacagcact 300 

attctagaga actaaactgg cttaacgagt cacagcctca gctgtgctgg gacgaccctt 360 

gtctccctgg gtaggagggg ggggaatggg ggaagggctg atgagacccc agctggggcc 420 

tgttgtctgg gacccttcct ctnctganaa gggaggcctg gtggcttaac ctgggcangt 480 

cnngtcttct ctgaccccan tggctgcngt gaaggggaac cacccttcct tgcttgacca 540 

ntggccatta nctnccntna ccacttgnaa cccanggtcc canctggctg ggaccctntt 600 

ntncccccaa ngncctttcc cttgggctnt nttggantga gcaccttctn tgtnngcacc 660 

ttttanaant gnnnnnntgn tactgatttt tttgntaaaa agannttaaa anctggnant 720 

ttntnaaaaa aaannannaa aannn 745

<210> 583 
<211> 748 
<212> DNA 
<213> Homo sapiens 

<400> 583 

gnttctaatn cttggcctac tcgcctntct ncaggatctt atcgatncna attcggcacg 60 

agatatggta tagttggaaa taggttattg tgagttattt gtagtcatgt ctttaatggc 120

ccttgcatgg tgtctaactt ctgcaataaa tgatctgcca gtcctagtgt ctgggcttta 180 

tgcaatttgt tttcctttgt ggatgaagtg ggagtaagac ttgttgctgt gaggatcaga 240 

tgaagtggct aggatatgga cacactttac ttgaattgga aaacaagcca tgtatcccta 300 

atctgcaaaa tgtggcatgt cacacgtgta atctctgagg tttagttttt gctcaagatt 360 

gcaaaggtga cttgcttgat gctttctttg cttgagcaca catctcattc attaaatggt 420 

gtctcctttt ttgcacacag gatgcagaac ataattgacc ttttccaagt ctacttagca 480 

gaaatgaaaa tggaatcata taaatacagt attatacttt aaaataaaaa ggctgtacaa 540 

aagtttggct gacatagctt gcttctagta atctgaatgg cttatttaaa taaagttgga 600 

tctatggact cttcacagnc tagatattat cctactggaa gatgtgcctc gaaagctgtt 660 

gaaccacngc aaaaaaaccc ttcagtcagc acgtgagaaa acctgcgagc ccacatttcc 720 

cccgggacca ttctgaacat cctactgg 748 

<210> 584 
<211> 773 
<212> DNA 
<213> Homo sapiens 

<400> 584 

tttaatgctt gttacacgcc ttctgcagga tttatcgatt cnaattcggc acgaggctat 60 

gtattgtgtc ctaccatgaa ttcactccat gctagccaca ttggcctgta tggctattcc 120 

ttggacacac ctaggatgtt cttgcctctt agcttgccta cctttctctc atcatttggg 180 

cctcancgag gatatcatct cctcagagaa gccttctgtg accatgctat ctaaaatact 240 

ccagcacttc agtcaccctt tatcccatta ctctgctttt tcagaaacat tggtgctccc 300 

tgaaacatat ttgtttactt gcttagtgtc ttttctcccg cactaccatg taagcttctt 360 
gagggttaag ggaccttgtt agggataacc actgtatcct tagagtgtga cacatagtag 420

gttctcaata catatttttg aaactctacc ctgatgcaaa agagatatca aataattata 480

gtttttgcat tataaatggc tttggtgaaa tccctggcac aaaactaata ataaaagaaa 540

taaacagata atgttgaagt tctgggcctg caaaacctaa ctcttttaaa gcagtcccag 600

taaatgtgtc attgggatcc ataagacttt gtgggaaagt caacataatt ttattnggga 660

aaaagcattg aaccttcaaa agtnaaaact ttatnggncc aaaatctcaa ttactggggg 720

gccgttcttt aagtcatttt aaaccctttg angccnacag ttttacacca aat 773



<210> 585 
<211> 745 
<212> DNA 
<213> Homo sapiens 

<400> 585 

ttcaatacnt ntttcnngcc ttttgcagga tcnctcgatt cgatggaaca tgagtggaag 60 

tgggcagtct ttttctttcc ctatcagctg agtgaatgaa gatttagagg gcagcagagt 120 

catgacatgg atgacgttgg gtctctggat ggctaaatgg aagacccgcc ccccaacgcc 180 

actctacccc cctgctttga actatgcttt gagaaatgag cttatgagac cactgagact 240 

tgggggctgt ttgttcagca gttcacctac acttattagg aaaggttgac ttcttgtact 300 

acgcctttcc ttaaatcatc ttttgtataa ttctcagaac actgctggtt tgggtggtct 360 

cacacatttc tcacatccaa attttaaaga tttcatgaat gttcattaca gtggatttat 420

ttttctcttt ctgcttctcg gcatgccctc tcaatttggg agaaatctct aattggatga 480 

ctttggtggg accacaggag tgtaaggatc gtaattccct cacttcatcc cctgcaaatt 540 

aaagcctggg cacttaagac tcactcaact gaatcttgat atgtgggact ttanatctta 600 

agcaaatgan gcaaaagaag gaaaagacag ttgagaaaat caatctctga agttcagcac 660 

ttgatttcca ccgtggaccg gactcctgca nctttgcatt ngccttggtt cctggccatt 720

ttccnaaccc gggttccttt ttgan 745 

<210> 586 
<211> 749 
<212> DNA 
<213> Homo sapiens 

<400> 586 

tgttctaata ctaggtntac tcgccttttg caggatctna tcgattcnaa ttcggcacga 60 

ggggtcctgg tgggagtnnc atncagcagn ganngcattc tttccncaca ncagtnaacg 120

gtcttattaa nagccaccac tttnctgang cctgtacagg ccttgnnggt tnggngaaca 180 

gaaatnncgc aggcacttgt accttcaagn anggacttgt gcctnactgn nagggttggc 240

gttgaccttg gctcnacnga catacccant ctgacttnna acgngcncgt ctnagcttac 300 

gctagactgc acnnccaagn ttgcangcct nttntgnctt ccctgcattn accaatgaca 360 

gtacgaccaa cagtcaanga aaagtgccaa gatatatcta tcccatttct tctacacctg 420 

tanattcctn actatgctca aactatgtgg ngcaangaan actggnngac atttttagtc 480 

aatgatgctg acaattaatt actggtgngg ccaggcatat nttcacggct gcttgtgatg 540 

ccaacnaaga acgggcccca gcccatcctt actcctngnc cccaaanaga tccagtgnga 600 

atgggaagct gnnannacca acccaactnn tgatttacca ccaacnccaa anatcacgca 660 

tgnnnacagc aaaacaacaa cncnatgcac ttaacaagna nccnaaaant naactcgngc 720

ctctaaaact attngggant cctttanct 749 



<210> 587 
<211> 783 
<212> DNA 
<213> Homo sapiens



<400> 587 

gttctaatnc ttggcctact cgcctntctg caggatcttn tcgaccttat tcggcacgag 60 

cccaaggcaa gctgttaaca aaatcaacct gggccaatca tcaaagggtt ggacctaagg 120 

ttgctatact caatagaaca agcattttaa ataaatttct cgtaagttgt tgctttcttt 180 

atgtggtggg tgtggcttta aagagcacaa aaccacaaca aatcaaagag tagctcgggc 240 

ttgtcttttg ctttatggct gagggtttga aggatgattc atggacttgt gaatgccagc 300 

cccagtcccg gcttaggtct atctgccaat accaccaggg ccaacaaatt cacgcaacaa 360 

attctctcat tttttacagt ttatcagttg cactcatagt tattgtcata atcactcccc 420 

acagtaacct gtaaggcata taaagtagct attttagtaa gataaatgat attttatata 480 

tgttatgata agataaatct tatcatttta agaagaaact gagctcggag agatgaaatg 540 

acttcctcag ttgctgctgt aataaaagtc tactttttgc taaaaaaaaa aaannnaaat 600 

atnntntann attnnantaa naanaaaaac ttcgagccnt tttnaaactt tnantggagt 660 

cnntttntcc cgtaaaatcc nnnactttgg atnaanannc catttngatn aagtttttgg 720 

gacaaacccc ccaacttaga aattgcnntn ggaaaaaaaa ntgcntttta ttttgnggaa 780 

aan 783 



<210> 588 
<211> 771 
<212> DNA 

<213> Homo sapiens 
<400> 588 

tcttctaatg ctggttacan gccttctgcn gatccctcga ttcgaattcg gcacgagata 60 

ctttttaaac cttttttggc agctcagatg gtgtaaattt taaaattttg tataggtatt 120 

tcataacaaa aatatgtatt tcttttttgt tattttatct tgaaaacggt acatatttta 180 

gtatttgtgc agaaaaacaa gtcctaaagt atttgttttt atttgtacca tccacttgtg 240 

ccttactgta tcctgtgtca tgtccaatca gttgtaaaca atggcatctt tgaacagtgt 300 

gatgagaata ggaatgtggt gttttaaagc agtgttgcat tttaatcagt aatctacctg 360 

gtggatttgt ttttaaccaa aaagatgaat tatcaatgat ttgtaattat atcggttgat 420 

tntttttgaa aagatgaacc aaaggatttg actgctaata ttttattcct tacacttttt 480 

tctgaataag tctctcataa tgagtgcagt gtcagactgt gcctactctg atggtatngt 540

gccatttgta aaatnanaat aagagcagaa aaaacacaaa nangagaaca ctggnttcag 600 

acattcantg gggcaagtta aattatggga ctgcaaaaat aatggatttt ttattcaaag 660 

aaaagcttta aaaagtttta ttatccanat ttacaaccca ctanttaagc taaataancc 720

tactttnaaa aatngnaaat ggttnctatc tttataangt gccaanttna n 771 



<210> 589 
<211> 844 
<212> DNA 
<213> Homo sapiens 



<400> 589 
tncactnnaa tccttntnta 
gaggccagag cctagaggag 
caactgggtt cccatggcat 
tgctttggtg ctctgcccag 
acagcttcct ccagcagcag 
gggaggaaca gccaccccag 
ctgcccagac tgaggctctt 



tgcntgatcc 


catcgattcg 


aattcggcac 


60 


cnttngccga 


agtgaagccc 


attctgcaag 


120 


tagaggaggt 


gttcagcctg 


ccaccgtcgg 


180 


gatccgaggg 


ggcctacaag 


gcggccgtgg 


240 


gggccggggg 


tggtcctggc 


ccgactcaag 


300 


acccccagaa 


cctggcaaga 


ccgcctgccc 


360 


tctgctcaat 


gaggacactg 


actgccaagg 


420 
aggagcttct gcagaccctg cggacccacc tgatcctnca cgtggcccga gcccacggct 480

actccaaggt catgactggg gacagntgca cacgcttggc tatcaagctc atgaccaacc 540

tgcnctgggt ccaaagggcc ttcctggcct gggatacngg ctttcttgga tgaaccngna 600

ccgggngaac gtnggtggtn ggtgccggnn cattgcctgg gaaccaccac ccccttnaaa 660

angaangntc gnttatttct aacaaaaccc ggnccttgnt tcntaccntn ttccctntct 720

tggnnnnttt tnaanacncn annncccaat tngnaanaac ccnaaaangg gnccctttgn 780

aaaaaaangg ggccnatatn ntntntcana cccgnggnct ttgaatnngg aaaangccnc 840

tnct 844



<210> 590 
<211> 767 
<212> DNA 

<213> Homo sapiens 
<400> 590 

tctaatgctt ggntctngcc ttttgcggat ctttcgattc gnattcggca cgagagaacg 60 

ttctcaggtt gaccagctgc tgaatatttc tttaagggag gaagaactta gtanntcatt 120 

gcagtgcatg gataacaatc ttctgcaagc ccgtgcagcc cttcagacag cttatgtgga 180 

agttcagagg ctacttatgc tcaagcagca gataactatg gagatgagtg cactgaggac 240 

ccatagaata cagattctac agggattaca agaaacatat gaaccttctg agcacccagg 300

tttggcatag aaatggtacc ccttgttcaa aatgaacaag aagccttaga tttggatggg 360 

gaacctgatc tgtccagtct agaaggattc cagtgggaag gtgtttccat ttcctcgtcc 420 

cctggcttgg caagaaagcg aagcctttct gagagcagcg tgatcatgga cagagctcct 480 

tctgtgtata gcttcttcag tgaggaangt acaggcaaag aaaatgagcc ccagcagatg 540

gtttcaccta gtaactcatt ganggctgga cagaaccaga aagcaaccat gcaccctcaa 600 

acaaggaagt nacacctcng gctggccttc ccttccgaac aggtgaaagg ggcttgaaaa 660 

atgttgctac cccaaaggcg acattnntgg caccaaatta tccctcttga ccnntttaat 720 

accttttgat tncatttngg caaaagactt tgnaccagcc nnggaga 767 

<210> 591 
<211> 765 
<212> DNA 

<213> Homo sapiens 
<400> 591 

tctttgaatc cttttgtaaa agccttttgc atgatccctc gattcgaatt cggcacgaga 60 

cttcttgttt gcctttttta taaggaaatg ttggagagtt acatcattgc taatgtagaa 120 

atgttaagtg gaaaaatata cagtttggta aaataaacta gattctacat ttatttgtgg 180 

gtttttttcc cctcctttct ttccacagca cttttgatat caagcaagtg gcttcctttt 240 

tgagatatta aaaaaaaaaa gaaaaggaaa aaagtaaatg aagcccaact acctaaccct 300 

ttcttatttg tatttgtttt agtattgtga agttgtgtta aatagtacta gctagaaata 360 

caaatttctg gttatcattt ctcttccctg tggcacttga cattttaatt gtcttaaagt 420 

ttttgaagtc atcttctggc cccttgagta ctgccagagg caaaagatgt ttgtttctta 480 

ttcattccac ttttgtctcc tgggatccct tctgtagcct aaagtatggc tgggaaatgg 540 

acttgagaag attggcttga attangatca taatcatgtg tgatcccatc atgaattcat 600 

tggaatntgg ggtncatgta angcaatcnt tctggtgtaa atcttccttt ttttaatgna 660 

catatanttt tggaaaaaat tttgaattaa ccctgaaaat ttttaaaaaa gccctcttan 720 

aactattann ggaggtcnca ttaccctaga atccanacat tnant 765 



<210> 592 
<211> 757 
<212> DNA

<213> Homo sapiens 



<400> 592 



tnttcnaana ctngttctng ncttttgcag gatcccatcg attcgccaaa tctgcctaga 60

gattgagttc acagtgtatg ttctgggggc gctggtgcag tcagcggtcc agtctccagc 120

ctgcaggcgt gcacactggg gtggacgatg ggtggccccg caggtgtaca catttgggtg 180

gccccggccc ctatacccca gtgttctctt tgatccagtc ccgaaacaga gggagccttg 240

tgtacacgcc tncaaagtgg agctgggagg tagaagggga ggacactggt ggttctactg 300

acccaactgg gggcaaaggt ttgaagacac agcctccccc gccagcccca agctgggccg 360

aggcgcgttt gtgcatatct gcctcccctg tctctaagga gcagcgggaa cggagcttcg 420

gggcctcctc agtgaaggtg gtggggctgc cggatctggg ctgtggggcc cttgggccac 480

gctcttgagg aacccaggct cggaggaccc tggaaaacag acgggtctga gactgaaatt 540

gttttaccag ctcccaaggt ggacttcant gtgtgtattt gtgtaaatga gtaaaacatt 600

ttatttcttt ttaaaaaaaa aaaaaaaaaa actcgancct ntanaactat tagtgagtcc 660

tatttacctt agatncagac atgataagaa tncattgatg aattttggac aaaccacaac 720

ttggaatgca ntgaaaaaaa atgctttatt tgtgnat 757



<210> 593 
<211> 766 
<212> DNA 
<213> Homo sapiens 

<400> 593 

tcttgaatnc tngttnntgc ctttttcgga tccctcgatt cgaattcggc acgagagaac 60 

attggtgtgt gagtgttttt tgatggtgca ggacccggag gtgctttcct tgccaagaat 120 

agaaacatcc agaatgctcc tccccatccc ccaatcccag acagcaatta tgtcagccct 180 

gtaaggcatt gcctgctctt gaccctttgg cccatctttt tatttttaaa aaattcccat 240 

gtcacagatg ccctgtctat gcagagggtg gcgtgggatg ggtgaccact aagtttaggc 300 

tggtgaaggt ggtgagccct tctgaggccc tgatagaact ttccaggagt tcatggtccg 360 

cggctccagc ttctcactgt aaagttgtca tcctggcaga ggcagccaat gcttttcatt 420 

ctagggggta gagatttatg ctaatgagtg aatattgcac cactagtgac tttctgttta 480 

aagttcagct cttagaaaat ggaatcttac ctgaccccta gtgaattatg tacataagca 540

gggaatgttt ccaactagat ctccttcaga agagtccctg tgctggaata ggtcactgaa 600 

tcttatttgg ntttgtnaaa caaaagcttt tgggtctcgt ggggtgtgtg tgtgntttgg 660 

ngtgtgttgc cccntntgcc gtttcaaata aaaggtttgg taccaccttt tcaaaaaaaa 720 

aaaatantnt anntnanant nntntancnt ttnttnncnt tanant 766 

<210> 594 
<211> 754 
<212> DNA 
<213> Homo sapiens 

<400> 594 

ttgnttagga tcccatcgat tcgaattcgg cacgagggaa ggcagtggga ggagaggacc 60 

aagtctcaaa ctccagaagc cccacctccc tgagctcagc tcctctgcca agccccctca 120 

gcgcgaagtc ctcgtccaga gaaggcaacg gcgagaaaca aatccaacat cctgggctgc 180 

tttttccttc ccccactttt taaaagtttg gtgtccaagt cacttgacaa acccagaccc 240 

taacaatgat attttgtgta gaattctggg atcaaaatat aatttcaaaa ataatatatt 300 

ttctgacatc ccccaaaaaa aaaaanaaaa aaaactcgag cctctagaac tatagtgagt 360 

cgtattacgt agatccagac atgataagat acattgatga gtttggacaa accacaacta 420 
gaatgcantg aaaaaaatgc tttatttgtg aaattttgtg atgctattgc tttatttgna 480

accattataa agctgcaata aacaagttaa caacaacaat tgcattcatt ttatgtttca 540

aggttcangg ggaggtntgg gangtttttt taattcngcn ggcgcngcnc caatgcattg 600

gggccccggt nncccanctt ttggntccct ttaagnngan gggtaaantg ncgcncttgg 660

cntaatcttt gnncatnggt tggnttnctg nggngnaaat tggttttccn ggnnanaatt 720

tccccncatn ttangatccc nggnngntnt aang 754



<210> 595 
<211> 767 
<212> DNA 
<213> Homo sapiens 

<400> 595 

ggtttaatgc tgttnnaanc cttcttnanc ctttgtacag catccctcga ttcgaattcg 60 

gcacgaggaa cgcttccatt ttatacctgt gtctagttag tttctgccta tctatccaag 120

aagcttttat caagggtcca ccatgtgcca gccactgaag tagatataaa tacaaggatg 180 

tgtaaggtat ggatgatggt atacgaactg tcatcttact ggatttgtcc gctctgttaa 240

agatacggtt ccgaaaactt tttaaagccc tagagagggc tttaaggcaa tgtagcatca 300 

tatatagagg catnaacctg ttcatatctt tctatttaac agaactgtgc acctgggcac 360 

aagggtgtgc acaacaggat gtgtacagca gcactgttaa agtgtancac atccatacta 420

cangatctta tgcaactgtt ggaaagaatg aagcgatgct gcactgtggt catgcagtga 480

tctctaagac atattaactc gaaagcaaaa ggtttaacaa tgtatnacaa actgggctgc 540

aattgactcg cgcctgtaat cccagcnctt tgggaggctt gantaaggcg gatcacctga 600 

ngtcangagt ttgagaccaa acctggccaa tgttggccna aaccnctgct tctactnaaa 660 

ctacnaaaaa ttaacctngg gcntggttgg ctccgtgcct tntaatcccn gcttactcgg 720

caatgcttga gngaangnan aattngcttt gaacctnggg gaggnng 767 

<210> 596 
<211> 743 
<212> DNA 
<213> Homo sapiens 

<400> 596 

tnttnaatnc tnttttaatn cttgctgcan gatctttcga tgatcccatc gattcnctgg 60 

tctcgaacac ctgacctcag gtgatccatt cgncttggcc tctcgaagtg ttgggattcc 12 0 

aggcgtgagc cactgcggcc agcacatttc cacttntaga tcctactcca taccacaggt 180 

ttcatttaag angaaaganc tanataaatg tgctcttntg gataccccac cctgacagan 240

tgcattgtta cacagntanc atgggttgac actgcaanct ggcctgtcag ccatnggagg 300 

ngtttannga aaggcanatn atgtnactct gtgncagggn gccatntgct tacccntnac 360 

ctagcatang gggnttctac gggtgacccc nagcatattt ctaggttact tatgggcaga 420 

tttgtaagtg acaaaactcc agctgatgct gggaatgggg agagggccct tganggactt 480 

tgtggntttg tgcttctggt ttcctggcca accccagggt cacttgtctg gagcccagct 540 

gggcactaat gtctgccanc gactatntta cagtgtataa atgattcctc tatttgggga 600 

gagatcttcc aatccagaag agcccctntt ggactgcctg ggttaaatct gcatagcana 660 

agtggttgat gagtcatctg aagaaattca gccccaactt nncaacctgc ccttcctgnt 720 

tcctttttaa tggnggcctn tgg 743 

<210> 597 
<211> 786 
<212> DNA 
<213> Homo sapiens 
<400> 597

ngtttnnncc ngtttttaat ncnttgctac tngctctttt tgcaggatcc catcgattcg 60 

aattcggcac gaggacanac cgttgagagg acgtggaggc ccnttagggg gtntgcncng 120 

nanaggcaga ngtggccctg ggaacagagt tttatgacnc ttttnaccat anangaangn 180 

gagaatttna aagatatggt gggaatgaca aaatagcagn cataactgaa gacaacatgg 240 

gtggatgtgg agtttggnac ctngggatcg ngnaaagata ccagtgatgt ggagccaact 300 

gctccgatgg aggaacccac agtggtggag gagttccant gcanccngga agaggagtat 360 

ccagcctaag ttnctgactg gatgtcaaga agaaacccaa nttataanag atgactntan 420 

ntgantggnn aaatctttca gatcanncca gaccatancn tgagtttaac atccgnaanc 480 

cacaatccan tgnnccttac taagccgtgg tgattnacaa gtcataaatc cattanatga 540 

tgtggtnaaa gatgcctatn atgaccnatt ctccatngtt ntccngaaac ccgtcaattg 600 

acatcacatn tcctnttgga gattaaattt tnggtnancn tnccttcgtc cttgggcatt 660

ngaacncata agaatgcacc cccnggntag gcccngtnna aaggttnatg aaggccntta 720 

taanttttgn nnccccaanc attaaantgg ctngattccc ttaatntttt cctcccnaac 780 

ccagnt 786 



<210> 598 
<211> 809 
<212> DNA 
<213> Homo sapiens 



<400> 598 

ngttttnnnn cnnnttttct aatgcttgct tctcgttcct ttgcaggatc ccatcgattc 60 

gaattcggca cgaggacaga ccgttgagag gacgtggagg cccgagaggg ggtatncncg 120 

gcagaggcag aggtggccct gggaacagag tttttgacgc ttttgaccag agaggaaagc 180 

gagaatttga aagatatggt gggaatgaca aaatagcagt cagaactgaa gacaacatgg 240 

gtggatgtgg agttcgaacc tgnggatcgg gtaaagatac cagtgatgtg gagccaactg 300 

caccgatgga ggaacccaca gtggtggagg agtcccaggg caccccggaa gaggagtctc 360 

cagccaaagt tcctgagttg gaggtagaag aagaaaccca agttcaagag atgactttag 420 

atgagtggaa aaatcttcaa gaacagacca gaccaaagcc tgagtttaac atccggaaac 480 

cagaatccac tgttcttcca aagccgtggt gattcacaag tcaaaataca tagatgatat 540 

ggtaaaaaga tgactatgag gaccattccc atgttttccg gaaaaccccc cattgacatc 600 

acattccaac ttggagatta aattttgggt aacccttcct ttgtncttgg gccttngaac 660 

ccntgaagga aggcaccccn ggtgaagggc ccngggggaa agggattcan ggnaaggggc 720 

cantaanaaa ccttttggga ccccccttaa nccaataaaa tttggtngaa ttgcnangga 780 

atggtttgnc cccccnaaac cccnaaant 809 

<210> 599 
<211> 759 
<212> DNA 
<213> Homo sapiens 



<400> 599 

tttntaatnc tttttcnaat gctngcttca ggannntntg cangatccct cgattcgaat 60 

tcggcacgag ccaggttagc tgctgaatca aagcttcaaa cagaagttaa agaaggaaaa 120 

gaaacttcaa gcaaattgga aaaagaaact tgtaagaaat cacaccctat tctatatgtg 180 

tcttctaaat ctactccaga gacccagtgc cctcaacagt aaagactttt ctttaataag 240 

agtacggtgc cacttgcctc aaaagttact atggtgctta agattgtctt gatctgacat 300 

atatcacctt ctgggttatt tactcattgt gccaggacct ggcattttca tgtgcctttg 360 

accaagtgtt cagaatttgc ttgactctaa cctggagagc ttcttaagtg atgccccttc 420 

atggagcttc tatgacagtg aataaactat taattgaagg aaaatgttat aattaatgta 480 
tctatttgct gcattgtata tggattaaat gataaaaaac aagtaatcta ccctcagagc 540

catgtatttg agaatgcttc aatcatattt tcctatgtac ctttttttta taaacttagt 600 

tttagactat gttgtaaaaa tggggaaagg ttgtaaacta tgtngtaaaa aatngggaaa 660 

tgtggcttta aaatatatnc attatatttg gttcaaggat tttggcaggg gntaaaggaa 720 

ncnatggttc aatctttgna tttatatacc ntgatttaa 759 

<210> 600 
<211> 769 
<212> DNA 
<213> Homo sapiens 

<400> 600 

ttttaatacn tttttnaatn cttgcttncg ntcctttgca ggatcccatc gattcgaatt 60 

cggcacgaga gcaattccac tcctagctcc acccacaggt aattgaaagc aaagacgcaa 120 

acagatgcct gtgcaccaaa gttcacggca gcatccttcg ccatagtggc agcatccgtc 180 

gtcacagcgg natcatcctt catcatagcg gcagcatccg tcgtcacagc ggcagcatcc 240 

ttcgccacag cggcagcatc tgtcgtcaca gnggcagcat ccttcgccaa agcggcagca 300

tccttcgtca tagcggcagc atcctttgcc atagcggcaa ggtggaaacc ctgtccatcc 360 

actgaggcgt gcatagacta aacatggcca gtccaggcac tggaatccag gccgtanaac 420 

ggngcccacn gtcaaaagga atgagaccct gatgcactgg gcgacacaga cgggcgacac 480 

agacttggag acatcatgct aagtgaaaag ccaggcacac ggagcggacg gggtgatcct 540 

gctcacgtga tgtgtcccga atgggcacnt tcagagggga agaanggaga tggcgcttga 600 

cngtgnccgg gacnggggtt gggagcgacc ggttgttggt ttngggtttc tttctngggt 660 

gaaggaaatg tttttgatat tggggccgtt tgggtgatnt ttgcattacc ctttgaatat 720 

gcttanaacc cnctagaaat tgnnacactt tttaaatngn ttggaaatt 769 

<210> 601 
<211> 755 
<212> DNA 
<213> Homo sapiens 

<400> 601 

ntgtttaata ctattttcta atacttgctt tcgttctntt tgcangatcc catcgattcn 60 

aattcggcac gaggagacag cagcccccag ggaatgaagc tgatgccaga gtcagacccg 120 

aggaggaaga ggagccactg atggagatgc ggctccggga tgcgcctcag cacttntatg 180 

cagcaactgc tgcagctggn cctcaagtac ctctttatcc ttggtattca gattctggcc 240

tgtgccttgg cannctncat ccttngnagg catctcatgg tctggaaagt gtttgcccct 300

aagttcatat ttgangctgt gggcttcatt gnnagcancg nnggacttnt nctgggcata 360 

gctttggtga tnagagtgga tggtgctgtn anctnctggt tcangcanct atttctggcc 420 

agcagatgta nnctatatct gtgattactg gcacttggct acagagagtg ctggataaca 480 

gtgtagcctg cctgtacagg tactggatga tctgnaanac aggctcagcn atactcttac 540 

tatcatgcaa ccaggggccg gttgacatct aagacttgnt tattctatag ttcnagganc 600 

acaatggaat atgatccctt aactcctgat ttgggatcat ctgaaggacc aaggnnggca 660 

gtcttcgaag tggaataaaa tagccccggc ngtngtgact tgcacctata ttcccagact 720 

tttgggaggc naannttnga aggattgntt gccct 755 

<210> 602 
<211> 773 
<212> DNA 
<213> Homo sapiens 
<400> 602

nttgtaatag ctggtttcta aannntngnt ttcaacccct ttgcatgatn ccatcgattc 60 

gagcaaatca agatcttcag gtacagttgg accaggcact ccagcaagcc ttggatccca 120 

atagttaagg caactctttg tttgcagagg tggaagatcg aagggcagca atggaacgtc 180 

agcttatcag tatgaaagtc aagtatcagt cactaaagaa gcaaaatgta tttaacagag 240 

aacanatgca gagaatgaag ttacaaattg ccacgttgct acagatgaaa gggtctcaaa 300 

ctgaatttga gcagcaggaa cggttgcttg ccatgttgga gcanaagaat ggtgaaataa 360 

aacatctttn aagtgaaatt ngaaatctgg anaaatttaa gaatttatat gacagnatgg 420 

aatctaagcc tttagtcgac tctggtactc tggaanataa cacctattat acagatttac 480

ttcatatgaa gctggataac tnaaaacaat agaaattgaa ngcactaaan gtgaattgtc 540 

atacaagcga aatgaaancn ttatttgana gccngcgggc ttctaacata ttgagcgata 600 

actttttgca aatgaaagat gcccttcngc tttntgaatt gnaaaatatt gaaacctgan 660 

agntnanctt agntgaattg aaacttaaat ttgaaccctg nacnanaccg gttaantgcc 720 

tgttcctgat aaaaanaagc cntnangtgc ttncctgntn gatttanccc ccg 773 



<210> 603 

<211> 784 

<212> DNA 

<213> Homo sapiens 



<400> 603 

tgctttntaa tagctgtttt taaatncntn gctttgcgct cnntttgcag gcatcccatc 60 

gattcgaatt cggcacgagg gggacatcag tgatcgtaag tctcctgggn ccgttattct 120 

canattaggt gacggagcta agacttcgag accatctcgt cctttntgta tcgcggaaac 180 

ctgangaacg agccggcggc ggtgacctgc acgagaagcc aggctaactg ggtgaagtac 240 

catgcaagca tttcttaaag gtacatccat cagnactaaa cccccgctga ccaaggatcg 300 

aggagtagct gccagtgcng gaagtagcgg agagaacaag aaagccaaac ccgttccctg 360 

ggtggaaaaa tatcgcccaa aatgtgtgga tgaagttgct ttccaggaan aagtggttgc 420 

antgcttgaa aaaatcttta gaaggnngca natcttccta atctcttgtc ttacggacca 480 

cctggaactg gaaaaacntc cactattttg gcagcaaact tgagaactct ttgggcctga 540 

acttttccga ttaagaattc ttgagttaaa tgcatctgat gaacctggaa tacaanttag 600 

nttcganaag aaagtgaaaa atttttgctc aattaanctn gtgtcaagga aaatngnttc 660 

anatgggaaa gccgttttcc ncctttttaa gantgggaat tcttngatga ngncnaattc 720 

ntnttganct taactgnntt angcagcttt taaaaaanta ccattggata aangagtccn 780 

aant 784 



<210> 604 

<211> 801 

<212> DNA 

<213> Homo sapiens 



<400> 604 
gttncnnctn aacccttttt tgaaatcnnt 
atncggcccc gtgtggggag acngacagca 
tatagcgcct cccctctccc ctcagagggc 
ggctctcgtg ctctaggacc tgcacgggac 
acttgggcca gagcaaanga aaacctcttg 
nggaggagtt ttagggcttg gggagaggga 



ngcttctact ctttggcatn catnccatcg 60

ccctttttnt ctggcatttg cccttgangc 120

acagctgcag gcctgaccaa ggccacgccc 180

ttgtggatgg gcctggactc tccagaaact 240

ttttaaaaaa attttnttca nagtgttttg 300

ggacacatnt ggaggaaatg gccttctttt 360

gcctggcaag cccaatatca cttgtttggg 420

cccttctgga agggtgtgng catgtnaant 480

atctatnaaa taagtatana tgggngtnta 540
ntatatgtgt ntaaaataaa ngantggaca tatttggncc tctgngnana nncttngaat 600

ctaagncaag agtnnntctn gaaaaacnaa ananagtnct ntntanannt ttacgtaatn 660 

atcaatactn tntccacntn accctnctnn tanntntncc natatantcg antaattcnt 720 

cactcntnna ttcctngtna acacnaatna atnnaactat naaatatntn tnctnnntan 780 

tngacatann catncnnncc g 801 

<210> 605 
<211> 759 
<212> DNA 
<213> Homo sapiens 



<400> 605 

gnttctaatg tggttcnaan acttgctttn gctcctttgc aggatcccat cgattcgaat 60 

tcggcacgag agcctcgcct gggccggcct gtggctccca ttttcctttc agcgggacaa 120 

aggggacttg ttaccaggcc attttctgga tggcctgtga gatctctgcc cctccaagac 180 

cctccaagtc tgagcctgac ccacagctgg gacactgaat tcagccctgg gaaccatggg 240 

ggcttctatc tggcaccagg ctgcagcctc cccaatccca gcccactttg ctgtgtctct 300 

ggcgggctgt cctccttggt gggagctgtc ctgcacactg taggatgctt aaaggtatcc 360 

ctggccttca cccatnccta gccagcagct cccagtcaga caacagccag aaatgtctcc 420 

agactctgcc cagcctccca ggtagccacc ctcgagacat gacctcagag tctctgtgtc 480 

tcctagaagc ctgacagaga cccccanggc agtgggtggg tggcgggcta gagacccttg 540 

cctgtgtccg ggaccctggc gccgntcttc cctcctgtgg atcccttcgc acttacaagt 600 

gttctnaant gggcagacgc ctgggcaccc cttgggccct gcccaancat ggccatngng 660 

cangcttttt naacccgcat nggntttcca ngcctggtga atcttgcttt tccanggacn 720 

nnttggaacc tttcctncgg ggcggggccc ccnagcnct 759 



<210> 606 
<211> 809 
<212> DNA 
<213> Homo sapiens 



<400> 606 

tctncgtnaa tcnnnnnttt aaaagccttt gcttttgctc nctttgcttg atcccatcga 60 

ttcgtgactt tgtacctggt ccaagctgat ggggttttgc tgctgttgac ccaggcagga 120 

gtctgactag agaacaaact aaggttgctg caacaaacaa ggacctcttc caagaagggc 180 

tcccaggcct ggcgcagtga ctcatgcctg tgatcccagc acttgggagg ccnaggcggg 240 

tggatcattt gaggccagga gttcgagacc agcttggcca acatgatgag accccgtctc 300 

tattaaaaat acaaaaatta nccaggcgtg gtggcgcctg tagtcccaac tactcaggag 360 

gttgaggcag gagaattgtt gaacccggga ggcggangtt gcaatgagcc aanatagcac 420 

cactgcactg catccttggg tgacagaagc gagactccat cttaaaagaa gggctcctgt 480 

gtctacgtca tggtggggct anagagangt cccngcagct gggctgtgtt gagtganngg 540 

ctnntctttt naannccagg caatagtttg tcttgactct gtccttttct gngtccacat 600 

gacattttac atntttncnn agtttnctta atttaaagtt gnctaatttt accattatac 660 

attttnaatt ggcatttctt ttaccnatnc tttttgtntg aaaatggtan tntttgaaat 720 

cngnatcngt tctaatgngn tntattttna ccnaatgcca atnntacctn ctttgnaana 780 

atntattcgt tttcnaagnt tnaacctct 809 



<210> 607 
<211> 788 
<212> DNA 
<213> Homo sapiens 
<400> 607

tntttctaat acnagtttnc aagncttgct ttnnnatccc tttgcaggat cccatcgatt 60 

cgcaaggccc gaggtgccat cccctctggg aagcagaagc ctggtggcac ccagagtggg 120 

tactgttcgg taaagagctc accctctcac agcaccacca gcggcgagac agaccccacc 180 

accatcttcc cctgcaagga gtgtggcaaa gtcttcttca agatcaaaag ccgaaatgca 240 

cacatgaaaa ctcacaggca gcaggaggaa caacagaggc aaaaggctca gaaggcggct 300 

tttgcagctg agatggcagc cacgattgag aggactacgg ggcccgtggg ggcgccgggg 360 

ctgctgcccc tggaccagct gagtctgatc aaacccatca aggatgtgga catcctcgac 420 

gacgacgtcg tccaacantt gggaggtgtc atggaagang ctgaanttgt ggacaccgat 480 

cttctcttgg atgatcaaga ttcantcttg cttcatggtg acgcagaact ataaagccct 540 

gtgtncactt atagacagtg aaaacccacg ggtcttcatc tttattaatc nngaaacctt 600 

ggaatgcctg ctttgttttg taaccccttt ttaaaaccta cctgttttta aaaagtggtc 660 

atttttantt nacgntttan aaanaaaaan tcctatttct ttttcctttt nattttaaaa 720 

aaaaattngn tttttgttgg ggggntttgg ggggaattaa aataatttgg cccccaactt 780 

taaaaaat 788 



<210> 608 

<211> 796 

<212> DNA 

<213> Homo sapiens 



<400> 608 

tcttttaatg ctttttncaa gccttgtttn aaatcctttg caggatccca tcgattcgaa 60 

ttcggcacga gactaccccg gctacggttc ccccatgcct ggcagcttgg ccatgggccc 120 

ggtcacgaac aaaacgggcc tggacgcctc gcccctggcc gcagatacct cctactacca 180 

gggggtgtac tcccggccca ttatgaactc ctcttaagaa gacgacggct tcaggcccgg 240 

ctaactctgg caccccggat cgaggacaag tgagagagca agtgggggtc gagactttgg 300 

ggagacggtg ttgcaagaga cgcaagggag aagaaatcat aacaccccca cccnaacacc 360 

nncaagacag cagtcttctt cacccgctgc agccgttncg ttccaaacag agggccacac 420 

agaatacccc acgtttttat ataaggagga aaaccggnaa aanaatttaa aagttaaaaa 480 

aatanccttt cngttttaca ctactgntgt agactcctgn tttcttcaan cacctgnaga 540 

ttcttgattt ttttgttgtt gatgntctct ccattgcttg tngtttgcnt gggaantttt 600 

atttaaaaaa aaaaaaaatt cttgtgagtn gactttggnt tttaaaccan tgntagattt 660 

taacngnacc cttaatgggt tgtacntata tgntttnaaa acatgnnaan aaatatttaa 720 

tgtaaaggnn ctgttnntaa atntaaccac ntanagaant tnnaaannnn ttnanccctt 780 

tagaacnatt nntgng 796 



<210> 609 
<211> 790 
<212> DNA 

<213> Homo sapiens 



<400> 609 
gnnntttaaa nacctntttc aatncttggt ttnnaatcnt tttgcaggat cccatcgatt 60

cgcatccagg gagaacctcg gggctgggac acctcctggc cctcaccctg ggtcatgttt 120

acagtcctca gtgccccaca ccggtggccc cctgaggaca cctccaccct gaccttgatt 180

ttcccaaacg ctgcctcttg gtgacagact cagcccaaaa ccccttcctt ctgtctctgg 240

agacccttga gcttggggaa atatggaggg Sftgtgtgtct gcaatcaagg cctctgcagc 300

tcacggctgg cccggtgggc tgggacttcc gtctgaattt taaatactta gggntcattt 360

tttttctctg gcaacaaagc ttgatgtttt cactgcttta gtttcctgtt tgctggtggg 420

aggggatacg gtctgtgact ctggacttgc tctgggggaa cagttgtcac tgcccccngg 480
gagaggggca gcttgggctt ggaagaaagc acaccccnga gaccagagcc ccttcnagag 540

ggatncttgg ctgcttcatt gnctttcccc cagcaagccc tgctcttcca caagcncctt 600 

ntggggtctt gggtatggtc ccccgntcac cttctttcca nantccctga nntggtgtag 660 

ggttgtgggt tggcacangg aattttgggg cattggggaa ggggntttca aaacttttnc 720 

caaanacccc cgtgttcctn ngnaaaattn aanttggtgg gcttngggtg ntnaccccca 780 

antcttngnc 790 

<210> 610 
<211> 786 
<212> DNA 

<213> Homo sapiens 
<400> 610 

gatgtttnnn annctggttc taatncttgg aaanctncnn ctttgttann ngcnntttct 60 

gcaggatccc atcgattcga attcggcacg agcccagctg gacctggtgg ccctttccta 120 

gtgcctctgc tgggggagga gaacctctgt ccacgtggag gctaggaggt ctcaggtgct 180 

gccctggcag caccagagtg tgggccgggc ccgagtgtct gcccctcggc cctcagggtg 240 

gggcacttag cacccagaag ggaccaaaag cagggcatgg cggtgcagag gagtttggga 300 

ggtgtaaaca gccccatgca cgtggaggag gagctggctt tcagccccag accccacgct 360 

agcactttcc acgctgcttg cccgctgttg atgtgcagtt cccagtgcct gtgtgagccg 420 

acatctgctc agtcctatcc ctcgtcagcg tgtggagacc cagctcctgc aagcccttct 480 

gcttccacgc ccccagacag cttggtggag ggtcctgcat ctgggccaag ctggggtgca 540 

cccagccaaa gacaaagctg ccttcacgtg cccaaaggat tcaagatggt gcactggccc 600 

cgggaggagt cttgaccaaa aatgggagcc cgctcttgtg gggaaanccc cgacttcccc 660 

caccnanaaa ccgntcccac ggtgccggan cttccccctt ttcctttgtg ggggcaacaa 720 

nattggcctt gggcnctttc aattnttncg gaagctttcc tgggtgtngg cttttgacct 780 

taaaat 786 

<210> 611 
<211> 938 
<212> DNA 

<213> Homo sapiens 
<400> 611 

tgttttaaag ccctntttng aatncntggc ttncgncccc ttggcaagat cnctctctgc 60 

aggatcccat cgattcgttt gtatttttag tagagacagg gtttcttcat gttggtcagg 120

ctggtctnaa actcctaacc tcgtgatccg cctgcctcga cctcccaaag tgctgggatt 180 

acaggcatga gccaccatgc ccagccaaag atcatttttt tatatagact tcagnccttt 240

gtaaatattg taactgggga gtatagagta gaaaaaaagt atagntaaaa catttgttct 300 

acaaattaac ctttaaaaat ataattactg ctaaaaatag agtgctgtta cacttaagga 360 

aaattagtgc cattttggaa atgagatctt gtgccataaa tncagctgaa ctgaatataa 420 

atgttcacaa attaatgctg tnaaaggaat gagttaagca gaaaaacttt taaccagcac 480 

ttctcaaaaa anaaaannna nnaattaaat nntataancn ncatnnanat ntatnntann 540 

tttncntctn nattncanta attttgtntt ncaaatantt nnacctnnan ctntgttntn 600 

nttnnnncna tnnantatcn ntttatcnan tatatnatta nctnattntn nngnanngna 660 

tcntnctcta tncnnnatnn tncatatnnc gtccnntnnn nnaantatgc ctcatnatat 720 

ntacnnnaaa ngtntangta tgnttantgc atnnncatna ctnntgatgt cnnagtnnna 780 

natattttgc cnctcattat tntgctnatn tatntgtttg acacannata ctnnnancna 840 

ttcatcttct cgcaatnngn gnacttttna nttacnnnna tgntannnnt natatatnta 900 

tcattagana ccttttnaat tntnnntncn nanacgcg 938 
<210> 612
<211> 771 
<212> DNA 
<213> Homo sapiens 



<400> 612 

tgtttgnaan nncggntntt gaaatncntg gtacnnaaac nctttngnaa ancncccctc 60 

nctgtntgat cccatcgatt cgaattcggc acgagataga aactaggcac tgatttgttt 120 

atatttntcc tgctcgagac acatgatgtt tcatgtatct gtggcttttt atagtttaaa 180 

ataatttctg gaaaagtcat agtcattatc tctttaaccg ctccctctct tccattctct 240 

ttgttctctc ttcctcgaac tcctgttagt catttgatcc tccatatctc tgaatatttt 300 

tgtatttctt ttattattta tttcttgtct ctgctacatt ttacattgag taaaagtggg 360 

atgtgacagt gggaaatcat tagtgactta gaaattccag ttggtcattg ggccaatttt 420

gatgctacct tctctctttt atttctcact tcaaaataaa atttgcaaaa acaaaaaatt 480 

aaatatagta tgagtccagt tactggccta aggagctaaa agcattctgg gtttgtatga 540 

agacagctga gttataacaa atgagagtac tgttgtgtga ctgcattaat tattcccttt 600 

ttaaatgtac aagagcaang cattctacct gactgngtta ttgagctctg cancatacat 660 

ggtgacanag ctaaaacaan acaagccnaa ccnanaagga aaaccccagc tttagggata 720 

ctctgntcat ngaatatagc ctgaaaaatg gntaatcaag aaagtnaacn t 771 



<210> 613 
<211> 774 
<212> DNA 
<213> Homo sapiens 



<400> 613 

tttgaatcct tgctttcaaa tncttggcac tngccctctc tgnaggaatc ccatcgattc 60 

gaattcggca cgaggtaacg tgacacgtat tttacttctt ttantaggcg gacacacttt 120 

cttaaagtaa taatacgtca tggccctgct ataaggtagt agttctagaa gactgtntat 180 

ctaataattc agactaaagc tatttatatt gctgtgacac cacgtggaaa acttttataa 240 

ttccatctta tttctgatgt atatgtttta ttttctctgc cttcataaga actaaaaacc 300

aaagttattt acgtgaaaac aagatttttg tttgagttca tttacttgag atatgtttaa 360 

aaaatccacc ttctgtcaca ctatagaagt atattttgaa ttatcaaaag gtagaattat 420

aactttcana aaagaaaaaa atggtcaatt tantttaact ctatgtcaaa aatttattta 480 

tagtctcata tattcattcc acaccccccg ttcttctttc cttctttctc cctctgcctt 540 

nttcttaatn atnattttta aattctgacc aaaaataaag tngtggcaag tactttctta 600 

gcataacctg gactggttga agnagtaatt ctgntccttt aaaaaaantc cccaactggg 660 

ncccnggnca ggnacaaaaa nttntaanga acatntggga attangcnaa atggatnttc 720 

cttggaggtc caacccccaa aaatcattag gncnaccaaa attnaaaata atcg 774 



<210> 614 
<211> 754 
<212> DNA 
<213> Homo sapiens 

<400> 614 

ttggantctt ctcngaaacn cttngcnatt gcnctntctg naggatccca tcgattcgaa 60 

ttcggcacga ggttcttcaa agccaaccaa gacaggcttn tnagttttag agcttcagaa 120 

caaattgcca aaagccagag ttgtttatgc tagtgcaact ggtgcttctg aaccacgcaa 180 

catggcctat atgaaccgtc ttggcatatg gggtgagggt actccattta gagaattcag 240 

tgattttatt caagcagtag aacggagagg agttggtgcc atggaaatag ttgctatgga 300

tatgaagctt agaggaatgt acattgctcg acaactgagc tttactggag tgaccttcaa 360

aattgaggaa gttcttcttt ctcagagcta cgttaaaatg tataacaaag ctgtcaagct 420 

gtgggtcatt gccagagagc ggtttcagca agctgcagat ctgattgatg ctgagcaacg 480 

aatgaagaag tccatgtggg gtcagttctg gtctgctnac cagaggttct tcaaatctta 540 

tgcatagcaa tccaaagtta aaagggtttg tgccactagc tcgagaggaa atcaangaat 600 

ggaaaaatgt gttgtaattg gtctgcantc tacaaggaga agctangaac atttagaaag 660 

ctttggaaag aaggccggng ggagaaattg aatgattttt ggtttcaact nccaaaaggt 720 

gtgttgcnct cccttctttg aaaaaacatt ttct 754 

<210> 615 
<211> 774 
<212> DNA 

<213> Homo sapiens 
<400> 615 

tgtttnaatg ctgttttgaa atcttgtttc aaatcctttg gctacttgct ctntctgnan 60 

gatcccatcg attcgaattc ggcacgaggg attctttcac tgagcacaaa gagttgttgg 120 

ggctttagca tctgactgat tttgttacgg ggttgattct gaccatagga agtatgcaat 180 

gtgaatcact atttacagag aaacctacaa cagatgcttg atgttgtaga aactgggaca 240 

tatagatacc aagcaaaatt ataagaaacc tataaggtgt tcaatacgct tgtgtttcca 300 

aaattcactg tncatgatca gtttggtgtt cttgtaccac agtttttaac tgaaggaacc 360 

agttgtaaca gtctcaattt ttaactaaaa cttgaagaac taanacaaca atgcaaacct 420 

ttcagcattg tttggccaaa cttgttaaaa ctgtaatgca agaaccaaat gcactgtgat 480 

gtggcaccaa ctaattagca agcatgaatt tttcacccaa nagtgaaaaa aggaaaatct 540 

accatggctt naagtttaag agcagaactt cctgactncc attctatgac tgatcaaaaa 600 

nactaatagt ttaaaacctn agcangcctt gttcacgata tgcngaaaaa aaaagtgctt 660 

gcagtttann atccttatgg aantttttca cantgtnaca nggtnttgta atacnttgga 720 

ngccctacat tttcntanga atntattttn cttggcctaa nttggnntca angc 774 

<210> 616 
<211> 769 
<212> DNA 

<213> Homo sapiens 
<400> 616 

atnncntttt tgnaatcctc tctgaaatcc tttgctactt gctctttntg caggatccca 60 

tcgattcggc cagtcctcac cttccctagt cctcgtgtgt attttaggag atgcgtgggt 120 

gtggaacagc ctcctgcctc cggtccaggt gtactggggt ctgtgtgttg tgtttctgcg 180 

tgttctcggc agaaagtggc atgctgtccc gcctgggtga tttgctcttt tacactattg 240 

ctgaaggaca ggaacgaatc cctatccaca agttcaccac tgcactaaag gccactggac 300 

tgcagacatc agatcctcgg ctccgagact gcatgagcga gatgcaccgc gtggtccaag 360 

agtccagtag tggtggcctc ttggaccgag atctcttccg aaagtgtgtg agcagcaaca 420

ttgtgctcct gacccaagca ttccgaaaga agtttgtcat tcctgatttt gaggagttca 480

cgggccatgt ggatcgcatc tttgaggatg tcaaanagct tactggaggc aaagtggcan 540 

cctacatccc cttnctggcc aagtcaaacc cagacctgtn gggtgtctnc ctgtgcactg 600 

gtggatngtc aanngcactc ttgtgggcca cacaanagat tccctttttg cctgcaanac 660 

cntgtntgaa acccccttaa cttatngccn atttncntna agcaaccctt aggcnanttg 720 

actnncnttc acaanttttt ggggcnaaag anncnaattg gcctgccct 769 

<210> 617 
<211> 766 
<212> DNA

<213> Homo sapiens 

<400> 617 

aganntcttc ctttctaatn nctngctacn ttctctntct gcaggnatcc catcgattcg 60 

cttcctcaaa gcatggttgc tgagnaccca nagttgcgag gngttttttt actgatttag 120 

ccaggtggca atcatgagtg aatggatgaa gaaaggcccc ttagaatggc aagattacat 180 

ttacaaagag gtccgagtga cagccagtga gaagaatgag tataaaggat gggttttaac 240 

tacagaccca gtctctgcca atattgtcct tgtgaacttc cttgaagatg gcagcatgtc 300 

tgtgaccgga attatgggac atgctgtgca nactgttgaa actatgaatg aaggggacca 360 

tagagtgagg gagaagctga tgcatttgtt cacgtctgga gactgcaaag catacagccc 420

agaggatctg gaagagagaa agaacagcct aaagaaatgg cttgagaaga accacatccc 480 

catcactgaa cagggagacg ctccaaagac tctctgtgtn gctggggtnc tgactataga 540 

cccaccatat gggtccacaa naantgcagc atctctaatg aganttattc ttgcccttng 600 

ttcaangatc ttattgaaag gacatcttac agcttttccc aatgagaang cccangaagt 660 

gttaaacata ctgnnttgaa aaaagcactn tatnttntcc cntnttaana tggtntctaa 720 

aatgtanaaa naaannaaaa naaaanctcg atccctctnn aacnct 766 

<210> 618 
<211> 762 
<212> DNA 
<213> Homo sapiens 

<400> 618 

tttnnagnnt cttcctttct aatggcttgg ctactngttc tttntgcagg atcccatcga 60 

ttcgctcagt gcagcgatca tggctcagtg cagcctcaaa ctcttgggct caagcagtgc 120 

tccaacctca gcctcctgag tagctaggac tataggcaca cagcaccatg ccccggctat 180 

ttttttattt tgtagagatg gggtctcact atgttgccca ggctagtctt gaactcctgg 240 

cctcaagcaa tcctcccacc tcggcctccc aaagtgctgg gattaaaggc gtgagccacc 300 

gtacctggcc cttggtggaa tctttagggt tttctattca tacatataaa atcatatcat 360 

tggcaaacag agataatttt acttcctcct ttccaatttg gatgccttag atttcttttc 420 

cttgcctaac tgctctgtct agaactccca gcactatgct gaatagagtg gcaagagcag 480 

gcatttgcct tgttcctaac cttagagaaa aatccttcag ccttttacca ttgaggatga 540

tgtttgctgt tagtttttca taaatgatct atatcaggct tgaataaatt tctatttcta 600 

aaaanaaaaa atataacnnn ntanttnatn aantnnttaa naaaanaaaa actggnacct 660 

ntaaaactta tagtngagtc gtttnaccgt anatcccana ntttgataan gatacattgg 720 

atnanttttg gacaanccnc aactaggaat ngcnntgnaa at 762 

<210> 619 
<211> 754 
<212> DNA 
<213> Homo sapiens 

<400> 619 

tttggagntc tttctttcta atncttggct actngntctt tntgcaggat cccatcgatt 60 

cgaattcggc acgagcggac ccatcggagc gtaacctgga tctccgcagg cctggcggag 120 

gccggccacc tggaggggca ttgcttggtt cgcgtggtag cagaggagct tgagaatgtt 180 

cgcatcttac cacatacagt tctttacatg gctgattcag aaactttcat tagtctggaa 240 

gagtgtcgtg gccataagag agcaaggaaa agaactagta tggaaacagc acttgccctt 300 

gagaagctat tccccaaaca atgccaagtc cttgggattg tgaccccagg aattgtagtg 360 

actccaatgg gatcaggtag caatcgacct catgaaatag aaattggaga atctggtttt 420 
gctttattat tccctcaaat tgaaggaatn aaaatacaac cctttcattt tattaaggat 480

ccaaagaatt taacattaga aagacatcaa cttcactgaa gtaggtcttt tagataaccc 540

ctgaacttcg tgtggtccct tgtctttggn tataaatgct gtaaggtggn agccantaat 600 

tntctgcaan aagtangnca gcacttttca gtgatttgaa tatcatcttg gcttngangc 660 

cangtggaca accttgtcat aactgacttc tgaaaagaac cctntngata tttgatgcct 720 

cnggtgtngg tggaactgtc atttantngg anna 754 

<210> 620 
<211> 767 
<212> DNA 
<213> Homo sapiens 



<400> 620 

gcgttctttg aaagccctnt tttgaaaggc ttgcttctaa ttacgggaaa cctttgcaac 60 

tgcagatccc atcgattcga attcggcacg aggacccagg tagaccagct caagagttca 120

tgttctttgt natcctcctg tgagctctct gtaagtcnnt ttcttgccca tcaccacatc 180 

cctagtactg ggtatcagtc tggccacttg gctttctggt ttgccccaat gtggtctatt 240 

cttgatgcag ctaccaaagt aatgttttaa aaccattata ccaagttact atccttgtca 300 

aaacccccag taactgccaa tctcacttag aataaaatcc ggactcctgt gaagcacagc 360 

ataaactggc cactgcctat gcagcaacct catctttacc gnttcctgcc ttgctcactc 420 

ccttccagcg ccgttattct tcctgatgcc cctagtacac aacaactcct tcctgctcca 480 

agagtaggaa aattactggt ctctctgcca gngagaancc tcttctggna ttacctttgc 540 

ttcattgcng aatcttctnc aatatcatct tctaaaaaga gccttttaaa aatcaccttt 600 

nctatnatgc cctactcatt tccagtccct gaaanggcca ttcccacttn antannactt 660 

attgctaacn tgaaatacac taaatgnnan ccttcatgaa nggtanggca anttaaatgc 720 

nttngcactg gnnaggcnaa gagaacaagc ancntggntt canaagn 767 

<210> 621 
<211> 828 
<212> DNA 
<213> Homo sapiens 



<400> 621 

tttctaatag cttgctttct aatnctnggn aacgctnggt ctctgnagga tccctcgatt 60 

cgaattcggc ncgaggggtg acagagtgaa actcgtatct ccaancaaac aaacaaaaag 120 

tncttaaaca tatgtgaaca aaaatttngt gatggaagga ttctagttaa tgagtattgc 180 

atcaagattt acatctttct tactaaggaa aagagttaat aaaaatngnt ctttatttta 240 

caggcagnta ctgaggctct tcccanntcn cagtanacag ccactcagcc ttgaaaatgg 300 

agtgttgttg tttctaaaca tatatttatg tcatttattn aggtacagtt cacttaaata 360 

accataagtn gantctctct tgtnagtgat ttgggtagga agaggccatg tctanagttc 420

natttctctg ttgggtccna ntgaaattgg accttttnag ttgttcanaa aaatnaanat 480

aaattnctca tattaaatca agannctcnt caanttatag atgtggggta gggttccnng 540

taaaacccat tatnaatcta gaaaattatc nctatngana angcntttaa tatctnttac 600 

cntgaaattc attactttag tncaaggcct acctttaaan gtttnnncnn gaaccatttt 660 

tannnnttcn ncttttgnnc caananntca ttttaancca ccaaaantcn caattnttnt 720 

tncattnnaa tannggatgn naattatnnn atcnatgtgt catatttnac canganaata 780 

ctgngctncn tgnaataatn ggtacactaa anncnngann tttnntcn 828 



<210> 622 
<211> 784 
<212> DNA 
<213> Homo sapiens



<400> 622 



gtctttgaaa cctttttcta atncttgctt tctaatnctt ggcnactcnn ctctcnctgc 60

agnncccatc gattcgtttg ctttcagtgg ttggctttca ctgaaagaaa gtgtaaanaa 120

agtcagaatt tatagctttc actatgtcca agactaggac tgggttataa agattttctt 180

ttgtgaagga aaataaaaga aaatttgcca ctactgcatt tactttacta ttgtaaactt 240

aagattcatt ccttagtctt tggaattttg atgtctcaaa accagatgag tggaagtgct 300

gaatttgcaa aataaagcta agaatgctta actctgcact ttaagttcta ctctgaccaa 360

attgaagatg agcagagcag ccctgaacag catttngttt atacagtctt gtttaagaat 420

agaatttttt taactcttca tttnttgtct ctgtggaagc tgtgtaactc tttttaaaat 480

gcaatttaaa acattntggt attctaacaa ttctctcaan aaacagcatt tccaatggna 540

atnggtattg ntacgctgta ccttatgtat tncctgtacc tgaacacttg atgctgcctn 600

acangaaaat agaactttat gttaaaaaat aaaagtctgg tncttctttg naaaaacaac 660

nncnctnctn ctcnaaatcc ncnacannnc tnnnaatntn ctaanntnag tctnnnttnn 720

ngcanncttn tnnncccnct nanctccctn tntcntnttc atatctanan tnacanccct 780

ccct 784



<210> 623 
<211> 1164 
<212> DNA 

<213> Homo sapiens 
<400> 623 

gggacttntt angccntttt cgaaatccnt tncttccnaa tcccttngca actntcnnct 60 

ntctgcanga tcccatcgat tcgaattcgg cacgnagnga gcnnattcnc gttttnagng 120 

ttctntttct ntnatnnaca ngngaaantt ccaggnnatc ntgnnnccnt atctgantna 180 

ngctngnttn aacntngnna caccnngnct nnnaancaaa tttnanaaaa gggnancncn 240 

nanancatnn nanntnncca atctaccaaa atcanaacac ncantgaaca acacananna 300 

tnnnatacnn tctacnccaa ancnncncat nncacgcacg ataanacanc nnnnaaaaan 360 

ancnaancan atatcanann caaccntana cnannaatca nacnctnanc tcccncacag 420 

canngngacn aanaacnanc antgataaan cncacctnnn tannacacac ctnannancc 480 

nntntantcc cgaataacca atngccacnn ctannccnat aacanantcn ctnanccctc 540 

ntgcatcaaa ttantaaatt cncnancata aagnanatca cagcctcntt cnaccnntga 600 

tcnaanctnt anaccnangn nanncnntat naaacnctat ancantnnna ctnnaacntt 660 

nnatcngcnc ntanaaatta aanatcnaan actcaatatn ncggaatant nncntctcta 720 

nataannnta naacggngna aanacncctc anacataann gncntacnna tcgatctatc 780 

anntnancat aaagtcaccc gcatattnac cnacgnncaa cataannnaa atnctactct 840

cagaccatat aaatntcgcn tccntanatc agngcnanan tacaaanacg tcgcnnnngt 900 

ntggaccaca cgncntagat aaacacnnat aaacantttt tanatgtaac acatttcnna 960 

tctatnaaat ancatcattn atgnanacga tnacaacaaa nnctacncna tgntactaaa 1020 

nacaantaaa nntnanatta aaaaagttgc aannatncng ngaaanntcc cnanaaacan 1080 

tanatncnta tttannnntn acnncggngt nnccntaaaa anaactctnn nntnnctggn 1140 

ttgtanatnt annncnanct cgcg 1164 

<210> 624 
<211> 798 
<212> DNA 

<213> Homo sapiens 
<400> 624 
ttgttaagcc tnttttcnaa ntccttcctt tnaaatcttt tgnaaacctt ggtanttgca 60

ggnatcccat cgattcgagt aaagcatcct gcctcagaat gactttccta tcatgcttta 120 

tgtgtcattc caaggtttct tcatgagtca ttccaagttt tctagtccat accacagtgc 180 

cttgcaaaaa acaccacatg aataaagcaa taaaatttga ttgttaagat acagtagtgg 240 

accctactta ttcagtcaat taagagtaag tttttttatg tggttattaa aacagtatga 300 

acaattagtc taactctgca tagacagggt ctagattttg ttaacccaaa tgtataactg 360 

cagttagctt aaattacaat ttgaagtctt gtggnttnta tatagctngg cactttatta 420 

ctcttttgaa ctgaaagcac actcccttat aggttcatgt aactgtcctg taataaggtg 480 

cttataaatg ggaacaacta cacagcctag ttttgncaca acctttagca tctaaaaaag 540 

ttttaaaagc ttcttaaatg nctaatataa anggagatgc tnatanccac aacatctatt 600 

ttaccaatat tngtttcctt acacttacct tgggantttg cattgagtga ngttttngta 660 

aaccccaaan atncccatta atanaaaaaa nttggtacgt tttnatgact ttaatccann 720 

ttncttgtng gnnttcncct aaaangcttn ccnnnggnnt ggaantnnna ntnatttntg 780 

gggnaaggtt tnngttnt 798 



<210> 625 

<211> 793 

<212> DNA 

<213> Homo sapiens 



<400> 625 

ttcttaagcc ncttttctaa tgcttgcttt naaatctttt gnaancgctc ggctntntgc 60 

aggatcccat ccgattcgaa ttcggcacga ggaaatgcct ctatgtangt gaagtgttct 120 

ctctgcatgc aacagtaaaa attaatataa tattttcccc acaaaagaaa cacttaacag 180 

aggcaagtgc aatttataaa tttatatcta aaggggaatc atgattataa gtccttcagc 240 

ccttggactc taaattgagg ggattaaaaa gaatttaaaa taattttgaa cgaatttatt 300 

ttcccctcag tttttgaggg cattaaaaag gcattaaatc aagacaaatc atgtgcttga 360 

gaaaaataaa attaatgaaa acacagcact tatgttggtt tagctgcagc ctccttggag 420 

gtagaattta tttatttaaa attactggtt gcatcaagaa ccccataggg tgtacaaaag 480

gttctataaa atctgcatta tagagacaaa gangcaggca aatncatgtc acaagggtna 540 

agcttacagt ttacaaactg gggaacgccc agggtgtang atttnaaaaa cgncactctt 600 

gagaaaacan atgtaatcan ggntgctgaa aactttgcat ggnggctttn aagacattta 660 

gnccttgttc aaaccaaaat ttnttggnat ttgccagatt ccttantntt gccatgggcc 720 

atgacaccat ttttggcctt tatgncnctt taaaattttn aattaaaaat accntttcca 780 

gtaannctaa ttn 793 



<210> 626 
<211> 825 
<212> DNA 
<213> Homo sapiens 



<400> 626 
ntttgaatnc ctttgnaaat ccttntttct aatntntgga tccttggcna ctcgctntnt 60

ctgnangatc ccatcgattc gaaacggcnc taggaatcat cgaaggttga gaccgtgacn 120

anttacatag tgatnaatac ccatctatgt actgnngcct nctaaatgtn tntctncnnn 180

atggannttn cctttaanct ctagatccat tgacancctg ancatntcta aaaggcatta 240

ngaaactgaa cacatctgat acagaactct gcattnnctt ccnaantntg cccannccna 300

gcctgntcct nnttcacgct tancacttat natatgatcc cactattcac tnantctctg 360

aagcttaaaa cctangattc atgcttgact actgnataat nntacaatct actcctaatg 420

cattagcaat tcttgctagc tctaccttca aaatatattc tgaatagact atntcttgcc 480

gnttcccttg cctncncatt tcccatctgc accccttctc tnctncccaa aatcaataca 540
ctagntgttt ctaaaaaaaa tatnganann tagnnnaaaa ncntaaataa atntaaaana 600

angnntancn tnacanaana ttnctaatat aggnnanntn ntgncaanaa cnntaantnt 660 

tnaatacgnn aaaactctct cnaanngann aanntatnnn agttaaaagn naaatannnn 720 

aanantncca aatntanaag ataangncat aannntatna gncnnaacgc taantgnnga 780 

tganntntaa tnngnatana nnantngtta nnacaaaatn tacnn 825 



<210> 627 

<211> 772 

<212> DNA 

<213> Homo sapiens 



<400> 627 

tttttaatgc ttngtcgnac ttctcccagn aatcgnttng aaactcngcn actcgttctc 60 

tctgcangat cccatcgatt cggaaatttg cactgatggc tcanaaggct tacgttttgg 120 

agagtatgac ctacctcaca gnagggatgc tggaccaacc tggctttccc gactgctcca 180 

tcgaggcagc catggtgaag gtgttcanct ccgaggccgn ctgncagtgt gtgagtgagg 240 

cnctgcagat cctcgggggc tngggctaca caagggacta tccgtacgag cgcatactgc 300 

gtgacacccg catcctactc atcttcnagg gaaccaatga gattctccgg atgtacatcg 360 

ncctgacggg tctgcagcat gccggccgca tcctgactac caggatccat gagcttaaac 420 

aggccaaagt gagcacagtc atggataccg ttggccggag gcttcgggac tncctgggcc 480 

naactgtgga cctggggctg acaggcaacc atngagttgt gcaccccagt cttgcngaca 540 

gtgccaacaa atttgaggag aacacctact gctttanctc ngaccgtgag acacttgctg 600 

ntnccntttg gcaaagacca tcatgganga ncanntnggt nctnaancng nntggccaac 660 

atnctcatca acctgtattg gcatgnaccg cncttgctgn acnncngngc caaancnctc 720 

nantccgcca ttggggcttc cggnaaccac tnnacaccaa ggttctnttg gc 772 



<210> 628 
<211> 808 
<212> DNA 

<213> Homo sapiens 



<400> 628 

tcnctcgnaa cntttnannc ttggctactc gntctctctg caggatccca tcgattcgaa 60 

ttcggcacga gatgacatcc tcattatcca cantgcaaag ccaaccatcc ctatgatggg 120 

ttcattgtgg atcatgactt antgggtcaa gagtttggaa gtggctcagc tgggcggnct 180 

tctgctncat gtggctgcca natggtnccc tgctggtnng cagnctngtc tagagggtcc 240

atgatggctt tactcacatg cctggcatct tgacagggac agctggnang caaagnnnat 300

ctgggactgt ncacagagct ncttcntgtg gcctttccag catggtggtc taagggtagc 360 

tggacttnct gcatnacagc tcagggctcc cagagctact gtcccaagag atnnaaagtg 420 

gnaactgnca atcttttang ctaangncca gaaaccatta cccctgcacc ncacagtctt 480 

tttntanctg ntgaaataaa cattnnnttt atcaattnta ancattcgca aattggaatt 540 

aaataccttt tactaatttt gncgtgacca tctgcccctn gttcaagatc taaaaaactt 600 

ttatngntca tcntgnngat ntaaaaaact nttgtgttng catttanaac ccntaagcan 660 

nttnggcant tanannnaan annttnnnaa acccttntat anaaccttat taagttgang 720 

catnngnant ttcnccttna aatccnaggt ccttagggct angnnatacc nttcntatng 780 

naactttngg gaacctaaan cctctcct 808 



<210> 629 

<211> 827 

<212> DNA 

<213> Homo sapiens 
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<400> 629 

ggccnncttt gaaccttntt caaatcnttt ggcactcgcc nctctctgnt ngntcccatc 60 

gattcgctgt gatccaaggc atgaaaagag tgcaaggtaa ncangnggca gcnttnatng 120 

aagcatnaaa taangcnaaa gcnnatgctn anctnangga gcangnngct aaagacaacc 180 

acannctanc tgnntnctaa tcatgctntg cttnctnang tgancttata gnaacgcant 24 0 

nagactncan gcnttgcttg gcncaacaag gnnacctana ntcatnanga agcnnttgaa 300 

ctaangagtg gctacnncct ttnntnctca tgcntgacct gtaatnattc ttctganttg 360 

aggcaanagc gggtnnaant natngntnan ntgnaaanac tntnnnatcc gnnnntnctg 420 

attannttnc attnntntna atgatanann ctcatcnngc tcgncctgna ctttganang 480 

ctnnnntcnn anntnnctga ctttaggagn nnacctncag cganatgtna agnanngaaa 540 

ttnanntncc tnncgntccn ccttgcngac tnanngtctt gngnaacntn angtanntan 600 

tctacngggn gnnacnttgg nnaattgggg ncttataaan tnttcntnna agaatgantg 660 

naccaattnt nnaanntcta agnttgggga aatctnngtt tcctgnatnn gnacaaaaan 720 

tcgatttann ngncnngntt nnttgggcnt catntgccat tgatgcnatt cnacttatgt 780 

cctcntggng cntnttnnaa nnnggnngnn aacatttttt gtgtgcc 827 

<210> 630 
<211> 793 
<212> DNA 
<213> Homo sapiens 

<400> 630 

ttcnaatgct tggncnngag tccncctttg aacnttttca aatnncttgg caactcgcnc 60 

tctctgcatg atcccatcga ttcgaattcg gcacgaggcg ngttgttcta cactgcnntc 120 

ngaagntttn ntaanaagcc accacttagc ngaggcnnct acangtcttg gggncttagc 180 

gaagagaaat cncgctggca cttgncccgt tcacntaagn actnntgnct gantccnagg 240 

gtannngtnc accttgngnn ccancagaca nacccaannt gncntaaaan gggcaggtct 300 

aagcttacnc tngactncac nggcaagctg nangcctgtn ctgccttccn ctgcnntnac 360 

aaatngacag tnngaccaag agtcanagna aaaactncaa ggatacatnt atcccantct 420 

nttctacacc tntanattcc ntganctatt gctcanaccn atcgtgcggg caaaggcaag 480 

acttgggcaa cattnttnaa tacaatgatg ctgacaanta atttccngct ngttgccagg 540 

natntttacn cgagctnttg tgattccaaa ctaaagaatg gngccnnnan gcccntcntt 600 

antnctggnc ccccanaang ancctaactn gcgaaaggnn agnatggcat tnacccaaac 660 

caacttntng gattacnnca actccanaan atccgacggc atnnaanang caaaacaaca 720 

acttcncnan natnnaanna atngnnccnn aaananaacc cgngcntctn aaacnattgt 780 

ggacccatnc ccc 793 

<210> 631 
<211> 752 
<212> DNA 
<213> Homo sapiens 

<400> 631 

gnagtnncct tngancctct ntnaaatcgc tttngcnant cgcctctttc tgntngatcc 60 

catcgattcg aattcggcac gagatgttac agacatgaaa tatgaacaga atnctaaaag 120 

aacataaaag aataagagct ccttaaagat tataaataaa tggtgatgtt aaagtaatag 180 

caccattgga cgaagctagg gaatcaacac ttgacagaaa gatacatatt ttttttatac 240 

aaactacata tatttgagca atcaagtagt agacatagag aattttcttt ttatggaagt 300 

actctaataa gtaaagggct gatagaatta tatcagcatt ttctagctcc tggtgaatta 360 

tgcattgggc atccatggct gccttagatc acaaaaatac caccagatat atgcctgtgg 420 

atgaaagatc acaccaccac ctgtgaaata gtcttcccca caaaaaatcc aacccaaatc 480 
ctatccagcc tgtagatggt actcgagatc ttctataaga aataaagaga gcangctggt 540

cacggtggat tgtgcctgta atcccagcac tntgggaggc caangcaggt ggatcgcctg 600 

angtaaagaa gttcnagacc agcctgccaa catggtgaaa cccccctctn tacttaaaag 660 

taccnaggat gagcccggcc gttgtggcaa gcacctgtgg tccccagcta cttgggaagc 720 

tgagcangaa aaatcgcttg aanctgggga ng 752 

<210> 632 
<211> 751 
<212> DNA 
<213> Homo sapiens 



<400> 632 

gnnnnnnttn nnnnnttcta atgcttggct actcgttctt tntgcaggat cccatcgatt 60 

cgcaactaga gaagattgga cagcaggtcg acagagaacc tggagatgta gctactccac 120

cacggaagag aaagaagata gtggttgaag ccccagcaaa ggaaatggag aaggtagagg 180

agatgccaca taaaccacag aaagatgaag atctgacaca ggattatgaa gaatggaaaa 240

gaaaaatttt ggaaaatgct gccagtgctc aaaaggctac agcagagtga tttcagcttc 300 

caaactggta tacattccaa actgatagta cattgccatc tccaggaaga cttgacggct 360 

ttgggatttt gtttaaactt ttataataag gatcctaaga ctgttgcctt taaatagcaa 420 

agcagcctac ctggaggcta agtctgggca gtgggctggc ccctggtgtg agcattagac 480 

cagccacagt gcctgattgg tatagcctta tgtgctttcc tacaaaatgg aattggaggc 540 

cgggcgcant ggctcacgcc tgtaatccca gcactttggg aggccaaggt gggtggatca 600 

cctgaggtca aggagctcga gaccagcctg gccaacatgg tgaaacccca ttcttttctt 660 

aaaaatacca aaaaatttag cccangtgtt gaatggntgc atgcctgtaa ttcccagctt 720 

ctnanntagg ctnanacaag gagcttncnt t 751 



<210> 633 

<211> 806 

<212> DNA 

<213> Homo sapiens 



<400> 633 

ttnnannncn ttttnaaaag gcctnnnntt gannctttcn aatgcttggc tactngntct 60 

ttctgcanga tcccatcgat tcgaattcgg ctntagggaa ggggagggtt ggtgagtccc 120 

agaccttaaa aatacaaggt taagagggac cccaaagcaa aaaattccaa cccttttcct 180 

cccagtcatt gaaacaccaa aactattata ccggagggtg taatagtttt gctgcccagt 240 

tgtggtaggc cagtagtggc ctcccaagat gcccatgtcc taatcccagg aacctgtcaa 300 

aattaccttg tatggccaaa ggggctttgc agatgtaatg aagttaagga tctttcgcca 360 

ggaagattat cccagcttgt cangagggct tgatgtcctc acccgggtct gtataacaga 420 

agagcaggtg acgggagagg aggttggagg tgtancgatg gacangaaac tggagttata 480 

ggagggcagc tnaagccaca gaatccaggc cancttanga gcccaggaaa atgcatttct 540 

ttccacaaaa gcccttggaa ggccccaanc cctgcttccc acccttggac tnggcttcaa 600 

tgaggcttaa tttttataaa ttcntggctt gattttagaa ctcntaaggg gaaataaatt 660 

ttgtgttngn tttaantcan aaaataaatn aattaaaaaa aacttgaanc ctttanaaac 720 

tntantggaa ttcntattan cttaaanccn aancttggat taaaggatnc atttgtttna 780 

anttttggga cnaaccccca anttnt 806 



<210> 634 

<211> 775 

<212> DNA 

<213> Homo sapiens 
<400> 634

ngggacttcg cctnacgaac cgctnggaaa tcccntntnt gnaggatccc atcgattcga 60

attcggcacg agtataaact ttattttatt ctcttctggt tttgtgttac atgacaagaa 120

attgaattaa nncaatanaa ttttagttcg ggttgcttag gtttttactg ctcccattct 180

tgcttttact aatttatcca agattagatg tgattactat ttaataataa tttagtcctc 240

acacttacaa accacttaca ataccagcat gcttctatca ctgtaattct attcaattct 300

caggcccatg aggcatgcca gccagacgac cagacagcat ttatagagtg ggcactcaat 360

accagccaca aaagatcctg tgtcagaagg ggaaacaggc ttggaggctt ggagtatgtc 420

gtgatagcct ccctccagtc cacacaactg gtactgctgg ggctgaaact agaactcang 480

cctatgcctc tcaagctcaa gggtcggatg tccatgtnct tcgcctctag aactatannn 540

gagtcgnaat tacgtagatc caagacatgg gtaagataca tnggatgagt tnggaccaac 600

ccaccaacct aagaatgcan tggaaaaaaa tgcttaattt ggtgaaaaat ttgtgatggc 660

tattnngctt aaatttngnn aaccatttna taaagnctng cnantaaaan aaaggtttaa 720

ccaacccaac caattggcaa ttccatttca anggtttcaa gggtccaang ggggg 775



<210> 635 
<211> 784 
<212> DNA 

<213> Homo sapiens 
<400> 635 

ttgagngtcc tnctttnacc ctttcnaatn gcttggcnac tcgctctntn tgnaggcatc 60 

ccatcgattc gaattcggca cgagatatag ctctggaggt caggacatag gagatattga 120 

ttcaggactt gccagagtat ggtcttgggg tgtgccctga tattacaaac agggatctta 180

gtggctaggt gatgaggcca tggcaaatgt agatggacca agatcaattt gcctttctag 240 

atgaggtttt ctaggtgaaa tgtttttgaa actattttgt agcctagtat aatttataaa 300 

agtagagaga aactataaat ataaatttgg aangggttag ctaaaaggag aaaacagcan 360 

aatcttcata tatatanaaa tggatattaa tttgctagaa ttaanagact gcaggtaaag 420 

atagnttttt ttaatacctc tttttgctgt anaaaggaca ggattaaatg atnaagggat 480 

gctggaatga ggaatggtaa ctttaggcaa gatagtcttc tgngacggct gatatgaaca 540 

atngagagta anacatttnn aatacaanaa attgtcctgc tgctcaccca tcaagccttt 600 

tcangtttct tcccttgcca aaantngtaa naacttntgg tacttttnna ncttgtatnn 660 

ttccngttna ttggttanaa ccccttcgat naanaanncc atantttnaa tttgggnttg 720 

acccccnagg ttaaaanttn ccntttnctc aatttcccct tttcaaagnt ttaacntaat 780 

taan 784 

<210> 636 
<211> 765 
<212> DNA 

<213> Homo sapiens 
<400> 636 

ttnnannctt tcnaatnctt ggcnactcgt tctttctgca ggatcccatc gattcgtcct 60 

gcgcaggagc cgcagggccg taggcagcca tggcgcccag ccggaatggc atggtcttga 120

agccccactt ccacaaggac tggcagcggc gcgtggccac gtggttcaac cagccggccc 180

ggaagatccg cagacgtaag gcccggcaag ccaaggcgcg ccgcatcgct ccgcgccccg 240

cgtcgggtcc catccggccc atcgtgcgct gccccacggt tcggtaccac acgaaggtgc 300

gcgccggccg cggcttcagc ctggaggagc tcagggtggc cggcattcac aagaaggtgg 360

cccggaccat cggcatttct gtggatccga ggaggcngga acaagtccac ggagtccctg 420

caggccaacg tgcagnggct tgaaggagta ccgctccaaa ctcatcctct tcccaggaag 480 

ccctcngccc ccaagaaggg aagacaagtt cttgctgaan gaacttgaaa cttggcccac 540 
ccaactgaac cgggacccgg tcatgcccgt tccnggaaan gtctattata aaggagaaag 600

cttcgagtca tcanttgang gaanaagaag aatttcaaaa gccttcgctt atnttcngta 660 

ttngcccgtg ccaaacnccc cngctttttn ggcttaccgg ccaaaaagaa gccaanggan 720 

gcccnnanaa cagggatntt gaaaaagaaa naatnaaacc ctcnn 765 



<210> 637 

<211> 853 

<212> DNA 

<213> Homo sapiens 



<400> 637 

ttttggancc nttctttgan nctttctaat gctgggntac tcgntctctc tgcaggntcc 60 

catcgattcg aattcggcnc gaggatcagc ccacctcggc ctcncaaagt gctgggatta 120

caggcgtgag ccaccttgcc cagcccacat catacagttt gaaatgaaac tttgccacaa 180 

ccagcctttg ctgtagcaca cacatatatc actgaacctg tttgaaataa agtttttttt 240 

ctttntcctc tggtattctg ggttctgaag tctggtattc tggtattctg ggttcaaaag 300 

tatgacttga gagtgttgct ctggtattct gagagttgct ctgtattctg ggttctgaag 360 

attatttgaa aaataactcc tactacattg aaatgcagac ttaaaaattt aaacattgga 420 

ttangcagtc aaaaaaacca agcaagcata aaaggtcaat aagttgtaat cttgatagta 480 

aaggtggaaa acttattata aatggnaang aaagttttat ttcctttttt gtttgaatgg 540

gcaagtatgc catattatac ccaaaagttc ttttaaaaaa atatttccca ttcaacccat 600 

ttttaattna aaattaaaac cattttgnaa gggaaanttt acccaanggc aanccttttt 660 

tttcctccaa aaaggttnac cntgttnatc cttctttttn ggnaaattta nccaccaatt 720 

tttttaaagg ngggncaatg gggnttaaaa ntanccctgn aagnnatttt ttnanccttc 780 

caggtttaaa antccccttg gatngggtct taacctgggn gggtngnata naaaaaaata 840

natcctnttt anc 853 



<210> 638 

<211> 740 

<212> DNA 

<213> Homo sapiens 



<400> 638 

anttgntctt tntgcaggat cccatcgatt cgcagcaaag actttatttt tgtacagaag 60 

atggtgaagt ccaagacggt ggctcagtgc gtggagtact actacacgtg gaaaaagatc 120 

atgcggctgg ggcggaaaca ccggacacgc ctggcagaaa tcatcgacga ttgtgtgaca 180 

agtgaagaag aagaagagtt agaggaggag gaggaggagg acccggaaga agataggaaa 240 

tccacaaaag aagaagggag tgaggtgccg aagtccccgg agccaccacc cgtccccgtc 300 

ctggctccca cggaggggcc gcccctgcag gccctgggcc agccctcagg ctccttcatc 360

tgtgaaatgc ccaactgtgg ggctgtgttc agctcccgac aggcactgaa tggccatgcc 420 

cgcatccacg ggggcaccaa ccaggtgacc aaggcccgag gtgccatccc ctctgggaag 480 

cagaagcctg gtggcaccca gagtgggtac tgttcggtaa agagctcacc ctctcacagc 540

accaccagcg gcgagacaga ccccaccacc atcttccctg caaggagtgt ggcaaagtct 600 

tcttcaagat caaaagccga aatgcacaca tgaaaactta cangcagcan gaggaacaac 660 

agangcaaaa aggcttaaaa aggcggtttt tcagctgaaa tggcaccnnc aattganagg 720 

actacngggc cccgtggggg 740 



<210> 639 
<211> 774 
<212> DNA 
<213> Homo sapiens 
<400> 639

ttttnnctnt taatcaatcc tttgttgact ccttggctac ttgttctttt tgcaggatcc 60 

catcgatncn aattcggcac gangtgatgn cagattgnna ntncactaaa ctgggcannn 120 

catcaggatc acctgtgggc cttcannaat cananatnca cccccaggcc atgccctnga 180 

cccagtgcac caggacaaga aatccacccc aggcctctcc cnagacccac tgnaccagna 240 

caagaaatcc acccccangc cangccccnt acncactgcc ctangatntn nnggtgtnaa 300 

ccnggtggtg ctttgtaaag acgtgcangt ggtaacccca cgccgncncn ctcnnnacnt 360 

tggacacatg atcatccacg tgtctgtgat ttgnttcctc ggnttnnttt gtgaatngaa 420 

aataantgtn ncgtttgact agggtttaag agcagcaggc agnccctcag ctcagcaagc 480 

ngccctctca gctcagcang cagcccaagt ctcctgtang acttctatgg accatnctgg 540 

cgggaatgaa gaaactggtc aagctggatt cgggactgaa agtgtaccnt ggtgacaccg 600 

tatgactnan ctgactnana aagatcactn atctttccac acttgnnggg naggagccnn 660 

tannangttc aatatgcnnt ggtngantcc catngctaca atttcatgga cacantttga 720 

ttacttnnga taannnaggc ccttggaggc cccttntccc cttttaacng gaat 774 

<210> 640 
<211> 743 
<212> DNA 
<213> Homo sapiens 

<400> 640 

ctnnncctcc ttgatccntt cctnctttga anncatnngc tacttgttct ttttgcagga 60 

tcccatcgat tcgaattcgg cacgaggctg acctacatca gaagctgctg gatgcagnaa 120 

agtgaaaaca gaccaaaaca acacngggcg aatcttnaca ccattntggg tgccnnatnt 180 

nnccrmngat atttgcttgc tnagctctac tcctccaaga nannangnnt caaacnctnc 240 

agcangntag agcanntnaa gaccgcntnt nctnacctnc tnaagannct ctgngaggan 300

cgcaatcctt tngtggaana tagaatcaac agaccacact gcnctctgga ccatgngctc 360

tcaaangngc tagaaggtgc tgaccttttn agactcttgc agaagaggcg angtggtgng 420

anaccctnna ggaanacttt cccgaactag accncnnctt ncngaacnng ntcaactgtt 480

ggggnngaaa ncntgtgann tgtngncctt cngagagacg gcatattcta tgatggcnga 540

cttnatnctt ctgcggaacc anactngacn tactgaaaga aanctganac caagcgtctt 600

ccttaaggac ccttatatcc agacnatcct ttggataata ccnctnggcc aaaacctnnt 660

aactntgcat acaatcngga tggcaacatt tgaactggng gccttnanna ccnttaccgg 720

cttttcncat tatgnaagag ntn 743

<210> 641 
<211> 740 
<212> DNA 
<213> Homo sapiens 

<400> 641 

ctttcctttg antcttcttc tannaaacgt tngaacgaan tcngcacgag accactaaca 60 

gcatctactt gactactgat actttgatca tggagtttgg gcatgccact tgatagaaat 120 

ttgaagagca attatatttt tcaaaaagag ttttgaataa tgttaagata gattgcaaca 180

tgactatcaa ttcttccctt cccatcaaag gagagagtcc gtttatccag cctttgaatc 240

ttgattattc aagtgacttg cttcacccaa tgtaacatta ataagcacaa tacaagcaga 300

ggcttgccaa gaacttggtt tgtttctaat gcttagaaga agaatggtgt atgccatatt 360

tctgcattta gaactcacgt ggagacatgt gtggcccaat tgctcctctt tcatctcagg 420

caataaccag acacgggact gaggccatcc atgaccagcc agccctagtc aacacacaac 480

acacaagctg atcacagatg catgagtaag cctaactgag accagccaag accagcctag 540 

aatagaactg ctcagcagca ataaaaacta aataaattgt taccttaagc tacttttaga 600 
gctatttgga agtgtatttt tgtgcagcta acatttacta tcagataaaa tggtgattgn 660

ttatctctgn tttaatgatg ntttaaggaa atggttctat taaaaggaaa tatctggggc 720 

tttgtcaccg ttaaaaaaat 740 

<210> 642 
<211> 737 
<212> DNA 
<213> Homo sapiens 

<400> 642 

tancctttga nnctttctcn ncntgncntn rmngnaacga cctcggcacg aggacacccc 60 

agatgcagcc accaccagca gaagcgatca nctgacccca caaggttttc gtggctgtgg 120 

ccgtgggctc aggtggcagc tatggagccg aggatgaggt ggaggaggag agtgacaagg 180 

ccgcgctcct gcaggagcag cagcagcagc agcagccggg attctggacc ttcagctact 240 

atcagagctt ctttgacgtg gacacctcac aggtcctgga ccggatcaaa ggctcactgc 300

tgccccggcc tggccacaac tttgtgcggc accatctgcg gaatcggccg gatctgtatg 360 

gccccttctg gatctgtgcc acgttggcct ttgtcctggc cgtcactggc aacctgacgc 420 

tggtgctggc ccagaggagg gacccctcca tccactacag cccccagttc cacaaggtga 480 

ccgtggcagg catcagcatc tactgctatg cgtggctggt gcccctggcc ctgtggggct 540 

tctgcggtgg cgcaagggtg ttcaggagcg catggggccc tacaccttcc tggagactgt 600 

gtgcatctac ngntacttcc tctttgcttc atccccatgg tggtcctgtg gctcatccct 660 

gtgccttggc ttgaatggct ttttggggcc tggncctggg ctgttaaacc gccgggctgg 720 

natttaacct ntnggcn 737 

<210> 643 
<211> 748 
<212> DNA 
<213> Homo sapiens 

<400> 643 

cttttaaccn tttganccnt ccctcnaaac cttngatncg anttcggcac gaggaaggca 60 

gaagtgtaaa tgaacataca ntttaaggag aaagcctgct gtgtttnnct tgttcagcag 120 

ggtattatga attagcacaa gtattgcttg ctatgcatgc taatgttgaa gatcgaggga 180 

ataaaggaga cataactccc ctgatggcag cttccagtgg aggttactta gatattgtga 240 

aattattact tcttcatgat gctgatgtca actcccagtc tgcaacagga aacactgcgc 300 

taacttatgc atgtgctgga ggatttgtat gacattgtta aagtgctcct taatgaaggt 360 

gcaaatatag aagatcataa tgaaaatgga catactccct taatggaagc agccagngca 420 

ggtcatgtgg aagttgcaag agttctttta gatcatggng caagcatcan cactcattct 480 

aatgaattca aagaaangtg ctctaacact ngcttgctac aaangccatt tggatatggg 540 

gcgctttcta cntgaagctg gtgcagatca agagcncaaa acagatgana tgcacactgc 600 

cttaatggan gcctgcatgg atnggacatg tanaggtggc acgtttgctt tttggatant 660 

nggtgctcan gtgaacatgc ctgcataatc atnttgaatc tccattgacg ctagctgcct 720 

gtgganggac atgttgaaat tgcngcct 748 

<210> 644 
<211> 759 
<212> DNA 
<213> Homo sapiens 

<400> 644 

tcnnncnctt ttcgatcttt tgagncttgc ctttgaaccc cttggntacg anttcggcac 60 
gagggaacca tgananccna gagctagaat tgctattgga tnncgtctat tctctntttg 120

cttattgggn cgngntncgt ggttnctggc ctcangggtn nncccgaang anggggtatc 180

tnngagcnan ttntgcnntt tacnggctag cttgntgggg gcttaanntg ccactnttan 240

acatgctnta ctantcantg agannntncn ntcgaccatn tannacnatn ctgtgnnntc 300

cngtacnctn tggccgnatg gagctattag cttcaanatg nntcgnantg ttacatgcan 360

ncactgannt nactatccan natntaagtn ctcttngctt actgtgaaca nnngctactn 420

ncttggatat tatagnaagg ntcnttgata cncgatnatc ntncntgtca gatcnataaa 480

tancanctat accnactgtn naaatnccat ctggnggnct tncnatccan acataattgc 540

attannncgt cnaattgnga tanagtnttg aaagantctn ggtttagacn ttggatgttg 600

caatgnttgt gncttanaan ttatgtgctg gctactgant aanctggggg catgacntta 660

ctggnttgac ctaagnggng aantcnatgg tccgattgct ggnccctanc cttaagnttt 720

gccatgaata ggncttttgc cctaaaataa naccccttt 759


<210> 645 
<211> 766 
<212> DNA 
<213> Homo sapiens 

<400> 645 

tnnnnnnntt tcaatntttn ancgtccctt aggatccntc gattcgatcc agatgggata 60 

cctctaaaca cgaaaagaaa gaagattcca ttantgaatt tttaagtttg gtttnatcaa 120

aagccgagcc acctangcaa cagtccaccc ccttagtaaa caaagaggaa nagcatgcac 180 

cagaatcatc cgcaaatnag acagtcaaca aagatgtgga cgcacaggct gaangagaag 240 

gganccgcca tccatggact tattcatggc catctttgcc agttcctcat atgaaaagtc 300 

ctnatcctgc gangatganc acggtgacag tnaanatgat caggcacgct ctggngagga 360 

caacttccaa agctggnaag acactgactt ggnggaaaca tcatctgtgg ctcacgctnt 420 

tgtgccagng ccctaggagc cgtcaccttc cttcccgata caaangatgc agatagatna 480

naganaagag ntcggccngn ngctgcctcc cgtcttatgt nccaatgctc gtcagacact 540

tgaagttnct canaaagaga aacattccaa gaacaaagac nagcacaang gcaatanaga 600

acacaggccn gaaagaattg anangaaatt ggaaacactn gaagcacnaa acacctaang 660 

naatccaaaa naattggcaa accaggggaa aagtaggtnc ctncgngaag tttcgacagc 720 

cngcggacaa gccanaattg acnatgaaac cgcatacgtg tcttnc 766 

<210> 646 
<211> 752 
<212> DNA 
<213> Homo sapiens 

<400> 646 

ttnnnnnntt tttatcctnt natncttnct ctttggatcc atcgattcgc tccaaggaaa 60 

atccacctcg cagcttgtaa atctacagcc tgattacatc aaccccagag ccgtgcagct 120 

gggctccctt ctcgtccgcg gcctcaccac tctggtttta gtcaacagcg catgtggctt 180

cccctggaag acgagtgatt tcatgccctg gaatgtattt gacgggaagc tttttcatca 240 

gaagtacttg caatctgaaa agggttatgc tgtggaggtt cttttagaac aaaatagatc 300 

tcggctcacc aaattccaca acctgaaggc agtcgtctgc aaggcctgca tgaaggagaa 360 

cagacgcatc actggccgag cccactgggg ctcacaccac gcagggaggt ggggaagaca 420

gggctccagc taccacagga cgggctctgg gtatagccgt tccagtcagg gacagccgtg 480 

gagagaccag ggaccaggaa gcagacagta tgagcatgac cagtggagaa ggtactagtc 540 

aaccttcaga aagagtatgg agagaaaaag aggcacacct ggacgcagag ccctgccagc 600 

gccctctctg ctgttgcagc tgcaaggaga ccatgcctgt gggagccagg cctcgcttgc 660 

atgaanaagg aacgatgcct ttttcaatgg tgtcttcctt ccattgtgca naanaacctt 720 
ttggtggctt ctcttccgac ttgtgcctga tt 752

<210> 647 
<211> 743 
<212> DNA 
<213> Homo sapiens 

<400> 647 

ttaatccttt caattcgttc ntctttggat ccatcgattc gaattcggca cgagccctcc 60 

ccggcttccc ccggagtggg tcaccacact gttttttatc atcatgggaa tcatttcatt 120 

gactgtcaca tgtggtttgc tggtggcttc ccactggcga agagaagcta caaaatatgc 180 

tcgatggata gcattcactg gaaccactat gagaagatta taggaaaaac accaagacta 240 

gaggactctg ggttcctttt atgcaaagtc aactcttctg ggtcacagtt acccagcaac 300 

aaaaataaag agaggaccag gacgatgcca gcaccccgtt tatcctgagt gaactctccg 360 

gaggcctctt caagcttgtg ggttctctgc tgtcttgaag ccatccatcc atttgatagg 420 

ttttgcaaag acttggtcct gccaagatgg ttttaatcat ttctgctaaa aggaatggac 480 

tcgaggattt gatctcattt tagatgcagt tgtcctcact tggccatttt acagcacttt 540

agtaaatatg gccagtgtat ttggtcacta ttaaatcaat ccccattcat tatctgtcan 600 

ggcaactcag tgaactaaat actatgttct gacctctggc actctttctc atgttggtta 660 

aatatttaat attgnctaag gcaattcaag tatttttctt aaataaaaaa tatgaaaact 720 

caaaaaaaaa aaaaaaaaan ana 743

<210> 648 
<211> 759 
<212> DNA 
<213> Homo sapiens 

<400> 648 

ttttaatccc tttcatttcn ttccttngta ggatcccatc gattcgtttt tttttttttt 60 

ggtgattgga ttaacaattt tattctgnnt ccactacaaa ngggctggtg ttttgttcca 120 

aatgtttagc tgggagggct gtagggaccc ctgttacccc cattaaacac agtaaagcat 180 

ggatccagtc agccccctgc tggcaggtgt gggcctggca actacacaga tccaacccca 240 

ccctcctggg tgcggccaga ggccaaggca gtcgcccgag ctcctgaatc ccaagaatgg 300 

ttctggcaag tactgctgtt tgtttgtagg ggcaaagagt taaaataaaa cgaggttctg 360 

ccatggctaa gccttgtgga aaccagaccc caaagcccct gccatgccan gggtctcaac 420 

nccagacgct tgttatggag gcaccancng gtantggccc ctgtaagcan ggccagagtc 480 

gggacaaaga gcaagantga aacanccaag agacanagga ccatgctgga ccattgggca 540 

cncangaacc tgcctgggaa aaaccggggg gcaangctgg catgggaatg aacacctgct 600 

tgntgacacc tatntgagct tcanttncct taacttgaaa aattgaacan gcccggtncg 660 

gtggctcata ccctgtaatc ccancacttt tgggangctt tangccgntt ggatcattga 720 

ngttaggaag attaaagaac cancctgggc cnacattgg 759 

<210> 649 
<211> 746 
<212> DNA 
<213> Homo sapiens 

<400> 649 

tnancctttg aatccttgaa ngnngatccc tcgattcgcc ggaacctcat ccagtgccac 60 

ccatcttgac accttctccc tcttcagctt ttccaacagt cactactgtg tggcaggaca 120 

atgatagata ccatccaaag ccagtgttgc atatggtttc atcagaacaa cattcagcag 180 
acctcaacag aaactatagt aaatcaacag aacttccagg gaaaaatgaa tcaacaattg 240

aacagataga taaaaaattg gaacgaaatt taagttttga gattaagaag gtccctctcc 300 

aagagggacc aaaaagtttt gatgggaaca cacttttgaa taggggacat gcaattaaaa 360 

ttaaatctgc ttcaccttgt atagctgata aaatctctaa gccacaggaa ttaagttcag 420 

atctaaatgt cggtgatact tcccagaatt cttgtgtgga ctgcagtgta acacaatcaa 480 

acaaagtttc agttactcca ccagaagaat cccagaattc agacacacct tcaaggccag 540 

accgcttgcc tcttgatgag aaaggacatg taacgtggca tttcatggac ctgaaaatcc 600 

atacccatac ctgatttatc tgaangcaat tcctcagatt tcaactatca aaactaggga 660 

aaactgngag tttaacacca agtnctacaa cacaaggttg gaaacacctg aacttggngg 720 

atcatgatac cacttnacca ctccnt 746 



<210> 650 

<211> 789 

<212> DNA 

<213> Homo sapiens 



<400> 650 

tgaccctttt gaaantcctt gcatntttca naccntttgg tacnnncant ttnngntgga 60 

tccctcgttc gctgnacaaa agatgttttt caattaaaag acttggagaa nnttgctccc 120 

aaagagaaan gcattactgn tgtgtcagtn aaaggaancc ttcaaagctt tattngatga 180 

tgggtttggt tggactgtga gaggatcgga acttctaatt attattgggc ttttccaagt 240 

naagctcttc atgcaaggga aacataagtt ggaggttctg gaatctcagt tgtctgaagg 300

gaagtcaaaa gcatgcaagc ctacagaaaa gcattgagaa agctaaaatt ggcccgatgt 360 

gaaacggaag agcgaaccag gctagcaaaa gagctttctt cacttcgaga ccaaagggaa 420 

cagctaaagg cagaaagtag anaaatacaa agactgtgat ccgcaagttg tggaagaaat 480

ccccaagcaa attaagtagc caaaagaagc tgctaacagg atggactgat taccatattc 540 

gcaataaaat cttgggccaa aagaaaattt gggttttgaa agaaaataaa aattgatngg 600 

aactttttgg aattccagaa gactttgact acatagactt aaaatattcc atggttggtg 660 

aaaggatgta ccaagctttg tgaaatattg taaattttta aacctattat ctactaaagt 720 

ngtactggaa ttgtccnttt gcctgttnac ttgnggtnta ntcatttnta tttaatgntn 780 

aaattaang 789 

<210> 651 
<211> 757 
<212> DNA 
<213> Homo sapiens 



<400> 651 

tnnnnnctaa ncctttgaaa tcgtccntgc atgatccctc gattcgaatt cggcacgagc 60 

agatatttac tgaaggaatc taggttgttt tttcagtgga caatgggaat aanncatttc 120 

taaagcaccg actggagagg aaggcaacag agacaaggag agaagccgag agacatgtct 180 

gcgtgctgcc acgcatctga gcgattgctc tgtgaagagt tgtacactga acattttcag 240 

gggaggctgt ttacccaggc aatgtcctca aacaagcctg tgccggggtg tcctggaatc 300 

tgtgccagga ctgtgttttt agcccttcac ctctcagctt tagcaggaca tgaaccagtt 360 

ataacaagat ggccctgcag ctggttacag gaatgtgaca tggcaggatc tatggaacca 420 

aatggaaggt tttnaggtga tgtaggtctt tcacagttag ctttggggaa tacagaatac 480 

tcaaataaag tgctttgtta ttatttcaga gggaatggcg attgaaatgt tacaacagag 540 

atttcttggt ggtagctatt tgggtaaang tatatggata tttntctgta catgtgaaat 600 

tatntaaaat aaaagttata taaattacat tgacaaaaaa aanangtana aaaaaaactc 660 

gaacctttaa aaactatngt ggagtccgta ttacgttaga tccagacctt gataaganac 720 

cattgatgaa ttttggacaa accccactng aatgcnn 757 
<210> 652

<211> 759 

<212> DNA 

<213> Homo sapiens 



<400> 652 

tcnnnccttt aatgctttga actcgttgca ctgcangatc catcgattcg aattcggcac 60 

gaggctgncc aggcagttnn atggcctnct ggttgtgtgc cttcacaccc gcctacagcc 120 

ccacctcacc atcaagcgct gagccaatgc ggntgtggct ggccctgagt tcctgagtca 180 

gctccttgcc agggccagag ctggtnacag cggggcanca nggtgggtag cctctaccag 240 

ncagggcagt ccctgagggg ccagcanggg ggctgactgc ctagtggctn aacctactga 300 

acccacccac tcccagcgat gctacccaga accccaacgg cntgaatcct gcacantgcc 360 

gggcantgcc agactcnaaa gggctcgctg tggggacagc cccgtcatgg ccacanactc 420 

tgtcctcacc tttgattgtc aggatgacag nccccaccac catgatgagc gtctgcaggg 480

cgtccgtgta gattacagca gccaggcccc ctgccaatgg aagcaaggtt gctggaaggg 540 

gccctggtcc agggaggaag gacaccggga ggaacttctg ggcttctgct ggggccactt 600 

cctgggctgn tnctcnggnc tgtatgggga agtggccttn tgaccccttt acacgttccc 660 

tgggtggacc ttccctgntt gcangcaccc ataccttgcg atggtgtnca nggctnttga 720 

tgcccnaaac tttaggattg ttggatangt nnaatctnc 759 



<210> 653 
<211> 820 
<212> DNA 
<213> Homo sapiens 



<400> 653 

tgcaatcccn cngnnaatcg ctttgaaanc nccntcnctg tatgatccca tcgattcgca 60 

acagtccagg ctctgcagac agcatcccac ctgtcccagn tngctgacct gaggagcatc 120 

gtggnggaga ttgaggacct tgtngctcgc ctggatgaac tcgngggcnt gtatctccag 180 

ncanaanaan gacngcatac aacagaccat tangangntg tcatctacan tntnanngat 240 

catntgngna cngacccatc cattaatgag gatcanggcn tccanctgat gaacgctgat 300 

cttctgcaan aagaacgttc tagntctanc nnanngccnt cancctncgn ctcttgagct 360 

cagtgngtca ngctcntaan atcttnncac ntgccaanct gtgnggnctg ccttnagnct 420 

tccggatagg cactntnatn ngacntgccc tatanttgcc ngcngnnant naaccaantg 480 

naccatngtc actctgttga catcanggcn atntgnntaa actaatnnct tngcngcact 540 

ctagtnngcg ttgncactgc ccncgtnnnc tancntacca nttcncattn cccntttaat 600 

gggnaaagan atnatcccta cnatcatatt ncccntnnaa tggattcgag ncgnaantct 660 

tnnntantna tctnaancct aaatgntcac atnnaaactt tanangncat cnnnnatgna 720 

accnancnat ggctaaangg cctcattaan gccngntttt tcaaacttga aaantgcatn 780 

ccnccattga naaagganta cacgggcccc cntgngnggg 820 



<210> 654 

<211> 768 

<212> DNA 

<213> Homo sapiens 



<400> 654 

tttnnccccn ttttgtncct nttgattcnc ttgctacntn ttcaaatcng tnggatccca 60 

tcgattcgcc acatttaagt gagatatggg aaggaggagc agattgtttt tgaagggagg 120 

aagagcagtt acttagggtc aaattaagtt gtaaaatccc ccccgggatt ttgtatgtaa 180 

gtcaaagtga attgtatttg gaagaagaac tggggagccc acctctggta ttttttttat 240 



gtccctcata tggacaaata aacctctggt attaaatgaa ttttcttttg ggggattcta 300

tatattcggg atttcaacca ccaacctatc tggtttttcc cgctgaaatg ttgggtgatg 360 

gaatcaggag agcagatttg gagactcttt atattttata attgagagag acaaagagaa 420 

aaccgtttga tttgaaaaag ttttctaggt tccctcaggt agatggaaat tttcatcaaa 480 

aacagtttat tcaaggtaca tagcctacta gtttcccatt tgagagtacc gcagaatgat 54 0 

acgacgtgta ctgcttctct acgcagaatg aagtataaaa ttagcaccna atagtacttt 600 

aatttgcagg tgctaaactt tttacatgct tnatctcatt taattcttag aagaaactaa 660 

ttttaccaag taaantgtct ggaccaacca tntgcaggtc caaaannctg gaaaaaccgt 720 

naggtttgga ctcctacata gcctnttttn taagtnncnt nntaaatn 768 

<210> 655 
<211> 752 
<212> DNA 

<213> Homo sapiens 

<400> 655 

tntncctntt gatccttgca ctannaaatc cgtggatccc tcgattcgaa ttcggcacga 60

gggtaaacct atttatataa tagaaggatg attataaaca tttaataaat tatatcaaat 120

agatattata tattaaatgg gcagataata gaaatctgtc caagcaaaac tctggataat 180

ttttatgttg ccttattttt tgttttctgt gaactccaag aaaaatgaga taccagtttg 240

gaacagatgt aatattgctg atttaacagt ttagggatac tccccaagtt caataatttt 300

gccaagatac aaatttaaat ggaacctttt atgaagcttc atagtgtgtg aagaacttac 360

cttgtttata tgtttgaaga catacatatt tcacatttca gaagagtcta tacatagctc 420

accaaatatc aaaaccacct tgttagaaaa cattaaggtc tgtcttattt atttgttcat 480

ttgnttatga gacacantct cactctgtaa tctcactctg ttgtagaggt tgagtgcagt 540

ggcacgatca cggctcactg caacctncat ctccctgact caaggaatcc ttccacctca 600

gccttccaag tagcanggac caccaggtgc accccactat gcccagctta attttttgna 660

ttttattgga cagattgggg ttttgcccat gttattcagg ctggatcctt nnggcctcaa 720

atctggcctg cc 752

<210> 656
<211> 754
<212> DNA

<213> Homo sapiens
<400> 656

ttttcctttt natcttgctc nanaanccnt ggatccctcg attcgcagag gctggttcag 60

aaaaggagga agaggcccgg ctggcagccc tggaagagca gangatggag gggaagaagc 120

ccagggtgat ggcaggcacc ttgaagctgg aggataagca gcggctggcc cangaggagg 180

agagtgaggc caagcgcctg gccattatga tgatgaagaa gcgggagaag tacctgtacc 240

agaagatcat gtttggcaan aggcgaaaaa tccgagaggc caacaagctg gcngagaagc 300

ggaaagccca cgatgaggcg gtgaggtctg agaagaaggc caagaaggca aggccggagt 360

gagtgcctgc ggcccctcac agggctgang ccagccccta tcagctggat gtggcagagg 420

catgccanag gacctaagtg tgatggacca gantcacttc tnctcctcct ttctncacca 480

gccctgaccc ctcatgctct ctggctgggc cantgggcaa ccctcgcttc cttggatgga 540

ctgcctgctg gtgcctggtc agagaanagc ctnttttccc agnctgattc tntgctccca 600

ggaaccaatt gaccatnaag gtgcaaangc cnanccaatc cccttacnta ctggccccca 660

ttnattcctg gctttttcan aagcccccnt gccaaacann ttgggacccc ctgattnntt 720

aagggtgcct tttnatnggg gttaaaggtt aant 754

<210> 657
<211> 734
<212> DNA 
<213> Homo sapiens 

<400> 657 

tntgttccnc natgaacgnt ngaancnnna tnccnttgga tcccatcgat tcgctgcggc 60 

cgcaggagct gtggcggttt tcctaatcct gcgnttatgg gtagtgcttc nttccatgga 120 

cgttacgccc cgggagtctc tcagtatctt ggtagtggct gggtccggtg ggcataccac 180 

tgagatcctg aggctgcttg ggagcttgtc caatgcctac tcacctagac attatgtcat 240 

tgctgacact gatgaaatga ntgccantna aatnaantcn tnngaactan ancgagctga 300 

ttganaccct agtaacatgt ataccaaata ctacattcac cgaattccaa gaagccggga 360 

ggttcagcag tcctggncct ncaccgnttt caccaccttg cactccatgt ggctctcctt 420 

tnccctaatt cacagggnga agccngattt ggtgatgngt tacngaccac gaacatgtgt 480 

tcctatctgn gtatctgncc ttatccantg ggatactagg aataaagaaa gtgatcattg 540 

ntactttcaa agcatctgcc gggttgaaac gatntncatg tcccnaaaga tttgttgatn 600 

tgcagctnct cantgctann gtcggttttg aanaaagttt nccaaatnnn tgtaccttgg 660

gccaatttnt ngacaantng aactgacttg tnagaatctt gcagntaacn gtcttgtntc 720 

ntccaattng ggng 734 

<210> 658 
<211> 783 
<212> DNA 
<213> Homo sapiens 

<400> 658 

ttctcctgaa acgcttngca cttccctcnc tgcaggatcc catcgattcg aattcggcac 60 

gagacactgt cccactccat cacccaggct ggagtccagt ggtgtgatca tagctcgctg 120 

catcctccag ttcctgggtt caagccatcc ctcctgcctc agcctcccca gtagctggaa 180 

ctacaggtgt gtgccatcac acctggcttt acatttttct gtggggtctt actatgttgc 240

ccaggccggt ctcaaactcc tgagctcaag tgatcctctg nctcagcctc cagagtatct 300 

gggattacat atgtcggcta ccgtgtctgg ccgttcacat ctttggccac tattngcttg 360 

tgaaaaggta tnatgaggtg gtacttatca tngttactgt gtctcatgtt nngtatattt 420 

ttgcttcatc aactaagatg cactgtaaca tctgtgaaat ctggatatat tatcaaangg 480 

tttatcatag ttttgttaac aatacactgt cgttttactn ggtgcctaan ataatggtat 540 

agttgngagg tgatcttaga tttgatgaag cacagtatgc aangtaggcc taatggnggg 600 

aaagaatggg naattttcan angcnnggaa gtatttgntn ttttgtaaat ggacttgaaa 660 

agcttgttct gnnggattgg acccaacccc tttccctttn aaaccccgaa ttctnatnga 720 

ctnttccaac ttngaaaact ttgctcnaac ttaaatacct ttnaaaaatt aaccntgacc 780 

ccg 783 

<210> 659 
<211> 741 
<212> DNA 
<213> Homo sapiens 

<400> 659 

tcttcctttg tatacctgct nttgctcttt ntgcaggatc cctcgattcg ctttgagcta 60 

ggataaaaat tgggtaaagg acatttgctt acctgcaaat gaatcactgt ggaaatgtga 120

tcttcccata tcatcaagaa acttgttttc tggatgaata ctgggagaat aaaatgagaa 180 

ctctggagtg agctaaattg atcccaatta agtttttctg cttagcagac agaaggtata 240 

attttttgac accctttccc acctggtgcc tatgctaggc ttgtcctgag aacatccctc 300 
agtaacttga tattcacatg acctacagga tgtcccatct gcagggctga gtcagttggg 360

gaacaccaga ggctacacag tagctcttcc tgctactcgg ttaatgagct tggcaggttc 420 

tttgtctcac tgaattctta tcatggaaac agcagcagca gccgctagga aatcttcaag 480 

tgtagtgtct gtgctaaccc agtggtaaat cccttagatc ccctgctggt ctctggcagt 540 

ctccttgatt ttgggtacca tgtatatttt ccgctttgac tttaacgctt tctaggatag 600 

ggtaagcacc cttaattcan gcactgtcca ttagcttcct ttgcaaaagc tacttatggn 660 

cggtcacaat ncaacactna nacagagcca aggcaatatc ctcttgccca tggctatgat 720 

gtcagacagt ggatggctcn t 741 



<210> 660 

<211> 734 

<212> DNA 

<213> Homo sapiens 



<400> 660 

tctgnnctnt gtntccttgc tcgtgttctt ttgcaggatc cctcgattcg aattcggcac 60 

gaggactgga gaagtcagaa gtagaaaagc agattgctag gagagacagg atgacagatt 120 

ttggtcagaa aatgggatat tggagtttaa agtatcaaat acagaatagt tccagatgtt 180 

cagagatcca gcatgggatt aggtactgaa atggattaga actaaaagtc actagaattt 240 

agaaattgag aaccatgaga gtggatgcaa tgacttgttg cttgattgaa aaataaatta 300 

ataataataa aggaccatga gactagcctg ttataggggt tatctccatg aacattgaat 360 

tttcccagga tcatagcagg aattgggtag agaaaaagat tatgagaagg tgccagagtc 420 

ttcagtgaat gtcaggaaat taccaggaag tcagcatatg acagagaaaa ggacagtatg 480 

ttatctgcat caaaggaaaa tgtgcttttg ttgaaaagta cagaaaaagc caatactaca 540 

atactgtgct aagcccctac ctgtactcct ctcccacagc tgcattccag ccctgtggta 600 

taaaaggtgt gagaatgagc ttttccacca gaatcagcag gtttagttaa agcatgagca 660 

gaacaagcat nctatgaaga gactgaggat gtaggtgagt ggtctaaatc tcatnnaagg 720 

acattgcagt ngat 734 



<210> 661 
<211> 762 
<212> DNA 
<213> Homo sapiens 



<400> 661 

ttnnnnnnct ccnaatcctc cngatnanat cnctttgnan ctncctgcag gatcccatcg 60 

attcgaattc ggcacgaggt ccatacatgg agctccctgg agcccgtgtg ntntcgtgtg 120 

actgaacgtt ttgtgatgaa aggaggagag gctgtctgcc tttatgagga gccagtgtct 180 

gaattgctga ggagatgtgg gaattgcaca cgggaaagct gtgtggtttc cttttacctt 240

tcagctgacc atgaactcct gagcccgacc aactaccact tcctgtcctc accgaaggan 300 

gccntngggc tctgcaaggc gcanatcact gccatcatct ntcagcaagg ngacntatat 360 

gtnnntgacc tgnagacctc agctgacnct nccttngtan ggttngatnt nggaagcatc 420 

ccaaggngat ttagngacnn tggantcctn atnactgata anacncnaac tatantnttt 480 

tacccttggn agcccaccag caagaatgag ttggagcaat cttttcatgt gacctnctta 540 

acanatatac tctgaatgaa tctacgttgt atttatcagg nggacaatgg gaataaagcn 600 

tttntaaagc accnantgga catgaaagca acagacacna ggagnnaagc cttgagacat 660 

gtctgnnntc tgaccgcatn ttgatccant gntctgtgan ganttnttca ctgaacattt 720

tcaagaggag ggtgnatacc cctggcaatn gcccnaanaa ag 762

<210> 662
<211> 745
<212> DNA

<213> Homo sapiens 

<400> 662 

nanatccnnc nantncttnt tgttcntgtc cgnangatcc catcgattcg aattcggcac 60 

gaggtttcat ttaagaagaa tganctagat anatgtgctc ttctggttac cccaccctga 120 

cagagtgcat ttttacacgg ctagcagggg ttgagactgc agcctggcct gccagccatt 180 

ggaggtgttt aaggaagggc agataatgtg actctttgcg gggtgccatc tgcttaccca 240

ttagcgagca nagggggttt ctgcgggtga cccccagcat atttctaggt tacttatggg 300

cagatttgta agtgacaaaa ctccagctga tgctgggaat ggggagaggg cccttgaggg 360 

actttgtggt tttgtgcttc tggtttcctg gccaacccca gggtcacttg tctggaggcc 420 

cagctgggca ctaatgtctg ccaccgacta tgttaaagtg tataaatgat tcctctattt 480

gggagagatc ttccaatcca gaggagcccn tcttggactg cctgggttaa atctgcatan 540

cagangtggt tgatgaagtt catctgaaga aattcagccc cacctnccca ccctgccntt 600 

cctgctccct tttgatagtg gcttctgggt actcgggcnn gtncttggga caccancctt 660 

ntctgggggt ctnaagccat cccgttgggg ctgtcggcca agcctaagtt aatcgtgtgc 720 

ctntattggg aggatngctn ntcct 745 

<210> 663 
<211> 748 
<212> DNA 
<213> Homo sapiens 

<400> 663 

taatcctntt gataanaatc cttgtncttg ctnntgancc ntcgattcga attcggcacg 60 

agggcaagtt tccaaagatc agtgtggagt gctacagaaa taattatagg agaggaaatc 120

ataatcacag aaggtataat gcttgtttga ggctccggaa taagaactaa aaaaaaacaa 180 

aaaacactgg tttcatgctt acggggtaca cactttggtg catcccgtga acacaaattt 240 

taataccaaa caatccttga tgcttcacct ggggctgcca agcagtttgt aaaacagagg 300 

aaaacattta gtgcagtctg tattatcctt ttccaacttt tctgtttgtg caagtttttg 360 

aagattcatt ggccaaacaa tgaacaacaa aggttttctg agagaagaca aggtggactt 420 

ttcattttgt tagtaaatac cagtggcact gttgaacgaa acaaatactt ttatctcagt 480 

ctttcaaatc agtattaatg tctgtgtttc cttccactga cagctcttct tctagtttca 540

ctgaaaaaag ggtgttagta tttttatctt ggacactctc ttccaaatcc ttcagcagct 600 

cctcttcttt atattctgcc acatcgacct ctaaaccgga attgtccttc agtttgccgt 660 

ggtgcttgag atantacccg ctggttctga aagaacttga tgatggtgta ctttgggaag 720 

gtcnaactgg gcanacagag tctggatt 748

<210> 664 
<211> 785 
<212> DNA 
<213> Homo sapiens 

<400> 664 

gtnnnnccnc nnaccctnnt gaatntaatc cttgttcttg ctgcatgatc ccatcgattc 60

ggtcaagctg gccctggatg tggagatcgc cacctnccgc aagctgctgt agggcnagga 120

gtgcaggctg aatggcgaag gcatatggac aagtcaacat cnntgnagng cagtccaccg 180

ncttcagtgg ctatggcgnt gccagcgntg taggcagcng cttaggcctg ggngnnggaa 240

gcagntactc ctatggcant ngncttgncn ttggatgcng cnntagtncc agcagcggna 300

nagccactgg gggtggcctn agctctgtng gaggcggcag ttccaccatc aagtacacca 360

ccacctcctt ctccagcatg aagagctaca ngcactgaan tgctgccgcc agctctnagt 420

cccacagctt tcaggcccct ctctggcagc atagccctct cctnangttg cttgtcctnc 480

cctgncctcc antctcccct gccctaccgn gnagagctgg gatgccctca ctttntnctc 540

atnaatacct gtttcactga actcctgttg cttaccatca tgtcncagtt atcagcactn 600

aaancatgct aatgnccttt tataagnccc ngtatttatt acaagnatct tgaantctgc 660

cattaaattc ttgaggaang aaaatgacct attatccccc ataaagaacc tgaaacttca 720

agnctaangt cccagcntnc aacanggaag gagntccntt tttttnattn gctaaaccan 780

tcctc 785



<210> 665 
<211> 763 
<212> DNA 
<213> Homo sapiens 

<400> 665 

ggnngntgnn nntnntaatt nctnttnaat nncantcctt ggntctngnt ntagganccc 60 

atcgattcgc tgaaccctaa aggaaagcca gcaaaccagc tgcttgctct caggactttt 120 

tgcaattgtt ttgttggcca ggcaggacaa aaactcatga tgtcccagag ggaatcactg 180 

atgtcccatg caatagaact gaaatcaggg agcaataaga acattcacat tgctctggct 240

acattggccc tgaactattc tgtttgtttt cataaagacc ataacattga agggaaagcc 300 

caatgtttgt cactaattag cacaatcttg gaagtagtac aagacctaga agccactttt 360 

agacttcttg tggctcttgg aacacttatc agtgatgatt caaatgctgt acaattagcc 420 

aagtctttan gtgttgattc tcaaataaaa aagtattcct cagtatcaga accagctaaa 480 

gtaagtgaat gctgtagatt tatcctaaat ttgctgtagc agtggggaag agggacggat 540

ntttttaatt gattagtgtt tttttcctca catttgacat gactgataac agataattaa 600 

aaaaagagaa tacngtggat taaagtaaaa attttacatc ttgtaaagtg gtggggaggg 660 

gaaacagaaa taaaattttt gcactgctna aannnaaann actttccagc naanctaaaa 720 

aactnnancc tttaaactat antgagttcg nanaccnggn ccn 763 

<210> 666 
<211> 759 
<212> DNA 
<213> Homo sapiens 

<400> 666 

nnttnnatan nngctcttgt tctttttgca ggatccctcg attcgtctag acctctgaca 60 

tcatggtgtt ttcttaatgc ctcacattgc tggcacgggg atgtgccctg cctgccagca 120 

cctaggactt cgagttgggt tgcagcttat gacatgcatg ataggttttg gaaggtaact 180 

tttaactgca aacctataaa gtactatttt ttattttata aatgaacagg gttttaacgt 240 

gctcaacttt aatttttttc aattgtatga aggccttaaa aaagctacat taagcgtagc 300 

taaaattatt tattggacta aaaactaaca gaacttcatt tccagaattt ttttttttgg 360 

caaatgttta cattcaatta aggggaaaaa gtagaaccag cacaaatgag tggcagttgc 420 

tggagcataa ctgcttcaat aaatcttcat cttggggtaa ttacaggcaa gtcattttca 480 

catcctcttg aggttcagag catcagaatg aactctatga atacatgtgt aagtgccaga 540 

cagctgaatc tttatcaggt attgnaaaga tacacatatg atatgnttat taaaattgaa 600 

ataatgtaaa acacatgaat aaatttgcaa aaccaagatc acagtccacc atatgcactc 660 

tggtacctta aatttttttt ataaataatt naaaagggaa tattggaagc ttcttaaaaa 720 

aaaaaaaaan aaaaaactcg agcctntana acttttgng 759

<210> 667
<211> 760
<212> DNA
<213> Homo sapiens
<400> 667 

ggnnttnaaa ctnctaatnc tgtnttgcag gatcccatct atncntatan angctctagg 60 

cggngcggnt cccactctcg gaaccttgtc ctgtttgtcc cccagctcgg caagcgccat 120 

atgagcctgg cggcgccaga tgcgaatcct gttctgggct ttttggccta ttcccgcccc 180 

tcagtcttgc cgggatggca ccgcccgcat aggacttcca gggttgggct gantgggagt 240 

tcgactgctg ggcctcgtaa ttctcgcttt ggggctgctc cttccaggct gggacacact 300 

ggggcccgct gtcggtctcc cgtcctccga catcttgtct ggaacttccg cctggcagtc 360 

tccagtagga gtggagctct gtgcggcgta ntttggtgga aaaacnggcc ttgcgtcggc 420

ctcaccccca gtgtttgtgt ttcagaatga agactattct cagcaatcag actgtcgaca 480 

ttccagaaaa tgtcgacatt actctgaagg gacgcacagt tatcgtgaag ggcccagagg 540

aaccctgcgg agggacttna atcacatcaa tgtagaactc anccttcttg gaaaagaaaa 600 

aaaagaggct tccggtttga cnaaatggtg gggtaacaga aaggaactgg ctacccgttc 660 

cggactattt gtaagtcntg tncagaacat gatcaaaggg tgttacactg ggctttccgt 720

tacaaagatg aangtctgng natgcttaat ttccatnaan 760 

<210> 668 
<211> 763 
<212> DNA 

<213> Homo sapiens 
<400> 668 

gntctatgtg gctctngttn ttttgcggat cccatttgac gccttggcac gagaagaaaa 60 

cccatggaaa gtagcagtgt tgtgagttgc agagacagga aagatagaag acgttccatg 120 

tgttattctg atggtcgaag tttacatttg gaaaaaaatg gaaatcacac accatcctcc 180 

agtgtgggca gctctgtaga aattagttta gaaaattctg aactgtttaa agatttgtct 240

gatgccattg agcaaacctt tcagaggaga aatagtgaaa ccaaagtgcg acgtagcacg 300 

aggctacaga aggatttaga aaacgaaggt cttgtatgga tttcacttcc acttccttcc 360 

acttcccaaa aagccaaaag aagaacaata tgtacatttg acagcagtgg atttgaaagt 420 

atgtctccca taaaagaaac tgtgtcctcc agacaaaaac cgcagatggc acctcccgtc 480 

tcagatccag aaaacagcca gggccctgct gctggttctt ccgatgaacc tggtnagagg 540 

aggaagagct tttgtatatc tacacttgca aatactaaag ccactttcca gttnaaaggc 600 

tnccggagaa gatcctctct ttaatgggga aagggagaga gctctcttga ctggccttgg 660 

gaaagggatt ggaacataat ggggagaaaa gaaagccgta attgacattt tctggcanan 720 

tcttgtnanc aagaggggna aagtnaccct tntntgcttg aaa 763 

<210> 669 
<211> 754 
<212> DNA 

<213> Homo sapiens 
<400> 669 

tgnttctaat gctngctctc gttctttctg caggatccca tctattcgaa ttgatgagcc 60 

ttattaacta tcttttcatt atgagacaaa ggttctgatt atgcctactg gttgaaattt 120 

tttaatctag tcaagaagga aaatttgatg aggaaggaag gaatggatat cttcagaagg 180 

gcttcgccta agctggaaca tggatagatt ccattctaac ataaagatct ttaagttcaa 240

atatagatga gttgactggt agatttggtg gtagttgctt tctcgggata taagaagcaa 300 

aatcaactgc tacaagtaaa gaggggatgg ggaaggtgtt gcacatttaa agagagaaag 360 

tgtgaaaaag cctaattgtg ggaatgcaca ggtttcacca gatcagatga tgtctggtta 420 

ttctgtaaat tatagtttct tatcccagaa attactgcct tcaccatccc taatatcttc 480

taattggtat catataatga cccactcttt cttatgttat ccaaacagtt atgtggcatt 540

tagtaatggg aatgtacatg ggaatttccc actgacttac ctttctgtcc ttgggaagct 600

taaactctga atcttctcat ctgttnaaat gtgnattaaa gtatctacct aactgagtng 660

tgantgtant gaaagaaagg ncatatntta aacnttgaat ttancaagcc cacnctcgna 720

ttttatgncc tttcttttgc ctngggattg aanc 754



<210> 670 
<211> 752 
<212> DNA 
<213> Homo sapiens 

<400> 670 

tgnttctaat anttgctact tgttcttttt gcaggatccc ttttgacgnc tttggcacga 60 

gaaagaaagg gctcgtgaca gagaaagaag aaagagaagt cgttcacgaa gtagacactc 120 

aagccgaaca tcagacagaa gatgcagcag gtctcgggac cacaaaaggt cacgaagtag 180 

agaaagaagg cggagcagaa gtagagatcg acgaagaagc agaagccatg atcgatcaga 240 

aagaaaacac agatctcgaa gtcgggatcg aagaagatca aaaagccggg atcgaaagtc 300 

atataagcac aggagcaaaa gtcgggacag agaacaagat agaaaatcca aggagaaaga 360 

aaagagggga tctgatgata aaaaaagtag tgtgaagtcc ggtagtcgag aaaagcagag 420

tgaagacaca aacactgaat cgaaggaaag tgatactaag aatgaggtca atgggaccag 480 

tgaagacatt aaatctgaag gtgacactca gtccaattaa aactgatctg ataagacctc 540

agatcagaca gaggactact gttcgaagat ttttggaaga atactgagaa cggcataaag 600 

tgaagatcga catttaaaaa atgaggtgaa agaaagctnt tgtggcatag aaaaagtntt 660 

aagctcaant agttttttta ttattattat tattaaaagt tattcaggac tgatgtgact 720 

ncngatttna gaacatgtgg taatagtnta nt 752

<210> 671
<211> 752
<212> DNA 

<213> Homo sapiens 
<400> 671 

tgnttctaat gttgctactc gttcttttgc ggatcccatn ttattcgaat tcggcacgag 60 

gatattcaca cagtatgtat tatattaacc atatcacact taagttatta aattcagact 120 

atttgtaact tattgttata gggcctgccg tatggcttag gatatttgag taatcatata 180 

tttaaagtaa aaactttggg ctgggcacag tggctcacac ctgtaatccc agcacttggg 240

gaagctgagg tgggcagatc agttgaggtc aggagttcta gaccagcctg gtcaacatgg 300 

cgaaacccca tctctactaa aaatacaaaa attagctggg cgtggtggca cacacctgta 360 

atcccagtta cttgggaggc tgaggcacaa gaatcgcttg aacccgggag gcggaggttg 420 

cagttagcca agatcgccct gctgcactcc agcctgggca acagagggag actctgtctc 480 

caaaaacaaa aacaaaaact gttagtgaag gttccctggg acttttgata ttttaaaaat 540 

tggtcttatg actagtagat aaattcattg ccataatgag gctagctccc agataaacag 600 

tgtattttct tctttttttt ttttggtgag tggtccaaac tttaagctac tttttccagt 660 

antttgccac tttctccgan gtaantttgg ctggtctttn agtaatgcta attgngtgtc 720 

aaaatttgtc tacaacagtt nggcaacaga tn 752

<210> 672
<211> 792
<212> DNA
<213> Homo sapiens

<400> 672

tgnttctaat actngctact ngttctttct gcaggatccc tctattcgaa ttcggcacga 60

ggctgcttct ggctgggggg tccttggcct tcatcctgct gagggtgagg aggaggagga 120

agagccctgg aggagcagga ggaggagcca gtggcgacgg gggattctac gatccgaaag 180

ctcaggtgtt gggaaatggg gaccccgtct tctggacacc agtagtccct ggtcccatgg 240

aaccagatgg caaggatgag gaggaggagg aggaggaaga gaaggcagag aaaggcctca 300

tgttgcctcc acccccagca ctcgaggatg acatggagtc ccagctggac ggctccctca 360

tctcacggcg ggcagtttat gtgtgacctg gacacagaca gagacagagc caggcccggn 420

ccttctgccc ccgacctgac cacgccggcc tagggttcca gactggttgg acttgttcgt 480

ctggacnaca ctggagtgga acactgnctc ccactttctt gggactttgg agggangtgg 540

aaccggcaca ctggacttct tccgtctcta nggctgcatg gggagccctg gggagcttna 600

atnnttgggg gatcccnnaa aangaccccc tgtcccccat anacttgggt ttttngcttt 660

canccctttc cccttggccc cnnttgacca cttcatggag tttaattaaa atngcccttg 720

gtangaaaan anaatantnt tcctcntttt antgntnttt tnntataatt tnatnatcct 780

antnatcntn nt 792



<210> 673 
<211> 755 
<212> DNA 
<213> Homo sapiens 

<400> 673 

nttctaatnc tngctacttg ttctttntgc aggatccctc gattcgaatt cggcacgagg 60 

cagcttcgag ccaatggtga gctccttctg gatcagctcc ttcagctcct tcttgctcag 120 

gatgctgaaa ttgcaaggct gatggaagac ttggaccgga acaaggacca ggaggtgaac 180 

ttccaggagt atgtcacctt cctgggggcc ttggctttga tctacaatga agccctcaag 240 

ggctgaaaat aaatagggaa gatggagaca ccctctgggg gtcctctctg agtcaaatcc 300 

agtggtgggt aattgtacaa taaatttttt ttggtcaaat ttaaaaaaaa aaaaaaaaaa 360 

ctcgagcctc tagaactata gtgagtcgta ttacgtagat ccagacatga taagatacat 420

tgatgagttt ggacaaacca caactagaat gcagtgaaaa aaatgcttta tttgtgaaat 480 

ttgtgatgct attgctttat ttgtaaccat tataagctgc aataaacaag ttaacaacaa 540 

caattgcatt cattttatgt ttcaggttca gggggaggtg tgggaagttt tttaattcgc 600 

ggccccggnn gccaatgcat tgggccccgg tacccaactt ttgttccctt tantgagggt 660 

taattgcncc ccttggccgt aatcatggta atagctgttt cctggtgnga aattgtttcc 720 

cgtnacaatt ncacacactt ttcancccgg ggacn 755 

<210> 674 
<211> 753 
<212> DNA 
<213> Homo sapiens 

<400> 674 

tgcttctaat gcttgctact cgttctttnt gcaggatccc tcgattcgca gatttttgac 60 

aaggaaggct aattctaaac ctgaaagcat ccttgaaatc atgcttgaat attgctttga 120 

tagctgctat catgacccct ttttaaggca attctaatct ttcataacta catctcaatt 180 

agtggctgga aagtacatgg taaaacaaag taaatttttt tatgttcttt tttttggtca 240 

caggagtaga cagtgaattc aggtttaact tcaccttagt tatggtgctc accaaacgaa 300

gggtatcagc tatttttttt taaattcaaa aagaatatcc cttttatagt ttgtgccttc 360 

tgtgagcaaa actttttagt acgcgtatat atccctctag taatcacaac attttaggat 420 

ttagggatac ctgcttcctc tttttcttgc aagttttaaa tttccaacct taagtgaatt 480 

tgtggaccaa atttcaaagg aactttttgt gtagtcagtt cttgcacaat gtgtttggta 540

aacaaactca aaatggattc ttaggagcat tttaatgttt attaaataac tgaccatttg 600

ctgtanaaag atnanaaaac ttaagctttg ttttactaca acttgtacaa agttgtatga 660

cagggcatat tctttgcttn caanattttg ggttgggggc actangggtt caaaaccctg 720

gcanaattgt cnactttagn ctgaccataa tnc 753



<210> 675 

<211> 760 

<212> DNA 

<213> Homo sapiens 

<400> 675 

tgntttctaa acnttgctct cgttntttnt gcaggatccc atctattcga attcggcacg 60 

aggttccctc accttattcc tccaagttcc cccttgggaa cctctgagat taacttgata 120 

agctccttgg gcaagctctt tatcctaaga ttcctcagtg agccttatag agttgctgcg 180 

agaattacat ttgttcatga tgtcaagtgt ctggtatgta gctaatgctt attgaacaca 240 

tagtaattta ttgaataatt gtcatgatca ctggatgaga tatagccact gtggaggtag 300 

gcacaccagg gttttagagg cttgggatct tgcaacagga ttttcctctt gcctctccaa 360 

actgcccttt gcccagatgg cttcagcatc tttttgcatc cctgtttcct tgtttggtga 420 

acacctgtct caacctgtct gcaaggcgtg gtgagattct gcatccttgg taagcactca 480 

tgtcactcca aaacagctgt ttgatgctaa tagcacacat gaggtcttgc aaatttgtct 540 

gaggaactac aggacattgg agagatattt atcaaacacc cactacatgc ctgatactta 600 

actanggaac tatnaaagtg ggtggtgaag acaagtgnga agtaaantgc aaacctattt 660 

ccatatatgt ttgnncgcta gattgntncc ancaattngc ntcttggaat tgttgaattn 720 

ggccctgtgt gtgtgcctgt ggtaantgga nntgngtttc 760 

<210> 676 
<211> 751 
<212> DNA 
<213> Homo sapiens 

<400> 676 

ntttgaaact tnctactngt tctttttgcg gatccctcna ttcgaattcg gcacgaggca 60 

gaaccttttc ccctctactc ttgtctaaaa gttctgtgtg gcacacagag atgcgaccta 120 

ctcaatctga cttagtaaaa ccatgctgta gaatttttgt cttaaaaaga ccacataccc 180 

agcacccatg aaataaaaga ttcatctgta attgggattc aaagtgatta aattcctttg 240 

ttcatactca taaatagcac taaagtgtta taacattttc atttacctat ttttagttcc 300 

ttcattttaa cttaataaaa atcttggatt gatattcttt tttttttttt ttgggacgga 360 

gtctcgctct gtcacccagg ctggagtaca gtggctctat cttggctcac tgcgagctcc 420 

gcctnccggg ttcacgccat tctcctgcct cggcctgccg agtagctggg actgcaggcg 480 

cccgccacca cacccggcta atttttttgt atttttagta gagacggggt ttcaccctgt 540 

tagccaggat ggtctcgatc tcctgacctc gtgatccacc tgcctnggcc tccaaagtgc 600 

tggaattnca ggcgtgagcc accgcgcccg ggnctaaatt ggatattctt taaccattaa 660 

aaggtttact gggtgnccna tttgccatat tattggaaac ttggaaaggg taatttgaaa 720 

caaagntttg aagttaactg aaatttgggg a 751

<210> 677

<211> 756

<212> DNA

<213> Homo sapiens

<400> 677

tgctttgaat cctttgtaan cgccctntnt gcatgatccc tcnattcgaa ttcggcacga 60

ggataaactc ttcagtgacg aatattagaa ttagttagtt atacatttga ggaaaactat 120 

aaaagtacca ataatgagta ggaaatcact tctgcagtat ttttggagca ttttccttaa 180

gcatgacata aaagccaaag gtcacaaggg aaaaaactga tagatttgtc tgtgatattg 240 

agagatgtat gcacatatac atacaacagt catagtaaga caccgttaga caaaaggtga 300 

tgtatgaaaa agaggcaaaa caacaagaag aaaagattga aaaaatgaga gctgaagacg 360 

gtgaaaatta tgacattaaa aagcaggcag agatcctaca agaatccagg atgatgatcc 420 

cagattgcca gcgcaggttg gaagccgcat atttggatct tcaacggata ctagaaaatg 480 

aaaaagactt ggaagaagct gaggaatata aagaagcacg tttagtactg gattcagtga 540 

agtttagaag cctgaaactt ttctcgtatg gggtggtttt tgcattaaat nctggggtcc 600 

attttacaat ccattatttt tgaccactgc tatgtgttca agtagtatga gaatgtgatt 660 

gntnttatct ggntcatata tatttctttg gctaatttaa tatgtcaaat aaatgagttc 720 

atttaaaaaa aaaaaaaaaa acccggactg ttttnt 756 

<210> 678 
<211> 756 
<212> DNA 
<213> Homo sapiens 



<400> 678 

gnnnnnnnnn nnnttnnaat agnnagctac ttgttctttt tgcaggatcc catcgattcg 60 

aattcggcac gaggggtgtt ggagcagatt gtagttgatc cacagcaaag agcatcacca 120 

aagccattcc aggaggaact agatccacca cttcctctgc tgggcatgct ccaaaaatgg 180

ttgtggcttc cagagaggac tccaaaagaa agcacaaaaa ctagacagtg ggagggcata 240 

cccaaaagcc ctgagtttct gaaaaaatat tgaaagtttc tatggtgaaa taggaagtta 300 

atgtgcttag gaagaaaaaa gtggtaatga ttcaaggaaa cataatcaca cacggtttta 360 

gttttaatgg acatgggagg agccataaaa gtagtctatc tatcatcagt tacatatcta 420 

atgaactgtc tatctgggat accctatcct gttttaatct gagtgactct ctctcagctg 480 

agagagctgg acagactcca ttttagcctc ttcacttgca gtccccttat ccccctccct 540 

taagggaata actagtgcaa gctgacttca agcacattca ggaatgcact tactgataag 600 

atattgaggc aagctgtacc agcagcttct gggggacctg ctcantggat ggtcccaacc 660 

cctgcattta tctctttggg atagtttaag cccctgnacc tggaactgng tatttttctg 720 

tactatctct gtancattaa tttttttact ttttgg 756 

<210> 679 
<211> 747 
<212> DNA 
<213> Homo sapiens 



<400> 679 

tctaatnctt ggctctcgtt ctttctgctt gatccctcga ttcgaattcg gcacgagaaa 60 

tgactccctg caaaacccaa cccatgctgc tggctgtggg atttttggtg taagcctatc 120 

tatgcactct atcagccaga atttggcatt tagctcttag ttaaatctag taaaggacag 180 

tctattgttt aaagagaagg tgcatttgtt cctcaatcaa gcaagagcac ctgtgttgta 240 

ctgctttata tctcatgtat atttatagta atgaaaagac tttttaaatt gtacacgttt 300 

cagtgccttt cttgtgttat gaaaggcagg tagatattat agccataggt aaaaatccat 360 

agttaaattg cacactgacc ttaaatctct ctgtgtatgc ccttgtatct tgcatgttaa 420 

aagttggatt attgggcatg tgtggcagcc tgccctgcta catgctagac aagtgtgctt 480 

tagtacatag ccacaagttc ttcattcttt aaaatgtttt gacagatcat ctcataataa 540 

aaataattca ngaaaactat ggggaaatag ttacatttca caaaagatat tttaaactct 600 

ttgtaaaact tagataatag agcctancaa gttactttgn atctaattgg atacatttta 660

tgnttaattt taccaccata cattttatta atcaaaattg gttagcatgt gactcttttt 720

ggcttcanaa gttntcaaaa aaattat 747 

<210> 680 
<211> 750 
<212> DNA 

<213> Homo sapiens 
<400> 680 

ttctaatnct tggctctcgt tctttctgca ngatcccatc gattcgaatt cggcacgagg 60 

accggctggg cctacaaaaa gatcgagctg gaggatctca ggtttcctct ggtctgtggg 120 

gagggcaaaa aggctcgggt gatggccacc attggggtga cccgaggctt gggagaccac 180 

agccttaagg tctgcagttc caccctgccc atcaagccct ttctctcctg cttccctgag 240 

gtacgagtgt atgacctgac acaatatgag cactgcccag atgatgtgct agtcctggga 300

acagatggcc tgtgggatgt cactactgac tgtgaggtag ctgccactgt ggacagggtg 360 

ctgtcggcct atgagcctaa tgaccacagc aggtatacaa gctctggccc aagctctggt 420 

cctgggggcc cggggtaccc cccgagaccg tggctggcgt ntccccaaca acaagctggg 480 

ttccggggat gacatctctg tcttcgtcat ccccctggga nggccaggca gttactcctg 540 

aggggcttga acaccatccc tnccactagc ctctccatac ttactcctct nacagcccaa 600 

attcttgaaa gttgtctccc ttgacccttc tttaatggca acttaactga anaaagggat 660 

gtncncttat atccaaaatt cagctatttg gcaaataaac canatggatt aaaaaaaata 720

attntntctt aananaaana actccggcct 750 

<210> 681 
<211> 748 
<212> DNA 
<213> Homo sapiens 

<400> 681 

ctaatncttg gctctcgttc tttctgctng atccctcgat tcgaattcgg cacgagccca 60 

gctgctcagg aggctgaggc aggagaattg cttgaaccca agaggcggag gttgtggtga 120 

gccgagattg cacctttgta ctccagcctg ggcaacgagc aaaaaactct gtctcaaaaa 180 

aaaaaaaaaa aaagaaaaag aaaaatggct tccaggacag agcatgctca tttgctggcg 240 

gacagttcca gaaacagacc ctgttagtcc ttctacttac ctgctggatt tttcaagcac 300 

taaatttata actttttgaa acaaaataat gtgtaatttt ccatttgggg gcaaactcta 360 

ttcttgtgag cattattaaa atcttgtttg taaatatatt gtctttctct taatatttgc 420 

tctgggtcan gaagaagctg ttcacggtgt gataatactc tttanattgt gctttcatta 480 

ttatagatgc atcatgtctt ctgctttcac gtgtctggga tggggtcaga aatgcatnct 540 

ccagntgaca naaaaatccn agnatgagat caanaaggat actggtgttt tctgactttt 600 

acaaaaatta ctttgntgtt ttcattaaaa aaaaagcttt aacctantgn ttncntantc 660 

cttttagaaa ntattaaatt tnaaaatgaa ttcnatanaa atanaannac naaaaaactt 720 

nntnccttta naactttagt gangcgtn 748 

<210> 682 
<211> 755 
<212> DNA 
<213> Homo sapiens 

<400> 682 

ctaatgctng gctttcgttc tttctgcagg atccctcgat tcgaattcgg cacgagcagg 60

agcaatcaat tcctgtcgaa gtgaatacca tgcagctttt aacagtatga tgatggaacg 120

catgaccaca gatatcaatg cactgaagcg gcagtactct cgaattaaaa agaagcaaca 180

gcagcaggtt catcaggtgt acatcagggc agacaaaggg ccagtgacca gcattctccc 240

gtctcaggta aacagttctc cagttataaa ccaccttctt ttaggaaaga agatgaaaat 300 

gactaacaga gctgccaaga atgctgtcat ccacatccct ggtcacacag gagggaaaat 360 

atctcctgtc ccctaccgaa gaccttaaga cgaagctcaa ctncccgtgg cgaactnaca 420 

tccgagtcca caaaaagaac atgccaagga ccaagagtca tncgggctgt ggggacaccg 480 

tanggctgat agatgagcag aacgaggcca gcaagaccaa tgggctgggg gcagcagagg 540 

cattcccctt tggntgtcan gcgacagctg ggagagaang caagnaagcc ctgaangcna 600 

gtccaggagg accnncnaag ggcagtttcc ggagcccgnt gttccggaga tgctgatgtg 660 

ggntgtgtct gcanttcang gccaaanttg gggacccctg ggaactgtac cctangggnt 720 

ncttgnagnt taaaacttga ccttaanggn ngcct 755 

<210> 683 
<211> 755 
<212> DNA 
<213> Homo sapiens 

<400> 683 

ggntttnnnt ctttctaatg cttggctctc gcctntctgc ttgatcccat cnattcgaat 60 

tcggcacgag aattagtatc aacttacaat ccaagtccaa gtatcatctt ataatcactt 120 

ttttctacta tattaagatc taatgaattt gatttctttt ttgaagtttt ttcttgtaac 180 

atctgagatt agaagtttaa gatcacttga ccccaaacct ttgtttatgt aagaattttt 240 

aaacataaaa gtgtttgttt ctgttatgtt accataattt gatgtatata gtgtccagat 300 

ccatttagaa atttaatatt tattaataac tgaaactgtt tgtcttcctt tggtatatag 360 

tctcgcatat tatattatag caggccaaga taaaattttg acagctcttt aagcccacat 420 

gcagcagtgg gtcagataac cctgtggcag tgacacgggc aaattggcat ttgaataaag 480 

ccctgggacc acctcaacat gcgtagcctc ttgtcttaaa tgtactcccc atggcagcat 540

ggaggaggca agacctgtgg gtcaattttg aactggnctt actttgattt taaaacaaga 600

gactcagggg aaagtactaa accaaaaact ctgattntac tttgcgtttt ctggaagtnn 660 

ttggtttact gagatgcttt tgtaaaggaa aataatgctt gngacanttt agtaatttct 720 

acanaattcn ttaatatttc ttcctcntgg gcttn 755 

<210> 684 
<211> 774 
<212> DNA 
<213> Homo sapiens 

<400> 684 

ggntttnann ctttnnaatn cctttgctnc tcgntctttn tgctggatcc catcgattcg 60 

caagatctgg aggaatgcag agaggaactt gatacagatg aatatgaaga aaccaaaaag 120

gaaactctgg agcaactaag tgaatttaat gattcactaa agaaaattat gtctggaaat 180 

atgactttgg tagatgaact aagtggaatg cagctggcta ttcaggcagc tatcagccag 240 

gcctttaaaa ccccagaggt catcagattg tttgcaaaga aacaaccagg tcagcttcgg 300 

acaaggttag cagagatgga tagagatctg atggtaggaa agctggaaag agacctgtac 360 

actcaacaga aagtggagat actaacagct cttaggaaac ttggagagaa gctgactgca 420 

gatgatgagg ccttcttgtc agcaaatgca ggtgctatac tcagccagtt tgagaaagtc 480 

tctacagacc ttggctctgg agacaaaatt cttgctctgg caagttttna ggttgaaaaa 540 

acaaaaaaaa tgacatgggt gcagaagctt gtaacattga tcacattctt aatgtaaatg 600 

gtgtctttct tctggggttt cagtatttgc aaagaaantg aagaagaatt ctggaaatgc 660 

cattcaatta accctnagga aaaaagccga ccttanaaat ttaccttant gcnttgnnnn 720 

ttaaaaanaa aaaaaantna aaaaactttn accctttana ccttttgtgg ggnc 774

<210> 685
<211> 759 
<212> DNA 
<213> Homo sapiens 

<400> 685 

ggntttnnan nctttctaat ncttggcttn agttcttttg caggatccca tcgattcgaa 60 

ttcggcacga gagtacccag agttgcgagg agttttttaa ctgatttagc cnnntggcaa 120 

tcatgagtga atggatgaag aaaggcccct tagaatggca agattacatt tacaaagagg 180

tccgagtgac agccagtgag aagaatgagt ataaaggatg ggttttaact acagacccag 240 

tctctgccaa tattgtcctt gtgaacttcc ttgaagatgg cagcatgtct gtgaccggaa 300 

ttatgggaca tgctgtgcag actgttgaaa ctatgaatga aggggaccat agagtgaggg 360 

agaagctgat gcatttgttc acgtctggag actgcaaagc atacagccca gaggatctgg 420 

aagagagaaa gaacagccta aagaaatggc ttgagaagaa ccacatcccc atnactgaac 480 

agggagacgc tccaaggact ctctgtgtgg ctggggtcct gactatagac ccaccatatg 540 

gtccagaaaa ttgcagcagc tctaatgaga atattctgtc ncgtgttcaa ggatcttatt 600 

ggaaggacat cttacagctt ccaatgagaa gccaagaagt tgtgaacata ctgattgaaa 660 

aaagacttta ttttaatccc tcattaaaan ggttttaaat gttaaaaaaa aaaaaaaaaa 720 

acttcgagct tttaaactat ngtgagtcga ttcntataa 759 

<210> 686 
<211> 749 
<212> DNA 
<213> Homo sapiens 

<400> 686 

ggnnttnnnn nctttgaaat cccttngctn ctagcncttt ttgcaggatc ccatcgattc 60 

gaattcggca cgagggcaat tagcctcgct taagttgcct tttttacaca ccaaaacttt 120

ttacatgaag ggctggtttc acatgaatac tatactgaaa tctgtgctct caagatctag 180 

cagtgaccag ggctgcccgg cgggggctct cctggcaagt caggaaggtt tctgttgcta 240 

atataacata gaaacacatt agtgcactgg gcctctctga ggtcagcata tttgtactct 300 

tggaatattt gtttttttct tcagtaacaa cagaaacccc agttgggagt ttaacaaata 360 

actgactacc actcactcat gcatttttat ttccaattaa agcaaagcac tgtgctgtgc 420 

tcagataata atagtttgta agtaaaagtt tttagttttc agtgttcagg ttatagaata 480 

taactgacca taaaaattac ctgcaggtat tttcttttta tgaacttgtt tttaaattac 540

caagtaatta ctggtgtcat tttgttttat gacagacaca cgtatctaac aaacaaacaa 600 

acagtgacct tctccatggg tcaaggactt ccttacaatt tctnctgagt taacttttgt 660 

gaaaataatc ctaaggtttt ctggcttatt gaggaaattn ctacaaacaa caaaccaaca 720 

acngaagaga agatcatcaa ccactgttt 749 

<210> 687 
<211> 760 
<212> DNA 
<213> Homo sapiens 

<400> 687 

ggnnttctaa tgctttctaa taccttggct ctngctcttt ctgcaggatc ccatcgattc 60 

gaattcggca cgaggaaatg tgtatttcag tgacaatttc gtggtctttt tagaggtata 120 

ttccaaaatt tccttgtatt tttaggttat gcaactaata aaaactacct tacattaatt 180 

aattacagtt ttctacacat ggtaatacag gatatgctac tgatttagga agtttttaag 240 

ttcatggtat tctcttgatt ccaacaaagt ttgattttct cttgtattac attttttatt 300 
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tttcaaattg gatgataatt tcttggaaac attttttatg ttttagtaaa cagtattttt 360 

ttgttgtttc aaactgaagt ttactgagag atccatcaaa ttgaacaatc tgttgtaatt 420 

taaaattttg gccacttttt tcagatttta catcattctt gctgaacttc aacttgaaat 480 

tgtntttttt tttctttttg gatgtgaagg tgaacattcc tgatttttng tctgatgtga 540

aaaagccttg gtattttaca ttttgaaaat tcaaanaagc ttaatataaa agtttgcatt 600 

ctactcanga aaaagcatct tcttggatat gtcttaaaat gtatttctgt cctctataca 660 

naaaagttct taaattgatt tttacagtct ggaatgcttg gatgntttaa aatantaaca 720 

ttttatattt tttaaaagac aaancttata ttnatcctng 760 

<210> 688 
<211> 752 
<212> DNA 

<213> Homo sapiens 
<400> 688 

tgntttctaa tgcttctaat agcttggctc tngttctttc tgcaggatcc catcgattcg 60 

aattcggcac gagacaaaac ctacagatgg agataaaaat tactactgtt attcaacatg 120 

tgttccagaa ccttattttg gggagtaaag tcaattgggc agaggatcct gcccttaagg 180

aaattgttct gcagcttgag aagaatgttg acatgatgta ataagaattc atttctgaca 240 

tattttacat ttctggcaat ctcaactctt atttggaata cttctgtgca tttgtctgtc 300 

caccgtaatt ttagaaaagc atatccataa cgtttacagt tgtagtacag ttgtggttag 360

ttatttgtag tgggattgaa agtaattttt ttctttttat atttctatat ttagtttgtt 420

tttttgttgt tgttgttttt tgagatggag tctcgctttg ttgcccagac tggagggcag 480 

tggcgcgatc tcggctcact gcaacctctg cctcccgggt tcaagcagtt ctgcctcagc 540 

ctnccaagta gctgtgacta aaggtgcacg ccgccatgcc canctaattt tttggatttt 600 

aagtagaaac cgggtttcac ccgtgttgcc caagctgctc tnaaaactcc tgagctcaag 660 

cagtccaccc gncttngcta ccggantgct aggattcaga cgtaagcccc cgaancctgg 720 

ctagtttgcn ttnttttctn tcattttata ag 752 

<210> 689 
<211> 806 
<212> DNA 
<213> Homo sapiens 

<400> 689 

gtgntttcta atgcttctaa tngcttggct actcgttctt tntgcaggat cccatcgatt 60 

cgaattcggc acgaggannt ctntgctatn gaacagnggc tggtnnacac tnnggantta 120 

nnnntgnacn ntannnattg nancanntan tactggnnnt ccntaatncn nttaatgtna 180 

cntnttgcaa gnngnnctga tnaaatacac gacaggaggg aaanctantg cgtcataggc 240

acaggcagac ctaccgnnta aggagatnat ntnccnnang gntggctgtt gagnncatgc 300 

aactctggna tgtatttccc tttataggac caccttgtnc atngtggata aagcccctaa 360 

agnaggatgn naaagatgat cngatccaat acgttacnct gacannaaan nntgtnatac 420 

ntcngctgan caatctntcc ancnnntnta atatcgtgna tcacctaggg tgtatgatcn 480 

taggaactct gcncctncan tcnggactgt ccatcacnga ctnntgggct nctactgtac 540 

antangcgna gaanancnnt cannctacan ntaaccagat tggtgctgnn anatggtant 600 

gcnntttnan cncccacgac ncaataaagn ncnnctntnc cccanancct ntnnagggaa 660 

gaaaggaatt ttncatagtg ggctcaatga anggggtacc cttggncttt ntaaaaaacg 720 

ttncatggnn cctaccttaa acctgngtna actnanancn nttngncata angggtctaa 780 

cgnctatang gggnacnnat ttttnc 806

<210> 690
<211> 772
<212> DNA
<213> Homo sapiens

<400> 690

ntntttgaat ctttgaaata cctttgctat ngttctttnt gcaggatccc atcgattcga 60

attcggcacg agaggttgct cacctgaagg agcacaggag ggttttccag gccatgtggc 120

tcagcttcct caagcacaag ctgcccctca gcctctacaa gaaggtgctg ctgattgtgc 180

atgacgccat cctgccgcag ctggcgcagc ccacgctcat gatcgacttc ctcacccgcg 240

cctgcgacct cgggggggcc ctcagcctct tggccttgaa cgggctgttc atcttgattc 300

acaaacacaa cctggagtac cctgacttct accggaagct ctacggcctc ttggacccct 360

ctgtctttca cgtcaagtac cgcgcccgct tcttccacct ggctgacctc ttcctgtcct 420

cctcccactn cccgcctacc tggtggccgc cttcgccaag cggctggccc gcctggccct 480

gacggctccc cctgaggccc tgctcatggt cctgcctttc atctgtaacc tgctgcgccg 540

gcaccctgcc tgccgggtcc ttgtgcaccg tccacacggg cctgagtttg gacgccgacc 600

cctacgaccc tggagaggag gacccagccc aagacccggg cctttggaaa acttccctgt 660

gggaagcttt aagnnccttc nanangccac ttacccaacc ttgaggggnt ccaaangccc 720

gccanccggt nattaaccaa ggccctggnc aatgcctgaa ggtcaaacaa tn 772



<210> 691 

<211> 755 

<212> DNA 

<213> Homo sapiens 

<400> 691 

ntgctttcna atctttntaa atgcctttgg cttctcgntc tttctgcagg atcccatcga 60 

ttcgaattcg gcacgagaaa aagtaaagct tttcatgagc acaaatncct tgcattgttt 120 

gatgttactg atattcgtaa aatgaatatt ttttgttttg ttttgtttta tttttttgag 180 

acaagtcttg ctttgttgcc caggctggag tgcaatggca tgatcttggc tcactgcaac 240 

ccctgccttg cgagttcaag tgattcttct gcctcagcct cctgagtagc tgggattaca 300

ggcgctcacc accacaccca gctaatttct gtatttttag tagacacagg gttttaccat 360 

gttggccagg ctggtctcaa actcctgacc tcaaactcct cacacctgta atctcagcac 420 

tttgggaggc tgaggtggaa ggatcacttg aagccagagt ttgagaccag cctgtgcaac 480 

acagcaagac cccgtctcta caaaaactta aaaaattagc tggctgtggt gttgctcacc 540 

catagttcca gctactcggg aagctgagca ntaagatcac ttgagcccan gaggccnatg 600 

cttncantga actgtgattg tttccantac agnccacctg ggtgacanag taaanaaaan 660 

gaaacattac ataatttggc tagagcataa taaattgatt tctgggttnt gaaattnnag 720 

ttgccataaa aggnntttna atgngcnant tcant 755 

<210> 692 
<211> 748 
<212> DNA 
<213> Homo sapiens 

<400> 692 

tgnttttaat cnttctaatn cttggctctt gttctttttg caggatccct cgattcgaat 60 

tcggcacgag gtccgaagaa aaagactgtg gtggcggaga tgctctctcc aatggcatca 120 

agaaacacag aacaagtttg ccttctccta tgttttccag aaatgacttc agtatctgga 180 

gcatcctcag aaaatgtatt ggaatggaac tatccaagat cacgatgcca gttatattta 240

atgagcctct gagcttccta cagcgcctaa ctgaatacat ggagcatact tacctcatcc 300 

acaaggccag ttcactctct gatcctgtgg aaaggatgca gtgtgtagct gcgtttgctg 360 
tatctgctgt tgcttctcag tgggaacgga ctggaaaacc tttcaaccca ctgctgggag 420

agacttatga attagtgcga gatgaccttg gatttagact catctccgaa caggtcagcc 480

atcacccacc aatcagtgca tttcatgctg aaggattaaa caatgacttc atctttcatg 540

gctctatcta tcccaaactg aaattctggg ggaagagtgt agaacagaac ccaaaggaac 600

catcaccttg gagctncttg aacacaatga ggcatataca tggacaaatc cacctgctgt 660

gtgcataata tcattgnggg taaactgtgg atcgaacagt ntggcaatgt ggaaattnta 720

accncagact ggggacaaat ntgtgttg 748



<210> 693 
<211> 881
<212> DNA 
<213> Homo sapiens 

<400> 693 

tgnnnngtna accagggaaa agctnngttt gaactccttg ggcatgatcc catcgattcg 60 

aattcggcac gaggcggtga cccacgtgtc cttttgattg ccctactgct gtggagacct 120 

cgtgctgacc atctggcagt gntcttcgta ttctctggcc tgtggggcgt ggcaagatgc 180 

ccgtctggca gacacaaaac aatgctctct acggcgttct gtttganaag agcaaggaag 240 

ctgccttcgc caattaccgc ctgtgggagg ccctgggctt cgtcattgcc ttcnggtaca 300 

gcacgttttn gtgcntgcac gtcaagctct acattctgct gggggtccng agcctgacca 360 

tggtggcgta tgggcttgtg gantgcgtgg agtcccaaga accccgaatc anacccnact 420 

ctttcaggac aggtcaanca agtcagagga tgaagaanat tcanacaaan atgtgatanc 480 

cngngaggcc naangaggan naantnataa aagcaccagc cagaagaatt ttcttanaan 540 

atgcctnagg gacatatcan ccggggttct cattacccat cttaancncc anatttngnc 600 

ccattcttga aataagantc nttgnttnaa ttntcaactt ctttttatgg tnatttcnat 660 

ntatctantt antaaaacca caaatntgtt nncnatnacc accanttctt ttaaaccatn 720 

tagnaattca aangntgtgt nnttacnaat ntntaanggg ttattcaaan ttcnaaattt 780 

taaanattnt tatgcantnc ncacaatnta tataanangg tcctnaaaac gngnnccaat 840 

atnncannnc nataatntag nanatntntn nnccntgtan n 881 

<210> 694 
<211> 742 
<212> DNA 
<213> Homo sapiens 

<400> 694 

atngcttggc tctngttctt tctgcaggat cccatcgatt cgaaaattta tagtaatgac 60 

aaatgactta tcagtgttca tcatctgaaa gctaagtggt tcgttcaatc actttttcaa 120 

agttgatagt agattgcatg gtttcatgtt tcctcatatt ggtttattaa ttctatttaa 180 

tcaaggaaaa taacttcaga ttccataaag tttcagttta tttttagttt actactaggt 240 

gagatagcac attacatact tttactatca aatattattt tagcagcttc ccatagtacc 300 

aaatgatttg attccctact ctcatttttt aaagcatata aatatttatg ggcttaaaaa 360 

gggggttttt aaaaactgag gatatcanta ataaattgca gaatattttg caaagctttc 420 

ttttggaaag caaacttttg tgcctgccta tatgcnaagt attttatcag ggacttgaac 480 

aaagacctca ctctttttca cttgtcttat gtcgagagaa aaggttattg gcagncacat 540 

tcctaanact ggggaatggt gtgtnctttt naaatttgaa gataactttt agggtaatta 600 

tggaaactcc tcaaangagg ganaaagtna tttttttcca gacatttttc ctcaattctg 660 

ggtctttcac acactanntt tccatagtnc nagaatttct gnntttttac catttgggct 720 

gtgaaatgtt cacaatntcn ng 742 



<210> 695 
<211> 745
<212> DNA 

<213> Homo sapiens 



<400> 695 



tttcaaatng cttggctact tgttcttttt gcagggatcc catcgattcg aattcggcac 60

gaggctagac gaagtggtga agcccaaaga cttatttttg agctcgctgt aagactgaga 120

aatcacgtag tccttcctga aaccactaag aggaaaaatg tctgtgacac tgcatacaga 180

tgtaggtgat attaaaattg aagtcttctg tgagaggaca cccaaaacat gtgagatgga 240

gtctcgctgt gtcccccagg ctggagtaca atggcgcgat ctcggctcac tgcaacctcc 300

gcctcctggg ttcaagcaag tcttctgcct cagcctcccg agaactggaa gaggaggcaa 360

cagtatttgg ggcaagaagt ttgaggatga atacagtgaa tatcttaagc acaatgttag 420

aggtgttgta tctatggcta ataatggccc gaacaccaat ggatctcagt tcttcatcac 480

ctatggcaaa cagccacatt tggacatgaa atacaccgta tttggaaagg taatagatgg 540

tctggaaact ctagatgagt tggagaaagt tgccagtaaa tgagaaagac ataccgacct 600

cttaatgatg tacacattaa gggccntaac tattcatgcc aacccatttg ctcagtagct 660

attgatngan ctggacaaat tactttgncc aaattgctng aacacacttt attggggggt 720

taccccgntt ttaattatgt canaa 745



<210> 696 
<211> 795 
<212> DNA 

<213> Homo sapiens 
<400> 696 

tttcaaatng cttggctant ngttcttttt gcaggatccc atcgattcga attcggcacg 60 

aggctggcca aagccaaatc tcctaagtcc accgcccagg agggaaccct gaagcctgaa 120 

ggagttacgg aggccaaaca tccagctgca gttcgcctcc aagaaggggt ccatggccct 180 

agtcgagtcc atgtgggctc tggggaccat gactattgtg tccggagcag gaccccccca 240 

aaaaagatgc ctgccctagt cattccagag gtgggctccc gatggaatgt caagcgccat 300 

caggacatca ccatcaaacc tgtcttgtcc ttgggcccag ctgcccctcc gcccccatgc 360 

atanctgcct cccgggagcc gcttgatcac aggactagca gtgagcaggc agatccctca 420 

gcaccctgcc ttgccccatc cagcttgctg tcccctgagg cctnaccctg ccggaatgac 480 

atnaacacta ggactncccc tgaaccctca gccaagcanc ggtcaatgcg ctgttacccg 540 

aaaaagcctg caggtcaagc cagcccctta agccagggct tggcangggc ccgccnaagg 600 

ccgnaacaag accgntctgt naactcttgg gttccaaacc cggaactttg cccgaaagca 660 

tttntttccc ttaattcctt caattcaatc cggnctttcc ttaatttccn ggattcttng 720 

ggtccaaggg tccccttttt tcccccccaa naacaaagaa aaggttgggc ccgaaanggt 780 

cccaaccttn ttnnt 795 

<210> 697 
<211> 734 
<212> DNA 
<213> Homo sapiens 

<400> 697 

ctaatagctt ggctactcgt tctttntgca ggatcccatc gattcgcagc cctcttccct 60 

cccctgtcaa gtcacttacc atgcaaacca caggctctaa gagtttgtcc ccagggacat 120 

ccatccaagt catctccatg gctcctgggt cccctggtga gcatggagtc aggaggtcat 180 

caatcatcat gctggggttg gtgcgagagg ggccacagac ctgaaaccaa atggatctga 240 

ctggggcagc tgcccctcag tgtcagaggg gctcgacccc tccggtctct aaggaagtcc 300 
caaagagaat gctctgtggg tccctagcat ctgaggagga cgggctcctt cagaactcgg 360

gctgggtggt ccgagcgact catgatttgc atgggactct ggcaatctgt agccccaatg 420

ccttgatgtc ttcctcatta acactgtcac gtctcaccag gaatacagtg acattaaaag 480

tgtgatatgg tntagctgtg cccccaccca catttcaact tgaactgtat ctatctccca 540

gaattcccac atgttgtggg anggacccag ggggaggtaa ctgaatcatg gnggctggtc 600

ttttcccgtg ctattctcgt gaatngtgaa ntttnacgag atctgatggg tttatcaggg 660

gttttccaat ttttggttct tatttttctn ttgcaatctg catttaagna antgccnttn 720

ggtctctaac antn 734



<210> 698 
<211> 728 
<212> DNA 
<213> Homo sapiens 

<400> 698 

ttcnaatngc tnggcttttn gttctctttg caggatccca tcgattcgaa ttcggcacga 60 

ggtttaattt aaacctctca tcttttttta agcactcact gantttgacc gagacagcca 120 

gtcgccgttg aggaatcctc tgttgtcaac atcgagaccc ctggttttcg ggaaacccaa 180 

tggtgatgca gttgattatc agaaacagct gaagcagatg attaaggatt tagccaaaga 240 

aaaagataaa actgagaaag aattgcccaa aatgagccag agagaattta tccagttctg 300 

taaaactctg tacagtatgt tccatgaaga tccagaagaa aatgatttgt atcaagccat 360 

cgccacagtc accacactgc tgctgcagat cggggaggtg gggcagcgag gcagcagctc 420 

tggaagctgc tcccaggagt gtggggagga gctgcgggct tcagctcctt ctcctgagga 480 

ctcggttttt gcagacactg ggaagacgcc ccaggactcc caggcatttc cagaggcggc 540

agaaggggac tggactgtct cccttgaaca tattttagct tcacttctga ctgaacagtc 600 

attagtcaac tttttttgaa aagccactgg acatgaaatc caaacttgaa aatgccaaga 660 

tcaatcagtn caatctcaaa cttttgaaat gaccncaatc caatctggac ntaagctgag 720

tacttgtn 728

<210> 699
<211> 746
<212> DNA
<213> Homo sapiens

<400> 699

tttcaaatcn cttggctntt ngttcttttt gcaggatccc atcgattcga attcggcacg 60

agggaaaaac aacaggtttg agtcctataa agccataatt taactccagt agctgatgtc 120

agacaagctt gtcctatgtc ctatttgagt ggcagcagcg ccagcccagc aagaaggctg 180

ggggttgtca aggttgtccc cagaccttgc ttgcagtggt tggagaaccc agggggctgc 240

cttgggccct ctggccagag ggaagcgggc agctctagcc ctggagattg tggtcacatt 300

ggggcttgtt taggattgga gggccaggtc acctccccag ccaccctccc ttctctcctc 360

tggggtcccc actttagggc gactttgccc gagcccacgc atccatccac tcctttagtg 420

ccttgaatct cattcacaag cagccccctc ccttcccctc cccttctcac tctgttgatg 480

taatcctncc acccccagtg tccatcctaa gacaggcatc aaaaagaggc cctaacttta 540

cttnccaaat ggtgcttttt aaaaaacacc atcactacat tangggcaat tttttcacac 600

cttcctgtct tcagaatgta aaagggtggg ggaattattg tctctggtta aatntgcacn 660

cccttgactt gtgggggttt tggggcatgt tcanntattt angaatgaat tncaattnga 720

caaaaggggg tttantnaat tgttnt 746



<210> 700 
<211> 759 
<212> DNA

<213> Homo sapiens 



<400> 700 

gntttgaaat ccctttgctt tnaaatcctt tgctanttgn tctttttgca ggatcccatc 60 

gattcgaatt cggcacgaga taagggtggg gccttaattc agtagaattg gtggcctcct 120

aagcagagga agagagattt ttctttctct ctctgccatg tgaagacagt gaggagtcgg 180 

ccgtctgcaa gccaagaaga gcccttatca ggaacagact tggctagcac cttcatcgtg 240

gacctccagc ctccagaatt gcaagaaaat acatttccgt cgttgaaacc acccagtctg 300 

tggtattttg ttatggcagc ccaggcagac taatacgtga agcctgctct aaatagataa 360

aataagaaat tactacagag ggctctttag aaattgtatt taaaaacaag acaatccata 420 

tttacctaag atttacagaa tgtatgtcta taaaaggagg gatttctgga ctagatgatg 480 

atgaaaaatg ttcatataaa ggcaccttca gcttcgagtt gccaacacag gaggaagaat 540 

gctccctgct gttcagatgc tgatatgtgt cctgtgcttt ctggatggcc agtgggatca 600 

taagctggta gaagccagaa ctttcatcca ctgacttcat attcttncac atnctggaac 660 

tgtgggtgtt tgacctttta aaaaataaat ttaagcaaat tgaaatgntt tcctttgaga 720

nttttggcca naaacccaca tnganatttt ncgtctncc 759

<210> 701 
<211> 751 
<212> DNA 

<213> Homo sapiens 



<400> 701 

gcttnnaatt ccnttccaaa gnaaaccctt tgnaaattnc cctttctgnt tggatcccat 60 

ccgattcgaa ttcggcacga gggtaagtca ggtgattgaa tcccggaant nttcattgtc 120 

ttcaagctca caatactatt ttgggacaaa cagttgtcta gtgtttggac tcatgaaccc 180 

tgattcttga gggtggtatt ttactgcttt tgtgatttgg tttcaacata tatagtcttt 240 

tctccggagt taccttaggt cagtggccag tgtttcagcc cctggaaagg gcatgggctg 300 

ccactgaggt tggtcacagg cctctcagct catggtggga gtgggttcag gagttggtaa 360 

gtagggttca gttctgttgt tgccaccgat ggcaacaggg gtttgtaata atccctagtt 420 

gtgtcaatta tgtcacttaa ttttcacaac aggtctctga agtgtttctc atctcatttt 480 

tacagatgag gcctgcctgt gttaatacac ctagtgagga gtggagctga atttgaatgc 540 

aagccttggc accttaattg agcaagtttg aaacctcgct tgttgccctt ctggaaggag 600 

tcangaattt ncagttctgg gcctgggctg tgggtctggc agacagacct ctggccctaa 660 

ggtttgggtn ccangttctc tgcttccaga atgagaagct ttgctgtgca ccaagnanct 720

gggcccctct ggnatctcnt gaatnaaaan n 751 

<210> 702 
<211> 748 
<212> DNA 

<213> Homo sapiens 



<400> 702 

gntttgaanc ccctttnntt naaatccttt gctacttgnt ctttttgcag gatcccatcg 60 

attcgaattc ggcacgagcc tgaatataaa gaggaggagg aagaccaaga catacaggga 120 

gaaatcagtc atcctgatgg aaaggtggaa aaggtttata agaatgggtg ccgtgttata 180 

ctgtttccca atggaactcg aaaggaagtg agtgcagatg ggaagaccat cactgtcact 240

ttctttaatg gtgacgtgaa gcaggtcatg ccagaccaaa gagtgatcta ctactatgca 300 

gctgcccaga ccactcacac gacatacccg gagggactgg aagtcttaca tttctcaagt 360 

ggacaaatag aaaaacatta cccagatgga agaaaagaaa tcacgtttcc tgaccagact 420 
gttaaaaact tatttcctga tggacaagaa gaaagcattt tcccagatgg tacaattgtc 480

agagtacaac gtgatggcaa caaactcata gagtttaata atggccaaag agaactacat 540

actgcccagt tcaagagacg ggaatcccag atggcactgt taaaaccgta tatgcaaacg 600 

gtcatcaaga aacgaagtac agatccngtc ggataagagt taanggcaag gagggtaatg 660 

tgctaatgga cccgaactgt gacgatcctc atgtgatcat gaagtaccag tactgacttt 720 

ttatgttaaa aaatgtccat ttactgng 748 

<210> 703 
<211> 769 
<212> DNA 
<213> Homo sapiens 

<400> 703 

ggnnntnnna gnntttgaan tccctttnnt tctaatncta ggcttctngt tctttttgca 60 

ggatcccatc gattcgctca gctgaggcaa ttaaactgga aaagaaatag attgaaaaga 120 

tactacagaa gaagcagtac agaagttggg ggactgaagg agagggagcc actgcaggtg 180 

ctagctgctt aaggggatac cagtcctttt acagatataa tagatacagc ttctgaggtg 240 

gagggtgata ggagtgtgta gagaaattgc agttcagaac tggagcatgc agttaggcaa 300 

gaggcatccc atgtgaagat gtcaagcaag tactggaaaa tgctgaacta aaactcaggg 360 

atggatatgt agatttagag aacttcattg tagaggcagt cattgaaagc taaaagggct 420 

gataataaaa ttgccaagga tggaaatagt aagagggagt cagtgttatt aggattagaa 480 

ttctgttttg ttttttcttt aaacagattc tcgctctgtc accctggctg gagtgaagtg 540 

gtgtgatctc ggctcactgc ggcctcgacc tcccaggctc aagttatcct cccaactctc 600 

agccttccaa gtagctggga ccacagccat tcaaacacat gcctgcctta tgtttggatt 660 

tttttgtana aaccaaggtt ttgccatgtt tnccaggctg gnctnngaac ttctgggctt 720 

aagccattcc cccacccttg ggtctcccaa aatgctngcc attatangg 769 

<210> 704 
<211> 759 
<212> DNA 
<213> Homo sapiens 

<400> 704 

cnaannncnn ggnttcnaat annaggctac ttgttctttt tgcaggatcc catcgattcg 60 

aattcggcac gagacccgtc cggggccggc caatttgcat atttggaatg cgccgctata 120 

aacccggctg gggttttgca gcgatttctt agatgtaaaa atgagatctc aatagcagcg 180 

ggctgggcac atcctctcct ctctccttct ctctctgccc ggagctggtt tccgtctctc 240

ggctcggggc tggaactccg gcccaaccta ggcgcgcagc cgccacgaga tggcgcactt 300 

ccgatcaatg tcaaagccgc cggggagccg ggaaccccag catgattctt ggcctttgtt 360 

cgcttctgat actaagagca gcacggtaca ttatttcact tgtcccgctc cccttcataa 420

cagaaaaagg ggactcaccc tcaagaagtg attggtatgg taatttaaag caacgcgcat 480 

tcgctaggcc tcgcgagcgt cgccgcgcgg agaagccagc tgtcccttgg cagtgatttc 540

ggaaatgtgt caaggcaatt ccaaaggtga aaacgcagcc aactggctca cggcaaaaga 600 

gtggtcngaa aaaagcgctt gccccttaca cgaagcacca gacactggag ctggaagaan 660 

ggagtttctg ttcaatatgt acccttactc gaaaagcggn gcctagagaa taacccgcan 720 

cgttccacct taacggacag gacaagtgga aaaatcttg 759 

<210> 705 
<211> 777 
<212> DNA 
<213> Homo sapiens 
<400> 705



tttgaaatcc cntttnttna aatcctttgc tncttgttct ttttgcagga tcccatcgat 60

tcgtcctgaa gctcgggggg ctgcaggtcc tgaggaccct ggtgcaggag aagggcacgg 120

aggtgctcgc cgtgcgcgtg gtcacactgc tctacgacct ggtcacggag aagatgttcg 180

ccgaggagga ggctgagctg acccaggaga tgtccccaga gaagctgcag cagtatcgcc 240

aggtacacct cctgccaggc ctgtgggaac agggctggtg cgagatcacg gcccacctcc 300

tggcgctgcc cgagcatgat gcccgtgaga aggtgctgca gacactgggc gtcctcctga 360

ccacctgccg ggaccgctac cgtcaggacc cccagctcgg caggacactg gccagcctgc 420

aggctgagta ccaggtgctg gccagcctgg agctgcagga tggtgaggac gagggctact 480

tccaggagct actgggctct gtcaacagct tgctgaagga gctgagatga ggccccacac 540

cangactgga ctgggatgcc cgctagtgaa gcttgaaggg tgccaaccgt gggttgggct 600

ttcttaagca tggaggacat ttttggcaat gcttggcttt gggccattta aatgggaaac 660

cttgaaaggc caaaaaaaaa aaaaaantna tntnaaaaan aaacttnnac cttttaaaac 720

ttttaantgn ngnccgnttt tacnttanat tccagacttg attaggaatc cattttt 777



<210> 706 
<211> 760 
<212> DNA 
<213> Homo sapiens 

<400> 706 

gntttgaaat nccnttnntt caaatnctng gctacttgtt ctttttgcag gatcccatcg 60 

attcgaattc ggcacgagna atgcaaaggg ctgcagttct cattcaggct actttcagga 120 

tgcacagaac atatattaca tttcagactt ggaaacatgc ttcaattcta attcagcaac 180 

attatcgaac atatagagct gcaaaattgc aaagagaaaa ttatatcaga caatggcatt 240 

ctgctgtggt tattcaggct gcatataaag gaatgaaagc aagacaactt ttaagggaaa 300 

aacacaaagc ttctattgta atacaaggca cctacagaat gtataggcag tattgtttct 360 

accaaaagct tcagtgggct acaaaaatca tacaagaaaa atatagagca aataaaaaga 420 

aacagaaagt atttcaacac aatgaactta agaaagagac ttgtgttcag gcaggttttc 480

aggacatgaa cataaaaaaa cagattcagg aacagcacca ggctgccatt attattcaga 540 

agcattgtaa agcctttaaa ataaggaagc attatctcca cattagagca acagtagttt 600 

ctattcaaag aagatacaga aaactaactg cagtgcgtcc ccaacaagtt atttgtatac 660 

agtcttatta cagangcttt aaagttccaa aaggatattc aaaaatatgc caccgggctt 720 

gccacactta attcagncat tctatcnaat gccccagggc 760 

<210> 707 
<211> 856 
<212> DNA 

<213> Homo sapiens 
<400> 707 

gttgctttga agcctttgaa atncnttgtt tnaaatnctt ggctttngnt ctntttgcag 60 

gatcccatcg attcgcctcc ctggatgtgc agacatggag gaggacagaa ggcccagctc 120 

agtggccccc gctccccacc ccccacgccc gaacagcagg ggcagagcag tctggaggtg 180 

gtgntcccac ttgatgaaga gcaggcgact ggnttggaga gggagatcat gctggctgna 240 

aagaanggac tggacccata caatgtactg gccncaaagg gancttcagg caccagngaa 300 

gacccaaant tantncccta catntccaac aagagaatag naagctgcat ntgtgaanag 360 

gacaatacca gcntcnantg gttttggctn nacaaangcc angnccancn atgcccccnn 420 

tttgnaaccc attacaanct gntgccccan tagctggcac actgancncc tnntctaaat 480

tacttaaaat natgctgtan aagtatantn tttncagaan agactaanca ntncatngnc 540

tacttctcca aaaaaaantg anaaaaatna taaaantcaa antaaatact aaatnannan 600 
ataanancan tannaantta tatttcnnan atantanann nancnnttta naannantta 660

nggnnancan nnattantnn tnnatanntt acattaaant tattnanann anaaannnan 720

nananannat atattannan anantnacnt aaactnnnnt naatnntcca nanacttnaa 780

naanaataag nnntanatna nnnnttangn ntnatatann ttnanatann nnnnacnata 840

nnacatnnnn tannga 856



<210> 708 

<211> 766 

<212> DNA 

<213> Homo sapiens 

<400> 708 

ctaatactgg ctacttgttc tttcnaagcn ctggnntttn annnatnnag ctacttgttc 60 

tttttgcagg acccatcgat tcgcccaaac ttatcggggg tgccagaggc agagtagaca 120 

agccttagtg gccgccattt gttgaatatc tactgtgcgc caagcagtgc gtcacaactt 180 

tatgaagtag gtattattat catccccatt ttacaggtga agaaactgag tctctgagag 240 

accaactttt ccaaggtcac acagaggtgg gatccagccc acttccgtct gaccccaagc 300

ccctgctgtt aacccctgcc ccattgtggg gaggttccgg cccactctgg agttctctgg 360 

tctgcgtcag tcctcaggag aagaaagaat gggggtgatg ctccaaatat tgaggctccc 420 

atctgtctgt cctgcactag gcagagccag gcttctccat ggggcacagg agagagggca 480

ccagatctga ggagcaaata ggttcttggt ctgagatctc atgggatcag gttgccagcc 540 

ctgcaaaccc ccgctcangt ctagaggaca tggagctgcc tttcaaggtg catttgcttc 600 

ctttacagac tcggactctg tnctctggct actttgggcc gtcccggact cgggaatgcg 660 

tnctacactt gtaggggcaa aaccccggtt tgactctttc cgggttccta cccttaacca 720 

agcctttact ttctngggat caccctgttg ggactttttg tccacc 766 

<210> 709 
<211> 743 
<212> DNA 
<213> Homo sapiens 

<400> 709 

gaannccntt nnnttgcaaa tnntnggcta cttgttcttt ttgcaggatc ccatcgattc 60 

gaattcggca cgaggttttt tttttttttt tttggagaat gaatgcaaga tttattgagt 120 

ggtggaagta gctctcagca gatggctggg gagccagaag ggggatagca tgggaaggta 180 

gtcttcctct ggagtctggc tgctcagcag ccgggatctc ctactgtcct tggccgaatt 240 

tcccttggcg tccgaatcgt tccaccatca atggcctgcc agcgtctttc gatgtgttct 300 

tctgccagtg tgttcctctt gacgtccagc cgcttgtgtg tgtgcccgct ggggtctcag 360 

ggtttttata ggcacagaat gggtggcatg gcaggccaga gtggtcttgg aaaatgcaac 420 

atttgggcaa gaagacagga gtccttgttc tcattaggtc catgggcaca agcctgaggg 480 

tggagccctt gccagtgacc ctgcccttct ctacccagca cttccctgtc cccctcccat 540 

atcaccgttg ccatcttgtc cttgatgagg aatacaactc ccaattcagt gnttgcttgt 600 

gggaagatgc aatcctcttt atgacaagtt tctaanaagt tgataagaaa aatggggacc 660 

tgcctaaggg ctagtatctc atttaatact ctatagaata ttatgnggtt ttccctttta 720 

ngttttaaat gttgaananc nan 743 

<210> 710 
<211> 753 
<212> DNA 
<213> Homo sapiens 
<400> 710

gnnnnnnnnn nagngtttga antcctcctt ngaaatcctt tggcnactcg ctctttntgc 60 

aggatcccat cgattcgaat tcggcacgag gggcaatgca gttataatac tgtgttaatt 120 

tcagacatct tctggtcctc cgagccttgt atttacatac tagctgaaac tgcaagtgga 180 

aatgaatgga gctgatgata tttgccttat cctaattttt ctgtgaggag gagaaaaaca 240

cttgtgcttc aaataagcag atgtgaaaac acttctcact aatcaaaatg tttaccacta 300 

ggttatgaga gtctgcctct cataggcagt gaatctgata tgtatactta gtaatataag 360 

tctatttagt ttgacaaaac cttagagcag aatttttgca gcttagttca ggatgatcac 420 

tagcaatgcc aaacttcatt ttttattgaa cttggatcca agaaggcctg ctgtgtctat 480 

ttcagtatag actctcatac caatatattt atgctccaag tcactacacc cagaagtgat 540 

gcagtggggg aaatgcaaag acaacatcac tgtaagattc acagaatgga tcttttgtaa 600 

aatattttat attgacttaa ggaaaacctt tcattgggaa ttaattaaat taagtctcta 660 

atatcctgga agacagtaaa aantnaagcn ggtgntctca antttgaacc cggcnattng 720 

naatttcatt ataggaattt ctgaaaataa tcc 753

<210> 711 
<211> 718 
<212> DNA 
<213> Homo sapiens 

<400> 711 

naatngctag gctacttgtt ctttttgcag gatcccatcg attcgaattc ggcacgagcc 60

tacttattgg atgttggctc tttggtgtca tggagatggc tttactgtag gtttgttgtg 120

ttgcattact tttcattggg attgaactga gaaataacaa acaagcttta agtgggaaat 180

taaaaaaaag aagtaaccta tgtagatcca aacttaaaat gtgagaaatt attgaaattt 240

cattttctac aaacttgaaa ttagcctgct aattgtaaag ttgttttaat aatgctgaca 300

aatgtcagtt acgtttgcaa aggagtgtat ggttctaggt atttgcctac tgttaaccgt 360

tgagaaaaac attgtcaggt tagcaagtct attgaaatag agacctcctt agtttacagc 420

aaagaataaa tagctgatga ctggagattg ggactaaggt tttatttatt tatattcttt 480

gaaagaaatc ggacagttaa taagtggttt gtggtagagt tgaaggatgt ctgagagatg 540

gaaagagagt gacaaaggag gagaaggaat agtatttctt ttttagtatt gntttgaaat 600

taaaactctg ntattttaat atggtaaaga gcaagaattt gggttgggcc gcngtgactc 660

acgcctataa tcccagcact ttgggaagcc ntggtgggca aatcacctga aattangg 718

<210> 712 
<211> 783
<212> DNA 
<213> Homo sapiens 

<400> 712 

agttgaantn cttgctacnn aaaacctttg gcnactngct ctttntgnag gatcccatcg 60

attcgcaaag atggtcgtat tactaaaggt gaataaccag cgcggnnngc acgtggagtc 120

actggaacat ttgtgcaatg ctggtgggaa tgtcaacccg tgcggccctc tggaataagc 180

ctggcagctc ctccaagagt taccgngtga cccancaatt ccactcctag ctccacccac 240

aggaattgaa agcaaanacg caaacagatg cctgtncacc aaagttcacg gcagcatnct 300

tcgncatagt ggcagcatcc gtcgtcacag cggcatcatc cttcatcata gcggcagcat 360

ccgtcgtcac aagcggcagc atccttcgcc acagnggcan gcatctgtcg tcacancggn 420

agcatccttc gacaaagcgg cagcatnctt cgtnatagcn gcagcatcct ttgccatanc 480

cggcaaggtg gaaaccctgt ccatccactg aggcgtgcat agactaaaca tgggcagtcc 540

agcactggaa ttccaagccg tacaacggng nccacngtca aaaangaatg aggaccctga 600

ngcacctgng cnganaacaa gaacnngcga nnccaanact tttnagacat tattgcctta 660
agtngaaaaa cccagngcac caacgggaaa ccngaccgnc ntgnanccct gnttaacntt 720

nantnngttn cccgaaaatg ggggcacntt nccaaaaagg ggaataaaag gggagaattn 780

cct 783



<210> 713 
<211> 765 
<212> DNA 

<213> Homo sapiens 
<400> 713 

gttgaantcc ttcctttcaa atngcttggc tactcgntct ntntgcagga tcccatcgat 60 

tcgaattcgg cacgagccca catgtaccag gttgagtttg aagatggatc ccagatagca 120 

atgaagagag aggacatcta cactttagat gaagagttac ccaagagagt gaaagctcga 180 

ttttccacag cctctgacat gcgatttgaa gacacgtttt atggagcaga cattatccaa 240 

ggggagagaa agagacaaag agtgctgagc tccaggttta agaatgaata tgtggccgac 300 

cctgtatacc gcactttttt gaagagctct ttccagaaga agtgccagaa gagacagtag 360 

tctgcataca tcgctgcagg ccacagagca gcttgggttg gaagagagaa gatgaaggga 420 

catccttggg gctgtgccgt gagttttgct ggcatangtg acagggtgtg tctctgacag 480 

tggtaaatcg ggtttccaga gtttggtcac caaaaataca aaatacaccc aatgaattgg 540 

acgcagcaat ctgaaatcat ctctagtctt gctttccttg tgagcagttg tctttctatg 600 

atccccaaag aagtttttct aaagtnaaaa ggaaaattcc tagtggaatt cancccccaa 660 

gggaaaaaag cccacttgnc cacannagga agccnggntn ccccttngtt ccggcttaan 720 

ggccccttgt tcaggaaacc acactggggg ancttntttt ttttn 765 

<210> 714 
<211> 740 
<212> DNA 

<213> Homo sapiens 
<400> 714 

gtttgaannc cttngntttc naatgctngg ctacttgttc tttntgcagg atcccatcga 60 

ttcgccaaaa gcttgtggca aatttgaaat ttctgccatt agggacctta caactggcta 120 

tgatgatagc caacctgata aaaaagctgt tcttcccact agtaaaagca gccaaatgat 180 

caccttcacc tttgctaatg gaggcgtggc caccatgcgc accagtggga cagagcccaa 240 

aatcaagtac tatgcagagc tgtgtgcccc acctgggaac agtgatcctg agcagctgaa 300 

gaaggaactg aatgaactgg tcagtgctat tgaagaacat tttttccagc cacagaagta 360 

caatctgcag ccaaaagcag actaaaatag tccagccttg ggtatacttg catttaccta 420

caattaagct gggtttaact tgttaagcaa tatttttaag ggccaaatga ttcaaaacat 480

cacaggtatt tatgtgtttt acaaagacct acattcctca ttgtttcatg tttgaccttt 540 

aaggtgaaaa aagaaaatgg ccaaacccaa caaactaaca ttcctactaa aaagttgagc 600 

ttggacatat tttgaatttt tgtaagtgaa agatttttaa actgactaac ttaaaaaaat 660 

agattgtaat tgatgtgcct taatttgcat aaatcataaa tgtatgtcct ctctgtaatt 720 

ggtttaatgt gtgcttgaan 740

<210> 715 

<211> 708 

<212> DNA 

<213> Homo sapiens 

<400> 715 

tttgcaaatn gcttggctac ttgttctttt tgcaggatcc catcgattcg aattcggcac 60 



WO 99/33982

PCT/US98/27610

gagggaggct agactcaagc tgtctggaga gtgtgaaaca aaagtgtgtg aagagttgta 120

actgtgtgac tgagcttgat ggccaagttg aaaatcttca tttggatctg tgctgccttg 180

ctggtaacca ggaagacctt agtaaggact ctctaggtcc taccaaatca agcaaaattg 240

aaggagctgg taccagtatc tcagagcctc cgtctcctat cagtccgtat gcttcagaaa 300

gctgtggaac gctacctctt cctttgagac cttgtggaga agggtctgaa atggtaggca 360

aagagaatag ttccccagag aataaaaact ggttgttggc catggcagcc aaacggaagg 420

ctgagaatcc atctccacga agtccgtcat cccagacacc caattccagg agacagagcg 480

gaaagacatt gccaagcccg gtcaccatca cgcccagctc catgaggaaa atctgcacat 540

acttccatag aaagtcccag gaggacttct gtggtcctga cactcaacag aattatagat 600

tctaatctga tgagttactg agctttggtc ccttaaaaca agctgacttg gtccctaaac 660

cagatgaaaa tccagatgct ctatacttgg ctttaagaac tgctttcn 708



<210> 716 
<211> 730 
<212> DNA 
<213> Homo sapiens 

<400> 716

ttgcaaatng ctnggctact tgttcttttt gcaggatccc atcgattcgc tcccatggag 60 

gtggtgggaa tggcaccgag aagtttgatg acagttatct aatggactag aggttggcaa 120 

actttctgta aatggccagg tagtaaatag ttctgctttt gaaggcatat ggtctcttgc 180 

acctactcga ggctgaaagc agctatagac aatacataaa tgaatgagcg tgagtgtgtt 240 

ccaataagaa aaaaacatgg ctgtttgctt cggccccagg gttgtagctt accagtcctg 300

taacagatca cagtttgctc ttttggtcac aaatacttga acccctccct agttcagagc 360 

atttgatacc gtaatattta aagctcactt gtaaaacatc gtttgttgcc tccatccata 420 

gtatctcaaa cagaatgtct ctcccaaata tacctaaatt ccatattctc tgaagcacaa 480 

ccagctattt tcttgacata cttcctaaca caccccacag ttcacaattt gatctgaaaa 540 

cttgttaagg gaggttcttt ggcatgtgat gccataaaaa gagaggtatg ggctctcctt 600 

taaaaaagag acccttttta tgagactcac aataggataa aagagcccat gcctattttt 660 

aaacattttt ttcactatat aagacatgca tgcctgnaaa atggttttta attagtatna 720 

ntgcttaatn 730 

<210> 717 
<211> 728 
<212> DNA 
<213> Homo sapiens 

<400> 717 

naatngctng gctcttgttc tttttgcagg atccctcgat tcgctgcagt gagattctct 60 

gcaatgactg gcctcagcaa gggggcagct taggaccctg acatcccagg tcactaagcc 120 

acataggata agtaatgggt ggacagaagc gggaaaggag aagggcaggg cacatgttta 180 

aaacttgaac tttctgaggc taagactgga aaaggaatgg tttcagctga tatatttgga 240 

taccagttga ctatttttag gaaaaaaaca caaatggctt ttaaacatca cagtgtgata 300 

cagtctaact cagaattaga gacaggcaaa acagaactcc atcttaaaaa ataaataaat 360 

aaaataaaat aaatgacatc actttggttc agagctctaa aatggaggga ggaagccatt 420 

ctaaaaagga ctccctacat gacctgcaac ttgaaaaaaa attaaaagct ccaaaaaaaa 480 

caatncagga gcttaccttg aaccttttga attgggccaa attgccgatg accactgcat 540 

cctggaaaat tttatttcac cagcactaca acttctcaac agcaccaacc aatttaacta 600 

tggatttttg tactaanccc agttgcctct ttnaaaacaa cttgtcaact ttgtctaatc 660 

accctcagct tttttttaaa aacccctnct ctacccctnt ctcttcagaa caccaaagtg 720 



gncttttn 728

<210> 718
<211> 730 
<212> DNA 

<213> Homo sapiens 



<400> 718 

gaantccttn nntttnaaat cnttggctac ttgttctttt tgcaggatcc catcgattcg 60 

aattcggcac gatctagata ttgcccaatc gctgcccaca gtgcacatac ctttccacca 120 

gtcacatgtg agagggcaga ttttccaaat gctcatcacc acttggcact gtgtggacta 180 

taattttggc cagttaggaa atggcatctc attgttttca tcttaatttg cgtcagcctg 240 

attactcatt gaaacttgtg aggttgagaa acttttctta agcttattgg ccattcaagt 300 

ttcctccttt atgaaatggt tgttcatgtc atttgctcat ttttatatta gattgttttt 360 

cttttttcca gctgacttgt aggaactcta catcttatca atattaatca tttatcgaaa 420 

actatttggg tgccattatc ttctcctagt caatgttttt tgtttgtgat atcttttata 480 

atatataagt ttttaatgtt ggcagaagta aagttaatct ttttggctgt gttgtgtgtc 540 

ttgtttgatg taaagatagt ttctgtaata gttttgcagt ttgattggtc atctttaggt 600 

cttcaattac aacctgcaca ttcatccctc tatcctcttt cttactctgg ttttctccat 660 

agcacttatc atccaataat atggcatgca cttatttaat ctggtttgca tatatatttt 720 

ngctggtacg 730 

<210> 719 
<211> 733 
<212> DNA 
<213> Homo sapiens 



<400> 719 

ttcaaatcgc ttggctactt gttctttntg caggatccct cgattcgctt cagtgcacac 60 

aacaggagag aggagaaaga agaaacgcta gtaattccaa gcactggaat taagttgcct 120 

tcatcagtgt ttgcttcaga gtttgaggaa gatgttggat tgttaaataa agcagctcca 180 

gtttcaggac ctcgactgga ttttgatcct gacattgttg cagctcttga tgatgatttt 240 

gactttgatg atccagataa ttctgcttga ggatgacttt attcttcagg ccaataaggc 300

aacaggagag gaagagggaa tggatataca gaaatctgag aatgaagatg acagcgagtg 360 

ggaagatgtg gatgatgaga agggagatag caatgatgac tatgactctg caggcctatt 420 

gtcagatgaa gactgtatgt ctgtgcccgg aaaaactcac agagctatag cagatcactt 480 

gttctggagt gaggaaacaa agagtcgctt cacggagtat tcgatgactt nctcagtcat 540 

gaggagaaat gaacagcttg accctacatg atgagangtt tgagaaagtt ttatgagcca 600 

tattgatgat gatgaaattg ggagctctgg ataatgccag aatttggaaa ggttctattc 660 

aagtgggaca gcaattcgct ttcnaggaag ttttgaatga ctactattaa agagaangcc 720 

caanaattnt ntt 733 

<210> 720 
<211> 740 
<212> DNA 
<213> Homo sapiens 



<400> 720 

agttnnnttn ntnctnttca aatccttggc tacttgntct ttttgcagga tcccatcgat 60 

tcgaattcgg cacgagaaga gaaggaccta gagattgaga ggcttaagac gaagcaaaaa 120 

gaactggagg ccaagatgtt ggcccagaag gctgaggaaa aggagaacca ttgtcccaca 180 

atgctccggc ccctttcaca tcgcacagtc acaggggcaa agcccctgaa aaaggctgtg 240 

gtgatgcccc tacagctaat tcaggagcag gcagcatccc caaatgccga gatccacatc 300 
ctgaagaata aaggccggaa gagaaagctg gagtccctgg atgccctaga gcctgaggag 360

aaggctgagg actgctggga gctacagatc agcccggagc tactggctca tgggcgccaa 420

aaaatactgg atctgctgaa cgaaggctca gcccgagatc tccgcagtct tcaacgcatt 480

ggcccgaaga aggcccagct aatcgtgggc tggcgggagc ttcacggncc cttcaccagg 540

tggaggacct ggaacgcntg gagggcataa cngggaaaca gatggagtcc tttctgaagg 600

caaacattct gggtctcggc ggccgccanc gctntggcgc cttctgaccg tcgctnctac 660

ttncgncttt tcaaattttt ggnataaccc ccgtgtttgn gtaaaatcca gtttttgttc 720

cgntaaaaaa aaaaaaaaat 740



<210> 721 
<211> 736 
<212> DNA 
<213> Homo sapiens 

<400> 721 

nnttnnnttt tnnaaatccc ttggctactt gttctttttg cagggatccc atcgattcgc 60 

atgagtgata ttttggtctg ggtttcctct taagatttta gtttgtctga attaaggaaa 120 

aatgttttta atatacattc ttattttgtc ccacccctcc agaaataagc tggaaatctt 180 

aactttttgg ggggtctttt ttggtgtttt aatgggccca gaactgtggt ttaaattttt 240 

atgtatgtat tttctttttt gtggagtata aatttaaaaa ctggatttgg gacctaaaat 300 

actcctcagg ttgatgtatt catgaagttt taaaacatct ttagttttca aagtaaactg 360 

gatatgtgga ccttaaagtt attgagttta agctacaaat tgtaacgtca ttactggaca 420 

tgtcagcatc aaccctctca aaatagcttg gtcactttat gaaggggcgt tttaaagttg 480 

ttgtttagca gtgacattta atatggtcca attgcttttc tttttaacgt gacaaaaaga 540 

gaataaggaa caaacactat tgctgccgaa tgccataaca ctgagttgtc aaattgtgat 600

tgaggaaatg aaaaggttta tactttttaa aaaaaaaaaa cnnaanccaa aaaaccaaaa 660 

cttcaaatgg aataaattat tcatgaagcc cttaaaaaaa aaaaaaaaaa aactcgaacc 720 

tntaaaactn tngngg 736 

<210> 722 
<211> 751 
<212> DNA 
<213> Homo sapiens 

<400> 722 

attnccttgg cttttcaaat ccttggctac tngttctttn tgcaggatcc catcgattcg 60 

aattcggcac gagattatag agattaatct cctttgctcg aagtctattt aaatattagt 120 

cacatctaaa acatactttt acagcaacat ctagactggt gtttgaccaa acaactgggc 180 

atcatagctg acacataaaa ttaaccatca caaccatgtt ctaggcactg ttcctcactg 240

cctgagaaga caccgttatg tttattaggg tttttgagtt ttatccacag cttttggtta 300 

tctgcaacca tgtctcccac cattaacata gttcacactg agatgaggat tccctattta 360 

acacttggtc ccaacttctt cacagtccat ctggttttgt agagggaaca taactggaca 420 

ttctggtcag gttaggtgag gtcaggcctt caggacgcta ttttcactga gttgctttat 480 

aaggcacatt atgcaaaatt ccatcagctc ttctgttcac tacattcact gttgaaattc 540 

taagagtgag actgctgtct cacaccaaag ccagtgggta ctatcttcag taggcacgca 600 

gcatcatgtt tgtatttgat ccanctagat gacatgtaag agaaaacttt attgnggact 660 

ctgtaaagtg tgacattcgt ttgtgactca atttgctcat gtatttggtc ctggggagtc 720 

attacatagc taactttcag ctgctttcaa t 751 



<210> 723 
<211> 749 
<212> DNA

<213> Homo sapiens 



<400> 723 

tttaatncct ttcnntaatc cttngtttcn ngcnctttnt gcaggatccc atcgattcga 60 

tgctagccaa agcctgctgc cagctccata gcctggacct acagcactcc atggtggagt 120 

ccacagctgt ggtgagcttc ttggaggagg cagggtcccg aatgcgcaag ttgtggctga 180 

cctacagctc ccagacgaca gccatcctgg gcgcactgct gggcagctgc tgcccccagc 240

tccaggtcct ggaggtgagc accggcatca accgtaatag cattcccctt cagctgcctg 300 

tcgaggctct gcanaaaggc tgccctcagc tccagcctgg accttgcccc caggtgctgc 360 

ggctgttgaa cctgatgtgg ctgcccaagc ctccgggacg aggggtggct cccggaccag 420 

gcttcctagc ctagaggagc tctgcctgnc gagctcaacc tgcaactttg tgagcaacga 480 

ggtcctnggc cgnctactcc acggctctcc caacctgcgc ttactggatc ttcgtggctg 540

tgcncgcatc acgccggctg gccttcagga tctgccatgt cgggagctgg agcagcttca 600 

tctgggcctg tatggcacgt cagaccggct gacttttacc aangagggca agnccctttt 660 

gaccagaant ggtgcataca ctgcgaagaa ctggactttg aatggccaag ggttcaattg 720 

agaaagacct ggaacangcc cttgctnct 749 



<210> 724 
<211> 761 
<212> DNA 
<213> Homo sapiens 

<400> 724 

ttnnnnccct ttttaatncc ttctactaat ccttggctct cgntctttct gcaggatccc 60 

atcgattcga attcggcacg agcctcagcc ttctaaaaag ctggggctac acccagctga 120 

agaaattgta actaaagata gattgtttaa agcaaagcaa gaaacttctg aagaaatgga 180 

acaaagtgga gaagcctcag gaaagcccaa cagagagtgt gcaccccaga ttccttgtag 240 

tactcctatt gctactgaaa ggacagttgc acatttgaac actctgaagg accgtcaccc 300 

aggtgatttg tgggcccgca tgcacatctc atccctggaa tatgctgcan gagacattac 360 

ccgaaaaggg agaaaaaaag acaaagctcg agtgagtgaa ctgctccaag gcctctcatt 420 

ctctggtgac tcagatgtgg aaaaagataa tgagcctgag atccagcctg ctcaaaagaa 480 

gttaaaggta tcatgtttcc cagaaaagag ttggaccaaa agagacatta aacccaattt 540 

tccaagctgg tcagcactgg attctggact tttgaatctc aagagcgaaa agtttgaacc 600 

cagtagagct ttttgaatta ttttttgatg atgaaacatt caacttaatt gtcaatgaaa 660 

ccnataatta tgcttctcag aaaaatgtca gctttggaag tccagttcag gaaaaaaaan 720

nnnnannaaa aaactcgagc ctntanaact atngtgagtc c 761 

<210> 725 
<211> 760 
<212> DNA 
<213> Homo sapiens 



<400> 725 

tttcnccccn tttttanccc cttnctctaa tccttggntc tngttctttt tgcaggatcc 60 

catcgattcg aattcggcac gaggcggact ctcaggacga aaagagtcaa acctttttgg 120 

gaanttcaga ggaagtaact ggaaagcaag aagatcatgg tataaaggag aaaggggtcc 180 

cagtcagcgg gcaggaggcg aaagagccag agagttggga tgggggcagg ctgggggcag 240 

tgggaagagc gaggagcagg gaagaggaga atgagcatca tgggccttca atgcccgctc 300

tgatagcccc tgaggactct cctcactgtg acctgtttcc aggtgcctca tatctcgtga 360 

ctcagattcc cgggactcag acagagtcca gggctgagga actgtccccc gcagctctgt 420 
ctcccttgct agagcccatc agatgctctc accagcccat ttctctactg ggctcctttt 480

tgactgagga gtcacctgac aaggaaaaac ttctatcagt actttgatat gtcacagttt 540 

catgtttatc cagttcaatg tatttttaaa tttttccttg agacttcttt gactgataga 600 

ttattgtgaa gtgtgttttt aaatttncaa atgtttangg attttcatat ctttcttatg 660 

ctgatttcca attggattcc ttacaatgat ttttgggttt catctgctct tggatgatta 720 

ctatctcttt taaatttggt gtggccaagt tttagggccn 760 

<210> 726 
<211> 741 
<212> DNA 

<213> Homo sapiens 



<400> 726 

ttntgccctt tgtntnatcc ttgntcttgc ctttttgcag gatcccatcg attcgaattc 60 

ggcacgagac aagttctatt gagtgctatt cagaatagga acaaggttct aatagaaaaa 120 

gatggcaatt tgaagtagct ataaaattag actaatctac attgcttttc tcctgcagag 180 

tctaatacct tttatgcttt gataattagc agtttgtcta cttggtcact aggaatgaaa 240 

ctacatggta ataggcttaa caggtgtaat agcccactta ctcctgaatc tttaagcatt 300 

tgtgcatttg aaaaatgctt ttcgcgatct tcctgctggg attacaggca tgagccactg 360 

tgcctgacct cccatatgta aaagtgtcta aaggtttttt ttggttataa aaggaaaatt 420 

tttgcttaag tttgaaggat aggtaaaatt aaaggacatg ctttctgttt gtgtgatggt 480 

ttttaaaatt tttttttaag atggagttct tgttgcccag gctagaatgc aatggcaaaa 540 

tctcactgca atctcctcct catgggttca agcaattctc ctacttcagc ctcccaagta 600 

gctgggatta caggcatgtg ctaatttggt gtttttaata gagatgaggg ttttccatgt 660 

tggtcangct ggtctcaaac tcctgcctta ngtgatcgcc tcggcctnct aaagtgctgg 720 

aattcaggca tgaancncca t 741 

<210> 727 
<211> 751 
<212> DNA 
<213> Homo sapiens 



<400> 727 

ccttcttccn aangctntgt tgaacncctt tcnnnatcgc gcttgcgctt tgagctagga 60 

taaaaattgg gtaaagggac atttgcttac ctgnntnatg aatcactntt tgaaatgtga 120 

tcttgccata tcatcaagaa acttgttttc tggatgaata ctgggagaat aaaatgagaa 180 

ctctggagtg agctaaattg atcccaatna agtttttctg cttagcagac agaaggtata 240 

attntttgac accctttccc acctggtgcc tatgctaggc ttgtcctgan aacatncctc 300 

agtaacttga tattcacatg acctacagga tgtcccatct gcagggctga gtcagttggg 360 

gaacaccaga ggctacacag tagctattcc tgctactcgg ttaatgagct tggcaggttc 420 

tttgtctcac tgaattctta tcatggaaac agcagcagca gccgctagga aatcttcaag 480 

tgtagnggcc tgtgctaacc cagtggtaaa tcccttagat cccctgctgg tctctggcaa 540 

aactccttga tnttgggtac catgtatant ttgcctttga cntttaacgc tttctacgat 600 

anggtaanca cncntttaat ttangcnctg gancattaac tttctttgca aaggctactt 660 

atngccngnc acaantgcag cctcggacan ancnnangnn atatcctgtt ggccatggct 720 

ntgatgtttg acanccgata ngccttctnc g 751 

<210> 728 
<211> 765 
<212> DNA 
<213> Homo sapiens 
<400> 728



tngnntttnt ttaacnttgt ttgacgcctt tctgcaggat ccctcgattc gcactggcta 60

cctgcagatt gcagagcggc gagagcccat aggcagcatg tcatccatgg aagtgaacgt 120

ggacatgctg gagcagatgg acctgatgga catatcggac cangaggccc tggacgtctt 180

cctgaactct ggaggagaag agaacactgt gctgtccccc gccttagggc ctgaatccag 240

tacctgtcac aatganatta ccctccaggt tccaaatncc tcagaattaa gagccaancc 300

nccttcttnt tcctncacct gcaccgactn ggncacccng nacatcanng agggtgggga 360

gtnccncnnt gttcagtccg atgaggagga anttcangtg gacactgncc tgnccacatn 420

acacactnac agagangcca ctcnngatgg tgntnangac agcaactntt aaattgggac 480

atgggcgtng tntggccaca ctggaatcca nntttggctg tatgcggaat ttcacctgcn 540

aagccaggtt nnttnataga cgttcttgat tattacataa ttgccaatca tgtggtgagn 600

aacttgtgng aacantttaa caattaantg tgaagaccgt acaangaatt agttaaangc 660

natnnagggc taaacaagct attacttntg annnaantta angnatntaa nntttnctgn 720

ttctnaaaat nttcaatntn nngggaacan ttgtaanttt nncnt 765



<210> 729 
<211> 743 
<212> DNA 
<213> Homo sapiens 

<400> 729 

tannnnttnc tntannnttt ctgangccct tntgcaggat cccatcgatt cgaattcggc 60

acgaggagat ctctgggatg tcagtgaggc tggttgaaga ccagaggtaa actgcagagg 120 

tcaccacccc caccatgtcc caggtgatgt ccagcccact gctggcagga ggccatgctg 180 

tcagcttggc gccttgtgat gagcccagga ggaccctgca cccagcaccc agccccagcc 240 

tgccacccca gtgttcttac tacaccacgg aaggctgggg agcccaggcc ctgatggccc 300 

ccgtgccctg catggggccc cctggccgac tccagcaagc cccacaggtg gaggccaaag 360 

ccacctgctt cctgccgtcc cctggtgaga aggccttggg gaccccagag gaccttgact 420 

cctacattga cttctcactg gagagcctca atcagatgat cctggaactg gaccccacct 480 

tccaactgct tcccccangg actgggggct cccangctga nctggcccag agcaccatgt 540 

caatgagaaa gaaggaggaa tctgaacctt gggtaaggat ttggggcaca gtaccaggaa 600 

gggggcttgg tgccagacct tatgaggaag aaggattttc ctatgtacag agaangggac 660 

cctgtnctgt tgggaagtgc ttgtgcaaac ctaaccaagt tactaacccc tctgntttct 720 

gtgctacaca aaggggataa att 743 

<210> 730 
<211> 744 
<212> DNA 
<213> Homo sapiens 

<400> 730 

tttnttcctt cctctaatcc ttttancgcc tttctgcagg atcccatcga ttcgaattcg 60 

gcacgagggg tcctccaaga gtttggggcg cggacnnnag taccttgcgt gcagttatgt 120 

cggcgtntgt agtgtntgtc atttcgcggt tcttacaaca gtacttgagc tccactccgc 180 

agcgtctgaa gttgctggac gcgtacctgc tgtatatact gctgaccggg gcgctgcagc 240 

acggttactg tctcctcgtg gggaccttcc ccttcaactn ttttctctng ggcttnatct 300 

cttgtgtggn tgagtttnat cctagcggtt tgcctgataa tacngatcaa cccacngaac 360 

aaagcngatt tccaaggcnt ctgcccagag cnagcctttg ntgannttct ctttgccagc 420 

accatcctgc accttgttgt natnancnta ggtgnctgaa tcattctcan ttncntaatt 480 

gangagtang anactaaaag aatgttgact ctttgaatct gctggataag agactngaga 540

tggcagctta ttggacacat ggattttctt cngatntgca cttactgcta gctntgctan 600 
ctatgcagga gaaaagccca tagttactgc gtgtnacaac aactntctaa cnaacattca 660

ttaatccann ngannccttt caangaatgg taancctatg ccnttcaana tactgaactt 720

nntgccactt ntggcaaaaa aaat 744



<210> 731 
<211> 746 
<212> DNA 
<213> Homo sapiens 

<400> 731 

cttattccct ttgnaactna ctctttntca tccctttgtg caggatccca tcgattcgaa 60 

ttcggcacga gtgtccttat ctgaaattca gcgatcttnt tgaataagca tttctctgat 120 

tgtggtatat gcctttaatt ttatttctag agtgacaaat ttttggtttt gacagttttt 180 

ttctagcttt atagtttctt cttggggaga gaatatgtca acctcactcc atcatgctga 240 

agtaaatctt catctcttaa ttttatctct caaaaatatc ctaaggattc cctctggagc 300 

ctgataagta attgcagtat ctggtttcta tggttggatg attcaggatt ccaggaataa 360 

tagttacttt ttagacctct aaagaagaag taacaaccac gtaaatgaaa agatgcttct 420 

taaatcatgg agaatcaggg cttagtatca ctgtattttc aaactgtttc agccttactt 480 

tataactgat ttagtatatt tttcttttaa tttcagactt cagtgaagtt ccttatgact 540

tcccctgaaa ttgcttcctt atcatggggg caaatgaaag taaaaggctc taatacaacc 600 

tataaggact gcaaagtatg gccagggggt agtcngactt gggattggag agaaacagga 660 

actgagcatt ctcctggtgt gcacctgcag atgtgaagga agttgttgag aanggtgtcc 720 

agactcttgt gattggncna nggata 746 

<210> 732 
<211> 756 
<212> DNA 
<213> Homo sapiens 

<400> 732 

ttnnnnncnn nnatcctttn gatttnattc ctntntcang tcctttgtgc aggatcccat 60 

cgattcgaat tcggcacgag gtggcccata agttttacct tttaaacatc cggctgcctg 120 

tgaatgagaa gaagaaaatc aatgtgggaa ttggggagat aaaggatatc cggttggtgg 180 

ggatccacca aaatggaggc ttcaccaagg tgtggtttgc catgaagacc ttccttacgc 240

ccagcatctt catcattatg gtgtggtatt ggaggaggat caccatgatg tcccgacccc 300 

cagtgcttct ggaaaaagtc atctttgccc ttgggatttc catgaccttt atcaatatcc 360 

cagtggaatg gttttccatc gggtttgact ggacctggat gctgctgttt ggtgacatcc 420 

gacagggcat cttctatgcg atgcttctgt ccttctggat catcttctgt ggcgagcaca 480 

tgatggatca ncacgagcgg aaccacatcg canggtattg gaagcaagtc ggacccattg 540 

ccgntggctc cttctgcctc ttcatatttg acatgtgtga gaaaggggta caactnacga 600 

atcccttcta cagtatctgg actacagaca ttggaacana gctggccatg gncttcatca 660 

tcgtggctgg aatctgcctc tgcctctact tcctgtttct atgcttnatg gnatttcaag 720 

tgtttcngac atcantggga agcaatccac ctgccn 756 

<210> 733 
<211> 742 
<212> DNA 
<213> Homo sapiens 

<400> 733 

cntatccttt nntttattcc ttnataagnc cttnngcagg atccatcgat tcgaattcgg 60 
cacgagctca cacctgcttt ggatgcttca agcacctcag ccctctgaac tacaaaacag 120

aagagcctgc aagtgacaaa ggaagtgagg cagaggccca catgccccca ccgttcacac 180

cctacgtgcc tcggattctg aacggcttgg cctcggagag gacagcactg tctccgcagc 240

agcagcagca gcagacctat ggtgccatcc acaacatcag cgggactatc cctggacagt 300

gcttggcgca gagcgccacg ggcagtgtgg ctgctgcccc ccaggaggcc tgaggctggg 360

tctcactgct ctgaaaagac acaaccagaa tggcctgggg ctcaggccct tggctgagtg 420

ggaatgcgtt gggactgccc agctgagcta tcaggtgccc atcttttctg gtcccagcag 480

tggtgaggag agcacaggca ggcctcgccc ctcccttgct cacccagttt cccctncggc 540

acaagcttcc agctctgcag ctggggtgac atccccagtg gtttgtcgcc aagacatgtg 600

gtggactttt cgccccccaa actgatgagt nccggagaat atatggagag agagatgtaa 660

aaaaaaaaaa nnnnnnnnnt nntnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 720

nnnnnnnnna annnnananc tc 742



<210> 734 
<211> 749 
<212> DNA 
<213> Homo sapiens 

<400> 734 

nntanaatcc ntttnnctnt aatccctcta ncaaatccct tgggcaggat cccatcgatt 60 

cgaaaattta tagtaatgac aaatgactta tcagtgttca tcatctgaaa gctaagtggt 120 

tcgttcaatc actttttcaa agttgatagt agattgcatg gtttcatgtt tcctcatatt 180 

ggtttattaa ttctatttaa tcaaggaaaa taacttcaga ttccataaag tttcagttta 240 

tttttagttt actactaggt gagatagcac attacatact tttactatca aatattattt 300 

tagcagcttc ccatagtacc aaatgatttg attccctact ctcatttttt aaagcatata 360 

aatatttatg ggcttaaaaa gggggttttt aaaaactgag gatatcagta ataaattgca 420 

gaatattttg caaagctttc ttttggaaag caaacttttg tgcctgccta tatgcaaagt 480 

attttatcag ggacttgaac aaagacctca ctctttttca cttgtcttat gtcgagagaa 540 

aagggtattg gcagccacat tcctaagact ggggaatggt gtgtcccttt aaatttgaag 600 

ataactttan gtaattatng gaactcctca aagaggagaa agtaattttt tncagacatt 660 

ttctcaatct gggnctttca cacactantt tncatagtcg agaatctggt tttacccatt 720 

gggctgngaa tgtccaatat cagtcctgg 749 

<210> 735 
<211> 770 
<212> DNA 
<213> Homo sapiens 

<400> 735 

gngntnngnn gttnnnttnt tttnaatnta atccttgtnt naantccttt tgcaggatcc 60 

catcgattcg aattcggcac gagggtgccc atcaccacac ccagctaact tttgtatttt 120 

tagtagagac ggggtttcac catgttggcc aggctggtct tgaactcctg acctcgtgat 180 

ccgcccgcct tggccccgca aagtgctggg attacaagca tgagcccagc gcctggctgt 240 

atctttcatt ttacccaagt cactttaccc aagtaagtaa ttaggggaaa gcctgagtct 300 

tgtaccacct gttcatttgg ggaactgtgg gaaacggagc caacggacct aagtgccctt 360 

tgacagtgag tttcatacca tttcagtagt gtatttcttt cttaatctga ataaaccaga 420 

atgatactct cagcacagaa gaataaaggg agcgagtcat taacgttntc tttttaaacc 480

tttatgatga cttncttatg aattactgaa cgaacactgg aatgggactc acgtatcctg 540 

aggacatctc tcaactctgg ccttantttc ccctctgtaa aattagggtg ccaactaaat 600 

gatctacaag gtccctttnc aagcgcccgn cattctgtaa ttacatcatg tggaactgna 660 

ttaaacatac accagtgaac tggcangcat tgggaatgta actttcccag taaaatgctt 720

tnggtttggt tcaaaataca ctntgaactt cttttcaaag acnggttnng 770

<210> 736 
<211> 746 
<212> DNA 
<213> Homo sapiens 

<400> 736 

tttnnctttt attcaaatnc ttgcnggatc ccttgattcg aattcggcac gagggatgnc 60 

catcgatgct natcnggcac gaggtgatgn cagcttgcaa actggtctac atnncaaact 120 

gatagtacat tgccatctnc aggaagactt gacggctttg ggattttgtt taaactttta 180 

taataaggat cctaagactg ttgcctttaa atagcaaanc agcctacctg gaggctaagt 240 

ctgggcagtg ggctggcccc tggtgtgagc attagaccan ccacagtgcc tgattggtat 300 

agccttatgt gctttcctac aaaatggaat tggaggccgg gcgcagtggc tcacgcctgt 360 

aatcccagca ctttgggagg ccaaggtggg tggatcacct gaggtcagga nctcgagacc 420 

agcctggcca acatggtgaa accccatctc tactaaaaat acaaaaatta gccangtgtg 480 

atggtgcatg cctgtaatcc cagctcctca gtaggctgag acaggagcat cacttgaacg 540 

tgggangcag angttgcagt gagcccgaga ttgcaccacc gtactnnaac ctgggtgaca 600 

gagcgagact tatcttatan ataaatagat ngatcttcac ctgggtgaca naacgagact 660 

tatagataga tagatagata gatggataga tngatngatn gatagataga ttgataaacg 720 

gaattgggcc ttttgcttta atgaaa 746 

<210> 737 
<211> 751 
<212> DNA 

<213> Homo sapiens 
<400> 737 

ntnnnncttt ttgatcantc ctttnttgga tcccnttgct acttgttctt tttgcaggat 60 

cccatcgatt cgaattcggc acgaggctga cctacagcag aagctgctgg atgcagaaag 120 

tgaagacaga ccaaaacaac gctgggagaa tattgccacc attctggaag ccaagtgtgc 180 

cctgaaatat ttgattggag agctggtctc ctccaaaata caggtcagca aacttgaaag 240

cagcctgaaa cagagcaaga ccagctgtgc tgacatgcag aagatgctgt ttgaggaacg 300 

aaatcatttt gccgagatag agacagagtt acaagctgag ctggtcagaa tggagcaaca 360 

gcaccaagag aaggtgctgt accttctcag ccagctgcag caaagccaaa tggcagagaa 420 

gcagttagag gaatcagtca gtgaaaagga acagcagctg ctgagcacac tgaagtgtca 480 

ggatgaagaa cttgagaaaa tgcgagaagt gtgtgagcaa aatcagcagc ttctccgaga 540 

gaatgaaatc atcaagcaga aactgaccct tcttcaggta gccagcagac agaaacatct 600 

tcctaaggat acccttctat ctncagactc ttcttttgaa tatgtcccac ctaagccaaa 660 

accttntcgt gttaaagaaa agttnctgga caaaacatgg acatngagga tctaaaattt 720 

ggtcanagca tctgtgaatg agcatganga t 751 

<210> 738 
<211> 795 
<212> DNA 
<213> Homo sapiens 

<400> 738 

aatccctttg ctttaancct tgtttgaacc cctttggaac tncctctntn tgnaggatcc 60 

catcgattcg aagagcncan gcaggaagag agagaccctn actgctgggg anttnctgcc 120 

acactcaagt ccccaaccca ctggaatctc ccctactaca agtgccatgt anaccccttg 180 
aaaaggggag gggcctaggg agccgacctt gtcatgtacc atcaataaag taccctgtgc 240

tcaaccaaaa aganaantan anaaaactcn agcctctaga actatagtga gtcttattac 300 

gtagatccag acatgattng anacattgat gagtntngac aaaccacanc tcgaatgcng 360 

tgaaaaaaat gcnttatntn tgaaanntga natgctatat nnntcattnn ttaccattnt 420 

antctgcagt aaacaaantt tacagcancn nttgnntnga tttcatgtnt caagttcaag 480 

gnganntgtt tggcgttnat ntaattcggc ccnacncgng acccttttgc attgggcccn 540

nnacccanct ntagttccct nttagngagg ggnaattgcg cnctttggcg taataatngg 600 

gcanangctg nttttcccnn tgtnnaaatt ggtttatcca gtttannaat ttcaacacga 660 

tnaatatcaa acccggtaag cnattaaatg gtnaaaaacn ntgnggggng cccttaanga 720 

gttgaactta accnganatt aaattgcnnt tncgcnttna atntcccncn ttttaaatcc 780

nggaaaacct tcccc 795 

<210> 739 
<211> 763 
<212> DNA 

<213> Homo sapiens 
<400> 739 

ttnnnnncct catnaatccc ttctttgatc cctcnccnca aaacccttgg cnactcgctc 60 

tttntgcagg atcccatcga ttcgaattcg gcacgaggca nccttcgcct cctgggttca 120 

agtgattctc ctccctcaca tcccaagtag ctgggactac aggcacgtgc caccacaccc 180 

agctaattnt tgcattttta gtacaggcag ggcttcatca tgttggccag gctggtctca 240 

aactcctgat ctcaagtnat ctgcccactt tggcctccca aagtgctggc attacaggaa 300 

tggagccacc gcgcccagcc tgatttcttt anntangtct tgtcangaaa natattgant 360 

ctnttgattc ntnaacatgg cnttnggtcg tctttaatnn gnctcatcan tgcctccatg 420 

tgttnttgat gccttngaac tggtattttt aaaatnncaa tttctaattg nnnattatnn 480 

aaacacaatt gggntnnata tattggcatt gtattaatgc aactttccta aactcactag 540 

taattctagt agcntnantt ggtanattct taaggatttn ctgngtnaat agncatgtca 600 

tctgtgaatn aagccattct ttganccttt tcaaattttg agccttgtat ttcttattct 660 

taccatatca cattggcaaa gacctccagt atganattga ataaangtgg tganagaaaa 720 

caccctncta aaantgctng aattacaggc atgaaccacc ntn 763 

<210> 740 
<211> 765 
<212> DNA 

<213> Homo sapiens 
<400> 740 

tnnnnnnnnn tttttnaacc ntttnttgna tncntctntc aaatcgcttg gctacttgtt 60 

ctttttgcag gatcccatcg attcgctagc ctgggcaata tagtacgacc ctgtctttac 120 

taaaaatgca aaaattaacc acgtatggtg gctcacacct gtagtcctgg ctactgagga 180 

ggctgatgca ggagaatcat ttgaacccag gaggtcaagg ctgcagtgag ctatgattgc 240 

accactgcaa tccagcctgg acaacacagt gagaccctgc ctcacaaaaa ttatattctg 300 

attttctgag tccatgaaca cattgtccaa atggattttt ctagctcctc caagttacag 360 

atagttccac gcacacacag aactcaccac tctcaaatat tttccccact agtattacta 420 

ttaaattttt caaacatgca aaagatgaaa gaattgctca gtgaacacca tgtacccacc 480 

acctagattc tacaattaac attttaccct actttcttta tcacatatat gtacctatcc 540 

atctatccat tcttccatga atccatcaat tcatctaatt ttttatatat ttcaagttaa 600 

gttgcagata tgtagcttat gtttcacctt aaatgtttct gcctggctat tattaactgg 660 

agtgcaatat gtttttggnt cttctttatg gtaaaatcta tgttcagtga aatgcacaag 720 

acttangtat gccattaata gggtttgacg aatagacaaa ccttn 765 
<210> 741
<211> 753 
<212> DNA 

<213> Homo sapiens 
<400> 741 

ttnganccnt tnnntnnttn nntnaatgaa gccatttgct acttgntctt tttgcaggat 60 

cccatcgatt cgaggaaggt ggaggggcag gnaacaggac ggacaggccc cgggctctgg 120 

cacatcctgg ggaacaaggg accacaagga cgggggcagt ctccagactt cccctgggcg 180 

cttgacccca ggccttgcag gggagagagc cagggcctcc ctcaggtctt tgttcatgct 240

gttttccctg ccgtggacac cctttcccgc tctccgattc tctaaatcct gccccatctc 300 

ccagatcttg ttcatgtcca agcttttcca ggaagtctta gcagctccca caccgcagag 360 

ctcgagatgt ctccctgact tggtcccaga ccccaactat gtgcaagcat ccacttatgt 420 

gcagagagcc cacctgtact ccctgcgctg tgctgcggag gagaagtgtc tggccagcac 480 

agcctatgcc cctgaggcca ccgactacga tgtgcgggtg ctactgcgct tccccancgc 540 

gtgaagaacc agggcacagc agacttctnc ccaaccggca cggcacacct gggagtggca 600 

caactgccac cagcattacc acagcatgga cgagttcanc cactacgacc tactggatgc 660 

aaccacaggc aaanaangtg gcccanggcc acaaaggcca atttctgnct ggaggacanc 720 

acctgtgact tnggcaacct naaacgctat gcn 753

<210> 742 
<211> 767 
<212> DNA 

<213> Homo sapiens 
<400> 742 

tnganccttt cgnttctncn ctcctaagcc tttgctactt gctctttttg caggatccca 60

tcgattcgca ggacatggag cagtacctgt ccactggcta cctgcagatt gcagagcggc 120

gagagcccat aggcagcatg tcatccatgg aagtgaacgt ggacatgctg gagcagatgg 180

acctgatgga catatcggac caggaggccc tggacgtctt cctgaactct ggaggagaag 240

agaacactgt gctgtccccc gccttanggc ctgaatccag tacctgtcag aatgagatta 300

ccctccaggt tccaaatccc tcagaattaa gagccaagcc ancttcttct tcctncacct 360

gcaccgactc nggcacccgg gacatcagtn agggtgggga gtcccccgtt gttcaanccg 420

atnaggagga agttcaggtg gacactgccc tggccacatc acacactgac aganaggcca 480

ctccggatgg tggtgaggac agncactctt aaattgggac atgggcnttg nctggccaca 540

ctggaatcca ngtttggctg tatgcngaat tncacctgga aaagccaagg ttggtntata 600

ganggtcttg atttttacnt anttgncaat aatgggttga gnaaacttaa agaaccagtt 660

taacaataaa atngttaggg acccgttnan aaaatggang tctnccttcc atntnaacct 720

ggannccttn aaacntttnt gngtccnaat tttcgttnca tccannn 767 

<210> 743 
<211> 768 
<212> DNA 

<213> Homo sapiens 
<400> 743 

naancctttc nnncttcgcn attcnaanng ntnggaaagc tcantcgctc natagngcnn 60

gggcttcgcg agnnntggga natnacanag gctngttanc ataccngttt ttnactgcan 120

aggnnnccac angcagcatg gcccatgnna tgnccatgcc antgatggcn ggnggccatg 180

ctgtcagcgg anncgcactt gtgaggancc nntntggann cngtanncna canncacccc 240

cagtctggna cccnagtgtt cttactacac caantgaaac gctggnnagc caagagcccn 300 
gatggcccac gtnccctgca tgganccccc tgancngact ccaccagcct atacangngg 360

aagccanaag cagctgtttt cngccntgcc ctgctgataa tgccttgaag accccatacg 420 

acctnnacgg nctacattga cantnngact gtgncancct ngatcagatn atcctggaac 480

tgggnccnng attccaggan cttnccntca atggacctgg gngcttgtaa tcngttntgg 540 

accatacanc cnttgtanna gataaaagan ngaggaaatc tgaaaccntn gnaataagat 600 

ctgnggcatt agtnnntcaa ggggaggntn ggtnncaaaa cnctatgagg aagaacgatg 660 

gnactatgtc catgnaaggg gaacatntan tgttgganna tgcnatgcaa ncntnnccnt 720 

gatntaacnc tttganaaac tnangcttna caaaggggga aaaanact 768 

<210> 744 
<211> 757 
<212> DNA 
<213> Homo sapiens 

<400> 744 

tnnnnncnnt tnnnnttnat ncntctctca aatcgcttgg ctacttgttc tttttgcagg 60 

gatcccatcg attcgcttga cctctgtact ttaaaggaaa tcactaacca aattttcaaa 120 

gtttcctttt aaatgcgttt agctagaaat ctatgtattt atccctttcc tattttgcat 180 

tcttctccca ctatttttaa aaactcattt acagtagaaa ccattcttct ttctcccaac 240 

agtatccttt gccaagacca tgagaacagt aaggagcatg ttgttggtca gggtttcaga 300 

atacgcgtga tgtcactgag aatgtttgct cacagtcaat aattgtcttt gtggatgtga 360 

taattttgga gatacacttc tggtcagaac tcaggtgaga taatcttgca atactccaaa 420 

tgcagatact ccagccaccc gcaaggttcc aggaaaggac aatgtcctgc gagaaaatca 480 

ggaggcctcc acttcctggg ccacttgaga agttcctggg catgtcacta catgttggtt 540 

gactcagcca tttctcatgc tgntttgttt cttgcggtgg ccacttaacc ccaaagaatg 600

aanggaggat ccacagtgaa agtgcctgag tttctctatg agaccagatg ctgtcgaaac 660 

caaacatctt ttcctttgct ctatnggaac attttaaggg ttggtttgca caactggttt 720 

tcagactngg aagattacca agtttgggtc ccccctn 757 

<210> 745 
<211> 751 
<212> DNA 
<213> Homo sapiens 

<400> 745 

cttnttnnnt ttnntttgat ncctctacnc aaacccttgg ctactngctc tttntgcagg 60 

atcccatcga ttcgaattcg gcacgaggaa naacagacag gtttcaacat ggatggatct 120

gaaatgctgt tgaagcatat catttgcata aaaatcaggg acagtttcca aagaattata 180 

tatttttttc agttggctct ctagttagtt tttttgggag taaggacaaa cctggaatag 240 

atagcaaaac tgaaaatcan cagtgctgat ggtggtacat atgtctttcc tttagcttct 300 

cccctgataa ttcccatctg cttttacttc gggtgagcag agggggatgt gtgtgtgcgt 360 

gtgtgtcagt ctgtttgtga gtgtgttaaa ggctacagac cacagttggt ttaaaatgct 420 

tggaacttcc caaactggct ttactttatg tttatacagt gctcagggtt aacgcagtac 480 

atccatgcca ttgctgtggg aggtatcccc ggatgcatgt gttttgagtc tataaatata 540 

gaaaatatat attggtttct ttttccaact taatangttt attaaagcat gaaatgaaag 600 

ggtgcatatc atgcattcaa gntatntcct aatttttggt ctgacagtgc atgtctttgg 660 

agcatgctga aacaanaatn acacaggaat tgantaaccn gaaagaaaca ttgttaaatg 720 

tccaacattt gttatgcatt tntattgggg g 751 

<210> 746 
<211> 760 
<212> DNA

<213> Homo sapiens 



<400> 746 

tnnnnntntn nnnnntttnn nttcntnnnn ctttgaancc ctttgctact tgctcttttt 60 

gcaggatccc atcgattcgc tgaaacaaaa gatgtatttc aattaaaaga cttggagaag 120 

attgctccca aagagaaagg ctttactggn tntgtcangt aaaagaagtc cttcaangct 180 

tagttgatga tggtatggtt gactgtgaga ggatcggaac ttctaattat tattgggctt 24 0 

ttccaagtaa agctcttcat gcaaggaaac ataagttgga ggttctggaa tctnagttgt 300 

ctgagggaag tcaaaagcat gcaagcctac agaaaagcat tgagaaagct aaaattggcc 360 

gatgtgaaac ggaagagcga accangctag caaaagagct ttcttcactt cgagaccaaa 420 

gggaacagct aaaggcagaa gtagaaaaat acaaagactg tgatccgcaa gttgtggaag 480 

aaatacgcca agcaaataaa gtagccaaag aagctgctaa cagatggact gatnacatat 540

tccaataaaa tcttgggcca aaagaaaatt tgggtttgaa gaaaataaaa ttgatagaac 600 

ttttggaatt ncagaagact ttgactacct ngactaaaat attccatggt ggtgaaagat 660 

tttcaagctt gngaatttgt aaattttnaa ctattatcta actaatgtnc tgaattgccn 720 

ttggctgtac tgggttatca ttttattaat ggtaaataaa 760 

<210> 747 
<211> 786 
<212> DNA 
<213> Homo sapiens 



<400> 747 

tnngncttta nnccntttnn attgnnnnnn nttgaaaccc ttggcnactn gctctttntg 60 
caggatccca tcgattcgaa ttcggcacga ggaggctgtg tcaaagaatg aatggaacgc 120

ctactatgag gaggtgggtg tacgtnctag anggagatcg agtacatgat ccagaagctc 180 

cctgagtggg ccncggatga gcccgtggag aagacgcccc anactcanca ggacgagctc 240 

tacatccact cggagccact gggcgtggtc ctcgtcattg gcacctggaa ctaccccttc 300 

aacctcacca tccagcccat ggtgggcgcc atcnctgcan ggaactcagt ggtcctcaag 360 

ccctcggagc tgagtgagaa catggcgagc ctgctggcta ccatnatccc ccagtacctg 420 

gacaaggatc tgtacccagt aatcaatggg ggtgtccctg agaccacgga gctgctnaag 480 

ganaggttcg accatatcct gtncacgggc agcacggggg tggggaagat catcatgacc 540

gctgntgcca agcacctgac cctgtnacgc tggaactggg aaggaagagt ccctgctacg 600 

tgggacaaat aactgtgaac tggaccttgg ncttnctaac attggncttg gggggaaatt 660 

catnaacaag ttngccaana cctgcgtggg cccctgaaat acattctttt nggaccccct 720 

tgnatccaga accccaattg nnngnngaaa acttnaaana aantnncttt naaaannntt 780 

tttnct 786 

<210> 748 
<211> 722 
<212> DNA 
<213> Homo sapiens 



<400> 748 

tggaactngc tctttntgca ggatcccatc gattcgaatt cggcacgagg aggaagaggc 60 

ctgctccact tgtctgggaa cctgggcagg aggcacagag gaagccaagg cctggagctg 120 

caggtccccc ggcatctctc tctgtcccgg cagcccagga tggcctggtg cccccacctg 180 

ctgcagcagg agccccaagg agtgctagct gagggtggtt gctggggtgg tcctcatgga 240

cagtgaggtg tgcaagggtg cactgagggt ggtgggaggg gatcacctgg gttccaggcc 300 

atccttgctg agcatctttg agcctgcctt ccggtgggag canaaaaggc cagaccctgc 360 
tgagttanag gctgctggga tccactgttt ccacacancn ggaaggctgc tgggaacagg 420

tggcanagaa gtgccatgtt tgcgtngaac cttgcantct tncanctggg gactggtnct 480

tgctgaaacc cacgagctgn acantnanga gctgtccanc ttgcttggct cactgngacc 540

aggaaagcct gtctttggtt agctcgtgtc ttctgcagga aaaaaaaaag gatgtgtcat 600

ttggccatga tatttgaaaa aggggaagga tngccnaant ttgtttncca tttattccag 660

tanttggaaa attttttgac cccctnngct taattctttt gcaanaacta ctggggggtn 720

722



<210> 749 
<211> 821 
<212> DNA 
<213> Homo sapiens 

<400> 749 

tttnaanncc cttgctactn gttctttttg caggatccca tcgattcgtn gacatagaaa 60 

acatacagta agaatatggt attataatct tacggggacc actgtcaaat cgcggtctgt 120 

ctttgaaaag ttgtnatggc ggcgcatgac tataaatacc ctagctggtt agcatttaca 180 

ttccttgcca gggagtttga aatttatnct nggcgggctg nctttaggnt ttaggtagag 240

ttaaagaggt aaagcacatg tttgccacaa cccaggaaag tatttttaag aaagatttgg 300 

attttcctac ctttagagat ctaaaaaaaa tttaatataa aaaatcattt tgagntggtg 360 

tttattacta gttcagaatg agtggctgct gaagggggcc cccttgnnat tttcattata 420

acccaatttt ncactttatt ttgaactctt aagtcataaa tgtataatga ctttatgaat 480 

tagcacaggn taagttgaca ctttgaaact ggccatttct gnattacact atcaaatagg 540 

aaacattgga aagatnggga aaaaaaattc ttattttaaa atggcttaga aaagttttca 600 

agattacttt ggaaaattct aaacnttnct ttctgngttc caaaactttg gaaaatatgg 660 

tagatnggac ctcattgcca tttaagactg gttttcaaaa gctttccctc aacatttttt 720 

aaaggtgtgg anttttccct ttttaaatat tccataattt aantttcctt ttnaaaggcc 780 

nctnnttttc ccaaacccat ngncttttgg ggnaaatccc c 821 

<210> 750 
<211> 770
<212> DNA 
<213> Homo sapiens 

<400> 750 

gntttnnnnn nnctttnttn nntgnctntt tctaagagct tngcnnatgc tnggtcggca 60 

cgaggcaaca tttgtctaca actctactgt aaaattggaa atgcttttcc acagaaaaac 120 

ctctcaaaat gctgaatgca aaagttggga tcacagaaac attgtgccta tttttggtct 180 

gctggaaact gtatttntac aaggtaatcc ctgttctcaa tatagttcct gtcttgccac 240 

tggcggtttt cttgtagcat ttttctagtt ctgagattgc tactacccaa agtattcatt 300 

tctttcttac tggggtgtcc tctgtcttca cagcctgctt ctggattgta ggttttttcc 360 

tttctttctg ttgagatatt tatggcattt gatagagtca aaccagatgt attgcagccg 420 

gacatactta tgtggcttca gatgtgtaaa ataagtaact tcctatcttt gtctgtctag 480 

ctcaagagtt gactgtggac gaggaatgcc tgtattgatt cattaatgta ataactattt 540 

actgactgcc taccatgtac aaccagaaac acagttccta acctcatgaa cttaccatgt 600 

aacatgggaa gacaagccta agttcttatt tggntggnaa ttgcgataac gctcacagaa 660 

caaattcccg attcctacga acccatgtat aggggggaaa tatttaaggt cccatttaat 720 

actgacattn gcccnccccc ctnntatttt aagctgagaa tctgaaggnn 770 



<210> 751 
<211> 774 
<212> DNA

<213> Homo sapiens 



<400> 751 



cgttnnnttt ccncctttga agcccttttt gcaggacttt cnaatncttg gtagacttta 60

tgtcagttct gtgtagactt tatgtcagtt tttgtcatta tttgaaaatc tattctgaca 120

actttttaat tcctttgatc ttataagtta aagctgtaac aactgaaatt gcatggatca 180

agtaagcata gttttatcca gggagacngc tcnnnggaag ccatagaatt gctctggtca 240

aaaccaagca caccatagcc ttaactgaat atttaggaaa tctgcctaat ctgcttatat 300

ttggtgtttg ttttttgact gttgggcttt gggaagatgt tatttatgac caatatctgc 360

cagtaacgct gtttatctca cttgctttga aagccaatgg gggaaaaaaa tccatgaaaa 420

aaaaaagatt gataaagtag atgattttgt ttgtatccct acccatctcc tggcagccct 480

actgagtgaa attgggatac atttggctgt cagaaattat accgagtcta ctgggtataa 540

catgtctcac ttggaaagct agtcctttta aatgggtgcc aaaggtcaac tgtnatgaga 600

taattatccc tgcctgntgt ccatgtcaga cttttgagct gatcctgaat aataaagcct 660

tttaccttat ctggaaaaaa aaaacattnt anancaaaaa aaaactnnga gccctttana 720

actnttagng agnccntttt ccgtagaatc ccngacntgg ntaaggaanc nnnc 774

<210> 752
<211> 778 
<212> DNA 
<213> Homo sapiens 

<400> 752 

gntttgaann ccnttgtttc gnatcctttt tgnaggactc tgaagncctt tgttcggcnc 60 

gagaagaaac tctgcctcag aaaatgttta cagcttccag tggaatcaaa cataccatga 120 

ccncaattta tccaagttct aacacattag tagaaatgac tcttggtatg aagaaattaa 180 

aggaagagat ggaaggggtg gttaaagacn ttgctgaaaa taaccacatt ttagaaaggt 240 

ttggctcttt aaccatggat ggtggccttc gcaacgttga ctgtctttag ctttctaata 300 

gaagtttaag aaaagtttcc gtttgcacaa gaaaataacg cttgggcatt aaatgaatgc 360 

ctttatagat agtcacttgt ttctacaatt cagtatttga tgtggtcgtg taaatatgta 420 

caatattgta aatacataaa aaatatacaa atttttggct gctgtgaaga tgtaatttta 480 

tcttttaaca tttataatta tatgaggaaa tttgacctca gtgatcacga gaagaaagcc 540 

atgaccgacc aatatgttga catactgatc ctctactctg agtggggcta aataagttat 600 

tttctctgac cgcctactgg gaaatatttt taagtggaac caaaataggc atcccttacc 660 

aaatcaagga agactgactt ggacaccgtt tggaaaatgg gtaaaaacgg tggnttactg 720

gtganttggg gagcnagaac cggacccact ggtatactgg ggantaacaa tttttttc 778 

<210> 753 
<211> 775 
<212> DNA 
<213> Homo sapiens 

<400> 753 

gcttttgaaa cccttttgtt aacgcctttc tgcatgatct tctcgtcctt gaaagggccc 60 

taaaagagat gaacaatacc gtatcatgtg gtttgaatta gaaacccttg tcagagccca 120

tatcaacaac tcagagaaac atcaaagagt cttggaatgt ctgatggcat gcaggagcaa 180 

acccccagaa gaggaagaac gaaaganacg cggctgaaag agggaagaca aagaggacaa 240

gtcagagaaa gcagtgaaag attatgaaca ggaaaagtct tggcaagact cagagagatt 300 

aaaaggaatc ttagaacgtg gaaaagaaga attggctgaa gctgagatta taaaagattc 360 

gcctgattcc ccagaacctn caaacaaaaa accccttgtt gaaatggatg aaactccaca 420 
agtggaaaaa tcaaaagggc cagtgtcgtt attatccttg tggagtaata gaatcaatac 480

tgccaattcc agaaaacatc aggaatttgc tggaccgttt gaactctgtt aataacagag 540 

ctgaactata tcaacatctt aaagaggaaa atgggatgga gacaacagaa aatggaaaag 600 

ccagccggca gtgaagagtg acttgangaa ctaaatttta gcatattgca aaaatatttt 660 

gtgcgggaat tcgatatnag tacttttacc agcaagatgg natngttatg tttgcctgga 720

ctggntttta catttttnaa attttttcag tgnccttttt tggtcctaaa ttatc 775 

<210> 754 
<211> 1032 
<212> DNA 

<213> Homo sapiens 
<400> 754 

ggnntttttc ccaaaaaaaa ggggccccct nggggntttt tncncaanng gncccctttt 60 

tctttgncca gggnaacntt ttttgngaaa aganccccct ttttggatnn accggggccc 120

cccggaaggt tccnaaattt tnagggttna aacccaaatc cttggggaaa aaaaaaaaac 180

ccagggccnt ntntggggnc cccctngggg gggtngggaa aaaaaaaaaa gggggaatgg 240

cccccaaaaa aaaatnnggg gcccctnggg ggaaaaaaaa gggaaagccc aggtngggaa 300

nggaaagggg gaaggntccc ccggggggaa aggaaatggg tgggtnggna atggcccaat 360 

ggttggaaaa ggcccaaacc aatttgggnt ntaaaacaat ttcaacctgg gggggtcctg 420 

gcccanaaaa aatgcngggc acccncgngg ggtctggctt aagaattggt tacaagggca 480 

aagggaaagg gaagagttct agagataaag aactatatgc ttggatgaag tgtgtgaagg 540

gacagcctca tgatcacaaa catttaatgc caacccaaat tatacctggt tctgttttga 600 

cagatcttct agatgccatg cacactctta gggaaaaata tggtattaaa tcccattgnc 660 

attggactaa caaacagaat ttacaagttg gaaattttcc tacaatgaat ggtgtatctc 720

aagttttaca gaatgntctt aatcacagna ataaaatttc tctgtgcatg cctgagtctt 780 

cagcagcaaa aatactcctc cgaagtctga gaaaaatggn ggcagcagcc caagaagagt 840 

gatgtaggca cagataacna aggntaacct cctccagaat ccccagtcac cactgcactg 900 

gttaagcaga acttngcagg agcaaaaaag cccngangan ggaaaaaaaa aannaaaaaa 960 

aactcggagc cctcttagaa ctatangggg ggccgnnnta ccgnangatc cccgacctga 1020 

anaggaaccc cc 1032 

<210> 755 
<211> 798 
<212> DNA 

<213> Homo sapiens 
<400> 755 

ngnnnnnttt nncccnacna aatccctttt ttgaagcctt ctantgnctt catcgtnctg 60 

gtaaattggn tgaattattg tattgaagct tgagctgtat tttnaagtaa tttnggttnc 120 

ccctaagatg ttattatgtt agggacataa cacttttggg aggttgttgt gggagatggt 180 

tgatttaggt tttcaaaagc tagaaataaa atttacatnn ccccggntnn cataaaattc 240 

tgctctaatt gggtggaagg tgctgtatct aacttgtgtt cctnctaagg ttatgtccta 300

ataactattc ttttaggagt atacttctac tttatagaag gttgcttttt ctttttaatt 360 

ttntctaaca aagaaaagaa tnaagtattt attaataaag aaccagaaag cacttgaaac 420 

tgatgttttt aaatgggctc acttanggta gatttattta tctcattaac ttaaaaacag 480 

ctatgtgnat tgaaataagt cacaacagaa cttgaacacc agggtgggtg tctgagcaat 540 

cccctttctt atggggaaaa acaaatggtt cttgtttgaa cangaaggta tcattgcagt 600 

cngcattcac ccgtgtataa ttgnnatata agntgnataa tatgctcgta aaggctnaag 660 

gtnagctgga tctggatgcc ctttnaccaa ttangatttt aacttttaan aataaaattt 720 

naaanctaat tgncnaaata aaaaaaatan naaacttcgg ncctctacaa nttntagatg 780 
ngtcgattnn cgnncanc 798



<210> 756 
<211> 834 
<212> DNA 
<213> Homo sapiens 

<400> 756 

tttgaaaccc ntttnttnaa gcctttttaa tgactttanc gncctttatt cggcacgagg 60 

tccttcagct ggtagcttnc attcgnantt nnanatanta tntgtgcatg cncnnttgaa 120

tttttgtgga agaacagant gcagaagaag gcnaggaaag ccgaagagan tnntncggca 180

ncagaagctt aaagnaggcc aaactggtgg tgcnctttcc tcggnacaga agctggatga 240

ctatggccaa tttggagaaa nagctccagg agatggaggc acggttcgag aaggagtttg 300 

nagatggatc ggatgaaaat gaaantggaa gaacatganc tcaaagatga ngatggatgg 360 

taangacagt gatgaggncc gaagacnctg agctctatga tgacctttta ctgnccanca 420

tgtgacaaat cgtnaanaac agtaaaggcc atgaanaatc acntagaagt caaangaaag 480 

cnnttgggaa aaatggnggn nctttgntaa aaccacnagc tgganggang gaagaannna 540 

aaatttttta agnacctcaa attgattgaa aaatncatta tgatgacaat tcctgnanga 600 

ataaattggn agatgcncta naancaaaan gcnttttttn antnnaaana nacaaannnt 660 

nnagcctntt ngaacntata gtnnannctn cntttanctn tntatcccgg actttnttnt 720

ggataccntt gactnagctt ttggacaaaa ncncnacttt gtattncatt ngnnaaaaaa 780

atgcntttat ttttcgnaaa tttggtgaat ncntaattng ntnntattnn nnnc 834 

<210> 757 

<211> 1062 

<212> DNA 

<213> Homo sapiens 



<400> 757 



tttttccaaa aaaatcnccc ccnttttttg gccttnaana nanngggccc cctttttttt 60

gggccagggg aatnccccca atnccggaat tttccggggt ntttgggggg nttggaaagg 120

gccccttggg gaaagggncn tttccnaagn aaaggggtng gaaaattttn taaatggcct 180

tttngggggg aaaaagcccc ctnggaatnc ccccaaaaaa cccttgggaa aaagggggga 240

aaaagggggg aacctttngg gnaatccttn cccnttnaat aatttggggn aattaaancc 300

ctggtttggg aaagggaaaa gggttggtct tggtcttggg ggaanggaat tgggggccaa 360

nttaaaatgg aaggtttggc canaatnggc cncttcgggg gcttnttcaa aagccaagcc 420

tttgggancc ctgcttcatt tttngggccc tttnctgcca aggaanccca acccttaact 480

tancaggaaa anggagatga aaggccttct tccaaggaag gtaaggtcct ttggctgccc 540

cnacttaaat gctttttgaa antctcttag atgtggnaaa tattttttcc gaaccttgaa 600

atcaactngg tagaatttca attggaagca taatccattg taaaatatat tttagttgat 660

atttggtaaa atgccttttt tggtggtgtg gttngaatcc tgggtttccc aagaatcttg 720

natttcaaat ggtttaacaa angggaagga aaggganctt ttcccttaac cttccctttt 780

tgaccaggaa agattttnaa aagtaccttt ctttttaagg aaaaaaaaaa attaaatttt 840

gaagaaaaat tgggatttgg attttanaaa aaangggaaa aaaaatatna ntattnatan 900

ntcnnannat nnttnatnnt ctanntantt nctntnnnta ntnctnntnt ntnnannnna 960

nannnannaa ataaatantc nnncatnctt anctacanat nccnntcttn nttntannac 1020

tttnannnta nntatctaan tctntcccta ttntaccctn nc 1062



<210> 758 
<211> 845 
<212> DNA 






<213> Homo sapiens 



<400> 758 



aaancccttn tttnaaatcc tttttanang attcatcgat tcgaattcgg nacgaggcgc 60

tagcgtcggn tccgcntggg cccttgcggt gcgctgnggg caggcggtga ggcttacgcn 120

tntgcttacg ggcaaaaacc tgcacacgca ccanttcccg tnnccgttgt ccaacaacca 180

gaaggtgatt gcctttgggg aancttctan gncaacnacn tgaacntatg gacagtgcgc 240

tgntttggac agaantggga acnttnaggn tgntgtgcgc ttcnagcatn tgggcacctt 300

tgtgttcctg tcantcacgg gtgagcanta tggaagcccc atccgtgggg cagcatgaag 360

gtccacggca tgcccaattg caacacgcac aaatacttgg aangccatgg aangcatntt 420

natcaagcct aatgtgggag cccttttgca agtcacgaat taactctnaa nngtntggat 480

ggattgggtg ggantggang gttgcaagtt ngggccnttt tgaaaggcca ctttttggna 540

aaaaactttt gggtttttaa ngggttcntc aaaatgccct ttgnnaattn aaagaaatgt 600

tgggcctatt naaaaaaaan atnatacttt atntaatctn nataataata nttantaata 660

aaantcttnn agccttttta aaanttttta atgaanctct ttattttanc gttanantnc 720

ntaacnttta attaaaggaa taacaatttg ttgaantttt ggtataaana ncccccantt 780

tttaaaattc ntntngaaaa aaaatncntt tattttggta aaaatttgng gaatcnnttt 840

tgctn 845



<210> 759 
<211> 947 
<212> DNA 
<213> Homo sapiens 

<400> 759 

tngggggggg ccccnanttt ggggccccaa acccttnggg gaaacccccc ttnnnnnttt 60 

ttnccntttt gggggggaaa ngcccccccc caaangnaaa aacccntttt nnnnaatttn 120 

ngggnanggg ntntggggnc ccnttaaccc caangggggg gggttttnan cctgggggnn 180 

naaaatnggg ggaanaantn nnnaatgggn antcccttna angggaaaaa naatttnncc 240 

ttaaggnnat gggncattaa tnttnatccc tantggattn caatttcatt cgnattaaag 300 

gcttttactg gnataatcct tnncggcccg cnctggtagt ttaaagtgcc canaanttga 360 

atgggaaatn acgggttttg aaaatcgcac aaagcagtgc cnggcacnga ggngtcacgc 420 

cngtaatncc agcattttgg gaggcctgag gcangcggat cacganggca anagagtcca 480 

gaccattnct ggctaacacn gggaaacccc gggnctaata aaaaatcaaa aattaggntg 540 

gacatggtgg cacgtgccng taatcncagc tacttangga agctggatgc aggaagaatt 600 

gcgtgnnanc cnggccccng tggaangntg cattgatacg aagaaccgtg ccaaatgaan 660 

ttanannctg ggcngaannn gagcggaaaa agccctnttt aaaaaaaaan gggantggaa 720 

aaantggtgc canagncatn nggggaaaaa attttnnnnt tnnttnancg gttttnanct 780 

tgnggaaggc cntctttaat nttggggaaa aggcactttt gggntnggtt ttggaaaacg 840

nntggctttt ccctttnaaa agggaaaaan ggnnttaanc ccctgaaaaa ngngcngnnt 900 

tttaaanggg gnnnnaaaca nggggncttt ggaancccca nnaaacc 947 

<210> 760 
<211> 759 
<212> DNA 
<213> Homo sapiens 

<400> 760 

gnntttctaa tgcttgtnnn nngcntttnt gcaggatccc atcgattcga attcggcacg 60 

agaagatatg cagagatatt ccaggatctt ttagctttgg tgcggtctcc tggagacagt 120 

gttattcgcc aacagtgtgt tgaatatgtc acatccattt tgcagtctct ctgtgatcan 180

gacattgcac ttatcttacc ggctcttctg aagggtctat ttctgaactg gagcagctct 240

ccaattctct accaaataaa gaattgatga cctcaatctg tgactgtctg ttggctacgc 300

tagctaactc tgagagcagt tacaactgtt tactgacatg tgtcagaaca atgatgtttc 360

ttgcanagca tgattatgga ttatttcatt taaaaagttc tttaaagaaa aacagtagtg 420

ctctgcatag tttactgaaa cgagtggtca gcacatttag taaggacaca ggagagcttg 480

catcttcatt tttagaattt atgagacaaa ttcttaactc tgacacaatt ggatgctgtg 540

gagatgataa tggtctcatg gaagtanaag gagctcatac atcacggacg atgagtatta 600

atgctgcaga gttaaaacag cttctacaaa gccaaagaag aaagtncaga aaaatttgtn 660

ccttgaacta gagaaacttg ntntggaaca tttcaaaaga tgaatgacaa tctggattcn 720

ttggtngaca gtgtaatttg gactttaacc ngatgctcg 759



<210> 761 
<211> 752 
<212> DNA 
<213> Homo sapiens 

<400> 761 

cctnactaaa cctttgcnaa ngccnttnnt gctgatccca tcgattcgca ggcctggact 60 

tcgcccccag gcctaggacc gcggtgggtn ttaaccctgc tnctgcccca acagggactc 120 

caatcaatcg gagttctccc cttgccggag ctgcccttca cctttggggc ccgagacagt 180 

cataagggat ggacttacnt ttcttgcagg gaaaaaggtg gacagccgtg tttcttaagg 240 

atgctgaggg catggggcca ggaccagggg agaggcacag ctccttcctg agcagcctct 300

caccactgcc acaaggctcc ctaatgctgg tctctgctcc actccccggc ttcccgtgag 360 

gcangaggca gagccacagc caaggccctg accacttctg tgccagttgt ctaagcagag 420 

cgcctcaggg acgctggaaa tgccttaagg atagaggctg ggcatcacat caaatgggac 480 

tgtggtgttt ggtgaaaacc ttcctgagga tctggattca ggaccctcca tgactggcct 540 

atttactggt tacagctggc cagtgcanan ctgctgctct tttacctttt taggcccctg 600 

taacttncca cctttaaact gcccaanaag catgcctntt ccacaggaag aagggagcag 660 

acagggaaat ctgcctacca anaagggtgt tgtgtgtctt tgtgcccaca cgtggtggct 720

ggggaatgcc tggatggtgc cgtggntgat ct 752 

<210> 762 

<211> 1032 

<212> DNA 

<213> Homo sapiens 



<400> 762 



ttctaatgct tggaaacgcn ttgatgnang atnccatcga ttcgaattcg gcacgagggc 60

aagtggtagt ggcgcttntc gggtgntgtg cttcacgttt tggtctaaag gncgagactg 120

ttgtggcnac ngngnaantn tacnggaang gnttaaantn tnnntgnagt nggaanaatt 180

cnatcngaan gaanttgggg gggntagnnn nggttanatn attgatgaat ggnttcaana 240

tngnaaantt tatnancgan atgnnatant tnnaaangan gaccaactgg gntnanatgg 300

agnannnatn aannggntaa ncnatanana tantncattt ggtanganaa tngangaagg 360

attntcaaat agncatgtng gangatgaac ntnnaggnnn nagaatattt ggataaaatt 420

ggtantatga agatntggnn taataatacc nanaaatnnn nnantttnat nanngangaa 480

ntagganttn atgnctatgn ggatannntn nanntatnat agngataaan tatgatactg 540

tttannntat ntnganttag tnattnaatg ntcttgtnan aanttatttt ncgntagtta 600

gntagnnnta tnnactttgg naancanana tgtaattctc tctanacggg aatntttnta 660

tnntnnntat caagaggtnt ntnnattgna aatantatac nnttgnanaa antatatcna 720

tanaanaaan ggnnattatt ntatatganc aaanaaaaaa ntattgngga nntanattat 780

ctctcatnat ngattatncn gtantgtata atggnnnata antatgtnnn tntaanataa 840

atggatataa gtnttatant atgcncntna aggnggtcng anaantatgt aattatattn 900

angctanata cnatnnanat gtntnactaa atatngntgt gaaangtntg cgnggnaaaa 960

tntgttanta ntnaaacang gtataganat atanatgngn ngaatatcta ctatntgtan 1020

atacttatan ca 1032 

<210> 763 
<211> 817 
<212> DNA 

<213> Homo sapiens 
<400> 763 

aanncccttn tttctaatnc ttggctactc gtnctttctg caggatccca tcgattcgaa 60

tttcggcacg aggggaggga cccttggggn caggttgtgg gtagccagtt gcagtctgtg 120 

gcctccctca gaggtttgga gtcgggcgtg gcatgctgct gttggcctct ttccgaggga 180 

gtgccatcca ctccctgtcc caccgctnnc cctngtgagg acagtgaggg cagtgctacg 240

tggtggggag gtgtgtgtga agccacggaa gggcttcaca gggcaaatgc caaggccagt 300 

gggccccgga cagagtnagg ctccctgggc ggncttgtgt cttggtggcc ctgatcatcc 360 

tgccaatgca naaagccagc aggcaagaga cccctactcc ctttaaggac cattagcata 420 

aacaaaccat tgngttgaat gcaatgatcc aggtgcactt tnagggtaca agctggactn 480 

gttggaacag gattacatgg aaaannggaa angggggcan gctgtctctt gggacatnag 540

taatgtcttt ttacccantt gncactctng aanttcaaan ttggncatgt tttctggggc 600 

ctnctngnaa aagcagtttt ttcaccncat natgaagaaa aaacttgttg gcttgganng 660 

tanngggatt nttgntnana cttnccctaa anggntncct ttnggggcat ttntgaaggn 720 

taaataatgg gggatacctt tttaannttc cttgcagatt taaaaatgtt ccttaaanga 780 

nncctcaatg nttnggtctt nttccaaaaa acnattc 817 

<210> 764 
<211> 777 
<212> DNA 

<213> Homo sapiens 
<400> 764 

taatgcttgg ntctcgnttt tntgcaggat cccatcgatt cgaattcggc acgaggtcca 60 

cggtgctgaa catcatcatc tttgaagact gtaggaacca gtggtctatg tcccgaccac 120 

tacttggctt gatattgctt aatgaaaagt atttttctga cctaagaaac agtattgtga 180 

acagccagcc acccgcgaag cancnggcca tgcacctgtg ttttgagaac ctgatggaag 240 

gcatcgagcg aaatcttctt acgaaaaaca gagacaggtt cacccagaac ctgtcagcat 300 

tccgtcgaga agtcaacgac tcaatgaaga attccactta tggcgtgaat agcaatgaca 360 

tgatgagctg acacctnctt ggactctacc tgtacagagc agcgtccctt tggtttggcc 420 

cagaggggcg aacaattgca agggagaggg cctggctgat cctggctctt ttctccaggg 480 

gtgtggggaa aaatggcaaa ggtcaactag ctgcttcccc aagggaatag gggtgtgagt 540 

acactcacta nggggcaagg cgctgcttgg ttcctggggg gactgggtgg gaaagggtgg 600 

tgnganggag ataaagagat tcaaactgag actccagtct ttccttctgg gggccaccca 660 

aagttgggga gnaaccccct antggtncct gccaacaacc ttgccttggg attaaacatt 720 

ntncattttt ttcantaana tttttgaaca aagggttant attgnctnaa gtttann 777 

<210> 765 
<211> 774 
<212> DNA 

<213> Homo sapiens
<400> 765

ntttctaatg cttggctctc gntttgatgc angatcccat cgattcggga aatgcaagtc 60

aaaacagctt tgtaggtctc agagtttgct tttaagaagt agtacaagaa ggaatagtta 120

tatcaataca ccagtggctg aaattatcat gaaaccaaat gttggacaag gcagcacaag 180

tgtgcaaaca gctatggann gtgaactcgg agagtctagt gccacaatca ataaaagact 240

ctgcaaaagt acaatagaac tttcagaaaa ttctttactt ccagcttctt ctatgttgac 300

tggcacacaa agcttgctgc aacctcattt agagagggtt gccatcgatg ctctacagtt 360

atgttgtttg ttacttcccc caccaaatcg tagaaagctt caacttttaa tgcgtatgat 420

ttcccgaatg agtcaaaatg ttgatatgcc caaacttcat gatgcaatgg gtacgaggtc 480

actgatgata catacctttt ctcgatgtgt gttatgctgt gctgaagaag tggatcttga 540

tgagcttctt gctggaagat tagtttcttt cttaatggat catcatcagg aaattcttca 600

agtaccctct tacttacaga ctgcagtgga aaaacatctt gactacttaa aaaaanggga 660

catatttgaa aaatcctggg agaanggact atttggctnc ttttgccaac ttacttcata 720

ctggnaagcc agattantng ctcaaggaag ttttgatgag ccaaaaaagt tttn 774



<210> 766 
<211> 779 
<212> DNA 

<213> Homo sapiens 
<400> 766 

ttnnncgctn ntgaanaccc cttctcctna aatccttttt aantnccttg ctgnntgatc 60 

ccatcgattc gcgaaattcg gtggcgccac gtccgcccgt cttngccttc tgcatngcgg 120 

cttcggcggc ttccacctag acacctaaca gtcgcggagc cggccgcgtc gtgagggggt 180 

cggcacgggg agtcgggcgg tcttgtgcat cttggctacc tgcgggtcga agatgtcgga 240

catcggagac tggttcagga gcatcccggc gatcacgcgc tattggttcg ccgccaccgt 300 

cgccgtgccc ttggtcggca aactcggcct catcagcccg gcctacctct tcctctggcc 360 

cgaagccttc ctttatcgct ttcagatttg gaggccaatc actgccacct tttatttccc 420 

tgtgggtcca ggaactggat ttctttattt ggtcaattta tatttcttat atcagtattc 480 

tacgcgactt gaaacaggag cttttgatgg gaggccagca gactatttat tcatgctcct 540

ctttaactgg atttgcatcg tgattactgg cttagcaaat ggatatgcaa gttgctgatg 600 

attcctctga tcatgtcagt actttatgtc tgggcccanc tgaacagaga catgattgna 660 

tcatttttgg tttggaacac gaatttaagg cctgctattt accctggggt atccttggat 720 

tcaactatat catcggangc tcngtaatca atgagctaat tggnaaattn ggtggacac 779

<210> 767 
<211> 799 
<212> DNA 

<213> Homo sapiens 
<400> 767 

gnnnnnnttn cccgcctttn gaaancccct tctttctaat gcttgttcaa cgcctttgct 60 

gcaggatccc atcgattcgt ggatactgac aatggtggca ggcatttcaa gccttttaaa 120 

ttagtacttt ttgtcgnctt gcttattaaa attttgttaa ttttagcaaa gaccaattgt 180 

tgtgataaac tggtgttttt nggatgcttc aagcacacgt taaccaatcn gccaatnccc 240 

ctttnggttc ctcccattgn tctaaaatag gactttcata ttattaaaac ctcaaaagat 300

gatccaccca ggatgaacaa agatcaccaa ggggaaagaa aacatttttt atctttacag 360 

aaaacatgtt aagattatat atagatgtat tctttacatt ggatattgta ttagagtcct 420 

ccttacaaga aatgaaatag gtttttagca ctcttagcat tagagttcct agattggtgt 480 

tgatagctac agttttaaaa tgtataacct gaaaatgaag gttaattttg cattgtaaag 540

agcacatttg atctatgtaa aaagtgtcca tttggtgtat tttttttaaa aaagagaaag 600

cactttcata ttaagtagca tgtgtatgaa tttaagattt tcatatttgn tgngtctggt 660

attcagtgaa gtaaaattga gcattttaaa agtttggtgg atggcaacca ttaactatta 720

aattaaaagc caccttatac tctgctgctt aacttgcttg naaattgcac ctttggnacc 780 

ctgcacattt tcatattnc 799 

<210> 768 
<211> 826 
<212> DNA 
<213> Homo sapiens 

<400> 768 

gnnnntnncn ccctttctaa tggcttgttt ctaaatgctt tttcnaatcc ttggtacatg 60 

atcccatcgn ttcgcgctgt gcttgagacc aacctgacgg gtaccttcta catgtgcaaa 120

gcagtttaca gctcctggat gaaagagcat ggaggatcta tcgtcaatat cattgtccct 180 

actaaagctg gatttccatt agctgtgcat tctggagctg caagacnggg tgtttacaac 240 

ctcaccaaat ctttagcttt ggaatgggcc tgcagtggaa tacggatcaa ttgtgttgcc 300 

cctggagtta tttattccca gactgctgtg gagaactatg gttcctgggg acaaagcttc 360 

tttgaagggt cttttcagaa aatccccgct aaacgaattg gtgttcctga ggaggtctcc 420

tctgtggtct gcttcctact gtctcctgca gcttccttca tcactggaca agtnggtgga 480 

tgtngatggg ggccnggagt ctctatactc actcgtatga ngtccagatc atgacaactg 540 

gcccaaggga gcangggacc tttctggtgt caaaaaagat gaaaggagac ctttaaggag 600 

aaagctaagc tcttgagctt gangaaaaca aggggtcctt ccatncccca aatgccttta 660

catttttgga ggatatgcct nnnggnacnt ttttaaaaaa gcttatnagt tngntatggg 720 

naaaacaatt ttttccttan tttttaaagt ggntaataaa tnaaantcct aatggnaaaa 780 

aaactantcc ttggnaanta ttttccaggn cttnantgtn cccncn 826 

<210> 769 
<211> 802 
<212> DNA 
<213> Homo sapiens 

<400> 769 

gnnnttctaa tgctgttcta atgcttgtca atncttgana cgttcatcga ttcgggaagc 60 

caagcctgga gctgcaggtc ccccggcatc tctctctgtc ccggcagccc aggatggcct 120 

ggtgccccca cctgctgcag caggagcccc aaggagtgct agctgagggt ggttgctggg 180 

gtggtcctca tggacagtga ggtgtgcccg ggtgcactga gggtggtggg aggggatcac 240 

ctgggttcca ggccatcctt gctgagcatc tttgagcctg ccttccggtg ggagcagaaa 300 

aggccagacc ctgctgagtt agaggctgct gggatccact gtttncacac agcgggaagg 360 

ctgctgggaa caggtggcag agaagtgcca tgttngcntt gagccttgca gctcttcagc 420 

tggggactgg tgcttgctga aacccaagag ctgaacagtg aggaggctgt ccaccttgct 480 

tggctcactg ggaccaggaa agcctgtctt tggttaggct cgtgtacttc tgcaggaaaa 540 

aaaaaaagga tgtgtcattg gtcatgatat ttgaaaaggg ggaaggangc cnaaanttgt 600 

tcccatttta ttcaagtatt ggaaaatatt tggcccccct ttggctgaaa ttctttttgc 660 

aanaactaac tgngtggctt gttcncttac cctttttcan gnttaattgg tttnaatttt 720 

ttgcattgaa attaaagacg tttttaaatt tcntttncaa naacaaaggg cttanatncc 780 

ngantcnana nattggnant tc 802 

<210> 770 

<211> 1157 

<212> DNA 

<213> Homo sapiens
<400> 770

cccttttttt tttttcccnn aaaaaaanat tggggncccn tttttttggg nttttttttc 60

ccnaaaaaaa aattgggncc ctttttgggg ggnntnaaaa aaannnnnnn nccccccntt 120

tttttggggn nnnnnaaann tnnnnnncnn nttnnnnnnn rmnnnnnnnn ggnnttnnng 180

gggnnnannc chccccccaa tttcccggnn attnttccgg gcccaatttt tgggaccccc 240

cagggnnnag aataaggccc ggggnttttt tttncnaggg ncccaaaagg gcccttgggc 300

caaaggnaaa tccnttggga aattttggga atttggccct tggnanntcc caataccggn 360

aaaaatgggg aaangnaaaa aaggnttncn ccaaattggt tggggggggg ttccaaagat 420

tttcattggg ggtncntggg ctttcaaccc naaggnaang ggtttncttt caaaaaatta 480

cctttaattg ccattaagca attcccaang gttannaaag ggtgtttntt ctcanctatg 540

cttcganagn gaaaatcaac naatggaaaa tgtgttgtaa ttggtctgca ntctacanga 600

gaagctagaa cattagaagc tttggaanag ggcggnggag aattgaatga tntttgnttc 660

aactgccaaa gagtgttgtt gcagtcactc atttgaaaaa ctattttcct gctccagaca 720

ngaaaaaaac tttatangtt tactaggaat cgatttgaca agcnttcang taacaaacag 780

ttctnccaag agatatcctt gttnaagaan nattanaata ncnngaaagc ggaaanngtg 840

aataaatnnc ttcnagaagc ccaaaaannc acngaanaag tatggtgggn cttactggtt 900

agcacgttct tgacnacaga tggaaattga antctngatt ncctctgatt antgaatgaa 960

aaggtgacta ttnaanagct cttnanatac catgagtntt tggancattg attgaccaat 1020

ttcaanncca tttttangat ngaattntta tnaatgattn attnanaant gannnccttn 1080

gtttaaatta nnaaanaanc cntcnaaana cnanagggga tttataaaat ctaataanan 1140

ttttnnncnt ntnaaim 1157



<210> 771 

<211> 760 

<212> DNA 

<213> Homo sapiens 

<400> 771 

ngncctttna tnccttntga anccntttgn aattnctcnn nnngttgatc ccatcgattc 60 

gaattcggca cgaggtggaa gaaaattttt tgctgcttct ggttnccaga aaagggagcc 120 

attttaacag acacatctgt caaaagaaat gacttgtcga ttatttctgg ctaatttttc 180 

tttatagcag agtttctcac acctggcgag ctgtggcatg cttttaaaca gagttcattt 240 

ccagtaccct ccatcagtgc accctgcttt aagaaaatga acttatgcaa atagacatcc 300 

acagcgtcgg taaattaagg ggtgatcacc aagtttcata atattttccc tttataaaag 360 

gatttgttgg ccaggtgcag tggttcatgc ctgtaatccc agcagtttgg gaggctgagg 420 

tgggtggatc acctgaggtc aggagttcga gaccaacctg accaacatgg tgagaccccc 480 

gtctctacta aaaataaaaa aaaaattagc tgggagtggn ggtgggcacc tgtaatccta 540 

gctacttggg aggctgaacc aggagaatct cttgaacctg ggaggcanag gttgcaagtg 600 

agcccgagat cgtgccattg cactccaacc agggcaacaa gagtgaaact ccatcttaaa 660 

aaanaaaaan gaaaactcga gcctctagaa ctatagtgag tcgtattacg tagatccaga 720 

catgataaga tacattgatg aattttggac aaaccccann 760 

<210> 772 
<211> 777 
<212> DNA 

<213> Homo sapiens 
<400> 772 

gaaancccat ttnnnnnttc cncttcnaat cccttggnta ctcgntcttt ntgcaggatc 60 
ccatcgattc gaattcggca cgagctctac taaaaataca aaaattagct gggcgtggtg 120
gcacacacct gtaatcccag ttacttggga ggctgaggca caagaatcgc ttgaacccgg 180
gaggcggagg ttgcagttag ccaagatcgc cctgctgcnc tccagcctgg gcaacagagg 240

gagactctgt ctccaaaaac aaaaacaaaa actgttagtg aaggttccct gggacttttg 300 

atattttaaa aattgttctt atgactagta gataaattca ttgccataat gaggctagct 360 

cccagataaa cagtgtattt tcttcttttt tttttttggt gagtggtcca gagctttaag 420 

ctacttttcc agtagtttgc cactttctcc gaggtanttt ggctgctctt tcagtaatgc 480 

taattgtgtg tcaaattttg tctacaacag taggcaacag atgaagataa gttggttgaa 540 

tgtctccagc actatgcatc cctattttct atttattggg gtacactcac tttcagtaat 600 

gngtttcaaa ctggtatttt ttaaaaaaca aatcaatgta aggactgaag ttgaaatanc 660 

caatgtaata aagttaatta gggttatttt taaaaaaaan aaaaataana actcnagccc 720 

tctagaaact atangtgagt cgnnttacct tgaatcccag accttgataa gatacnc 777 

<210> 773 
<211> 782 
<212> DNA 

<213> Homo sapiens 



<400> 773 

gnntnnattc ccctttcnaa tncttggcaa acgctctctn tgttggatcc catcgattcg 60 

aattcggcac gagacagtct cgggtttcat attttgctgt ttttgatgga catggaggaa 120 

ttcgagcctc aaaatttgct gcacagaatt tgcatcaaaa cttaatcaga aaatttccta 180

aaggagatgt aatcagtgta ncncccgccg tgaagagatg ccttttggac actttcaagc 240

atactgatga agagttcctt aaacaagctt ccagccagaa gcctgcctgg aaagatgggt 300 

ccactgccac gtgtgttctg gctgtagaca acattcttta tattgccaac ctcggagata 360 

gtcgggcaat cttgtgtcgt tataatgagg agagtnaaaa acatgcagcc ttaagcctna 420 

gcaaagagca taatccaact cagtatgaag agcggatgaa gatacagaaa gctggaggaa 480 

acgttaaggg atgggcgtgt tttgggcgtg ctagangtgt cacgctacat tggggacngn 540 

cantacaagc gctgcngtgt nacctttgtg ccccgacatc agacgctgcc agctnacccc 600 

caatgacagg ttcattttgn tggccttgtt atnggctctt naaaggnctt tnccccatna 660 

aggaagccng tggaactttc atcttgncct gnantcgang atnaaaaagn atncagaacc 720 

cggggaaggg gaaaatcctn aannctgact tcccggtttc caaaccagtn ttgnaacaaa 780 



<210> 774 
<211> 793 
<212> DNA 
<213> Homo sapiens 

<400> 774 

gnannngccn cgnttttgat tccccttntt caaatccttt gnnaatcgcc ctcnctgttt 60 

tgatcccatc cgattcgaat tcggcacgag atggcagttg cttttgaagt atatgatgnn 120 

ttcctccact acaaaaaggg gatctaccac cacactggtc taagagaccc tttcaacccc 180 

tttgagctga ctaatcatgc tgttctgctt gtgggctatc ngcactgact cagcctctgg 240 

gatggattac tggattgtta aaaacagctg gggcaccggc tggggtgaga atggctactt 300

ccggatccgc agaggaactg atgagtgtgc aattgagagc atagcagtgg cagccacacc 360 

aattcctaaa ttgtagggta tgccttccag tatttcataa tgatctgcat cagttgtaaa 420 

ggggaattgg tatattcaca gactgtagac tttcagcagc aatctcagaa gcttacaaat 480 

agatttccat gaagatattt gtcttcagaa ttaaaactgc ccttaatttt aatatacctt 540 

tcaatcggcc actggccatt tttttctaag tattcaatta agtgggaatt ttctggaaga 600 

tggtcagcta tgaaagtaat agagtnttgc ttaatcattn ggaattcaaa catgctatat 660 

tttttttaaa aatcaatgtg aaaacataga cttattttta aattgntacc aattacaata 720 

aaaataatgg gcaattaatt tttnaaaact ttttaaaata gnatgctcat atttttaaaa 780

ataaaanttt tnc 793

<210> 775 
<211> 1009 
<212> DNA 

<213> Homo sapiens 
<400> 775 

agcntttttt ngaanttccc ctttnnttna aaaatcccct tttttggcaa aaaattnccc 60 

ccntntntna nngtttttnn gatncccaca tncngnaatn tncgggcncg ggnnactgnc 120 

nannggcncc cttcgggggn ccngtgntaa gncnatnctt gtntntanaa agntggnnnt 180 

nttttncgat ngngactatt gncnacnctc ttccntnttg gcagngngtc tgganggttg 240 

nggtngctca tntggntaan ccnatcctgg ngaccaanng gccgnggtgn gcntgcaagc 300 

tttgnccacn tgggaaancc gnnagtggtn gtctcanttg cntgntgggn ncntgncccc 360 

atcttgnctg ctgnancctt ggggagcagg nnctnggtng tggtnctgcc tgcttgctgc 420 

tngttccccg ggcatgcgtn nncannaagg gncatgcntn gggcaanaag gtgcgtggnc 480 

ancgtnngna tnnnnaggac caccntgggt cgngaatcnn tgggttncct gataggaacc 540 

ntnaannnct gcngntttta ttaaatggga nnanangggt ncanttcaaa gccagtnnaa 600 

tgcccttatg gaanggngtg natnacatan cnnnntatgt gtcntanann angaaatcgt 660 

tnnncaaatt tnnacaanaa tntttntaan aaagggtatt tnantntngg tgaaanaaca 720 

angntttaaa gtnaaatgnt tntancanaa ttaantaaac nggtnttnat gattncttac 780 

naaantaacn atncnnaagc atttacngct tanangtccn cnngatactn ncanaatatg 840 

gnnnnaattn tannanatng cgataatctn gnananactn tcatnnnnna tngtgtaatc 900 

antanntacn tgatttnnnt naaatgaaaa catntgatnc aagattaatn cattanntat 960 

acnaaaatnt tcanatanta natntacata taatggtttc naataaacn 1009 

<210> 776 
<211> 785 
<212> DNA 

<213> Homo sapiens 
<400> 776 

gnnnnnnntt cccctttcta atcncttgga nntcgctctn tntgnangat cccatngatt 60 

cgaattcggc acgagagaaa cacaggtgtc gtgaaaacta cccctaaaag ccaanatggg 120 

aaaggaaaag actcatatca acattgtcgt cattggacac gtanattcng gcaagtccac 180 

cactactggc catctgatct ataaatnngg tggnntcgac aaaagaacca ttgaaaaatt 240

tganaaggag gctgctgaga tgggaaaggg ctccttcaag tntgcctggg tcttggataa 300 

actgaaagct gagcgtgaac gtggtatcac cattgatatc tccttgtgga aatttgagac 360 

cagcaagtac tatgtgacta tcattgatgc cccaggacac agagacttta tcaaaaacat 420 

gattacaggg acatctcagg ctgactgtgc tgncctgatt gttgctgctg gtgtnggtga 480 

atttgaagct ggtatctnca agaatgggca nacccnaaag catgcncttn tggcntacac 540 

actgggtgtg aaacaactaa ttgtcggngt taacaaaatg gattcacttg accaccctan 600 

aggccngaag agatattgan gaaattgtta aaggaagtca gcacttncat taagaaaatt 660 

ggcctacaaa tccnnganac aataancatt tgtgccaatt tnngggttgg gaatgggtga 720 

ccaacattgc ttggagccca agtgnttaac aatgccttng gttnaaaggg antggaaaag 780 

ttacc 785 

<210> 777 
<211> 1366 
<212> DNA 

<213> Homo sapiens
<400> 777

ananaanann annrmnnnaa ggnnaanana nnnnnannnn naanangnaa anananannn 60 

tnnanaannn aagnngnttc nanncttttc aaagcttgga aaacgcannc aannnnnggg 120

aaagcaagaa agaacagcta aagnnngncn cagaganagc ttttangang tntangaaga 180 

aggaatannn gnggncaata nnnnannnnc ngaaantatc atganacnca aatganggan 240 

aaggcagcac aagctgngca aacagctatn gngacggggg ggccgggaga gnctaaangn 300 

cananatnca atatataagg actgcatgcn aagggatacn aaacaagnan actnntctag 360 

gaagaaataa ntnttgacnt ancnnacntt cataacgaat agcaccgtac atcgagncaa 420 

ccaactaana ggnctaagga aatggcaaan nacnttaatn nntgagcnaa ggaagggngt 480 

atngnccnan anngaaatgc ntcntaacca anttttaatn gtaacggnat nangatnaan 540

ncntnanccc acgcaactca aaaanattac attanntaaa aaaganctat ancaaaacta 600 

gtnttcaaaa tngnacgagn aaatgggnaa nantttntnn ccgggaaaat tggnagagat 660 

ccanaaacac tggntnaggg naatanatgn ccgcccnaaa aaaccntnac cataggnatn 720 

ggctancata gangagatat ancnatnagg ggatcaanan cntaggnatt ngaaaantaa 780 

ncgagttaaa acancnagat nnggnantac gaganatagc ttggacgngt atcaaatcgg 840 

accctnggat gggcntangg aaaaanaaaa aggntngagn gaanttcctc anaggaanng 900 

tganagagcn aaanaanatn aagggccttg gngaaaangg aaaaacagat agngtcatnc 960 

natatatncn natgananan tggggnaatn taatctacnn tanatnnggg ggaaaaaaat 1020 

cnnncatgac nnnaaaanga gntaatgnna nnatgagaga ttaaacnnat aaaacnagag 1080 

aantttgngn aaanctgnga gataaaaaat aaataaattc tntntggaac atntanaccn 1140 

tctatnnaaa aaaaagaggg gaaaccatct ngattatgca cananaaatn tnacntngng 1200 

gaaataaatn gggnacaata acatatatgn ggatgtacan tnntggncng aaaaactata 1260 

caacntgaga nnnnacnang atataaagcn nnaggnagtn tatangggca tcatcaangg 1320 

gaagntataa agcaactgna nnctcatata naaaactgnn cnncaa 1366 

<210> 778 
<211> 775 
<212> DNA 

<213> Homo sapiens 
<400> 778 

gntttnnatn cctctttcta atnncttggc tactcgntct ntctgnanga tcccatcgat 60 

tcgaattcgg cacgagagat tatgagcatg tagaagatga aacttttcct cctttcccac 120 

ctccagcctc tccagagaga caagatggtg aaggaactga gcctgatgaa gagtcaggaa 180 

atggagcacc tgttcctgta cctcccgccg ccgaacagtt aaaagaaata tacccaagct 240

ggatgctcag agattaattt cagagagagg acttccagcc ttaaggcatg tatttgataa 300 

ggcaaaattc aaaggtaaag gtcatgaggc tgaagacttg aagatgctaa tcagacacat 360 

ggagcactgg gcacataggc tattccctaa actgcagttt gaggatttta ttgacagagt 420 

tgaatacctg ggaagtaaaa aggaagttca nacctgttta aaacgaattc gacttgatct 480 

ccctatttta catgaagatt tttgttagca ataatgatga agttgcggag aataatgaac 540 

atgatgtcnc ttctactgaa ttagatccct ttctgacaaa cttatctgaa agtgagatgt 600 

ttgcttcttg agttaagtag aagcctaaca gaaggagcca accacaaaga attgagagaa 660 

atnaacaact gggccttngg aaagaaangc nggccaagct gcttgagtaa tagtcaganc 720 

ctanggaaat gatntggtta atgaattcac cccaggncac acccngttga agagc 775 

<210> 779 
<211> 781 
<212> DNA 
<213> Homo sapiens 

<400> 779

gcttttnann nccctncttt cnaancctct tcaaatcctt ggntatcgtt ctntctgnng 60

gatcccatcg attcgaattc ggcacgagag acaaagaaaa aggtggcaat catagaagag 120

ttagtagtag gttatgaaac ctctctaaaa agctgccggt tatttaaccc caatgatgat 180

ggaaaggagg aaccaccaac cacattactt tgggtccnnt nctacttggc acaacattat 240

gacaaaattg gtcagccatc tattgctttg gagtacataa atactgctat tgaaagtaca 300

cctacattaa tagaactctt tctcgtgaaa gctaaaatct ataagcatgc tggaaatatt 360

aaagaagctg caaggtggat ggatgaggcc caggccttgg acacagcaga cagatttatc 420

aactccaaat gtgcaaaata catgctaaaa gccaacctga ttaaagaagc tgaagaaatg 480

tgctcaaagt ttacaaggga aggaacatca gcggtagaga atttgaatga aatgcagtgc 540

atgtggttcc aaacagaatg tgcccaggct tataaagcaa tgaataaatt tggtgaagca 600

cttaagaaat gtcatgagat tgagagacat tttataggaa atcactgatg accagtttga 660

ctttcataca tactggatga aggaagatta cccttagatc atatgtggac ttattnaaac 720

tatgaagatg tactttnaca gcatncattt tacttcaagg cagcaagaat tgcttttaga 780

c 781



<210> 780 
<211> 783 
<212> DNA 
<213> Homo sapiens 

<400> 780 

gnntttnnan nnccngnttt ctaatnctnt tcnaatnctt tgnnancgtt ctntatgcan 60

gacccatcga ttcgggaatc tcctagaaaa gttgtgattt tcgagccata tccttctgtg 120

gtagatccta atgatcctca natgttggcc ttcaacccca ggaaaaagaa ctatgatcga 180

gtaatgaaag cactggatag cataacttct atcagcnaaa tgacacaagc accatatctg 240 

gaaatcaaga agcaaatgga taaacaggac ccccttgctc atcccttact gcaatgggtt 300 

atatcaagta atagatcaca tattgtgaaa ctgccagtta acaggcaatt gaagtttatg 360 

catactccac atcagttcct tcttctcagc agtccaccag ccaaagaatc caattttaga 420

gctgctaaaa aactctttgg aagcaccttt gcatttcatg gctcacacat tgaaaactgg 480 

cactccatcc tgaggaatgg tctggttgtt gcttctaata cacgattgca gctccatggt 540 

gcaatgtatg gaagtggaat ctatcttagt ccaatgtcaa gcatatcatt tggtactcag 600 

ggatgaacaa gaaacagaag gtgtcagcca aggacgagcc agcttcaagc agtaaaagca 660 

gcaaatacat cacagtcacn ggaaaaaagg acagcaatcc caattcctgc caaagccgta 720

acttaaaatg catagncctt atgtgaaagg gatcaccttc atctggacct gcacaaacat 780 

ggc 783 

<210> 781 
<211> 796 
<212> DNA 
<213> Homo sapiens 

<400> 781 

gnnntncgcc ttcaatnctn ttcantctnt tcaatctttg aatcntcttt gttgtccatc 60 

gttcaattcg gacgagaccc ttatggcaga tccccacagt ctggggcaga agaggcgtcg 120 

aggngccaga agtgncggca gcagcagccg cagcagccca aagagaggca agagaaagag 180 

aaagcggccg gtggaggggt nnncggaaga gctggtcccc gtggttgagc tgggtccccg 240

tggttgaatt ggaagaggcc atagccccag gctcagaggc ccagggcgct tgggtctggt 300 

ggggacgcgg gggttgcccc caatggtgca gctgcagcag tcaccactag ggggtgatgg 360 

agaggaaggg ggccacccca gggccattaa caaccagtac tccttcgtgt gagccaaccc 420 

cacccgctcc acccttttta aaccccccag cccttgctcg tgagattggg cttgggtagg 480 

gacagaagag gcccgaaatc cctcccccat gcttnctgac ccttgtttgg ccaaagggca 540

tctttgatgg tacaaagcag angcttcggg anaagcttcc gtcacaacac tncaaggtcc 600

cttcccaagg gcaaggggat ttnggcttca tgagctnctt tgaggggctt ttttttggtc 660

annccccacc ttnggggcca tttttcccaa ttaacttacc cccaacccca agncanggtt 720

nagggggnaa agggctttcn anttccatta aagggggttt gtttgttgnt gttttaaacc 780

aaaatgggga aancnn 796



<210> 782 
<211> 886 
<212> DNA 
<213> Homo sapiens 

<400> 782 

cggnnnnnnn gnagcccntt tggnaaangc ctctaaggga aangcctttt tgaaaacnan 60 

angaaaacct ntgggaaaag nccncannna ttttngngaa annggcnnga gcnnanantn 120 

ggacacngtt ntaannnnan nagngnnngt tttnnganan agggnnnnna gnggnannna 180 

ngngnnggag ggaannaagg nanagnannn ggnagnnaag gnnnnaaaga agnagnnang 240

gaganggnnn gnggnggggc atgangnggg nncagaggca cgaggagccc aagaccatca 300 

cngangagna ngagcagggn accnacatnn acnnggacna cgagaagngg ggccagcgga 360 

agaaggaagg nagnacctng agnaccgnta ccaggaggan cgggaccnac agngacanag 420

gnccnnnncn anacgganrm nanaaacgng aagcaggann nnnanggacc aagggaaggg 480 

nncngnncnn ggaaaganng ggagggaggn ncgaaggcaa aggggggann cgnnannncc 540 

aggaagnang gaaggggggn cgggaggnna annganaaga ngaaccnngg gggnncaggg 600 

gggcgagggn agcanaannn nnccnnagnc aanngaaggg gananaagag ngggaaaann 660 

aannagaaag agggaaaana agnnaaggaa anaaaagang ngnnaannng gganaaaana 720 

nggnganann gnngganaaa ngngnannan aaaanngagg aggncanngg gnaaanaana 780 

nggggagggn nganananag ngaarmagac aaggaanagn gaannagngn anagnanngn 840 

gnannaaagg nannggggna anaagnanna nannnnnagn gaagan 886 

<210> 783 
<211> 805 
<212> DNA 
<213> Homo sapiens 

<400> 783 

cnaatncttg ctcttgncct ntttcnaatn cttggcnact cgctttctnt gcggatccct 60 

cnnganncna tcgttcgaat tcggcacgag cacaaggaga agaaagttaa ttaacattga 120 

aagatgagaa gacatcttgg aagacttgaa ttgggccttg gaagaagaac agccattcaa 180 

atagatagaa ttgtggtagc aaaggcatac ngntcggaaa gtatagatct ccagggacag 240 

tagtcatggg gttggggcac tgttggaatt taaggttgga aggatatatt ggagcccctt 300

gaatacggta acaaggcaca ccttgggcag tggagagtta tcagagtgtt tgaaaaggag 360 

ggttattgag taaataaata gactggtact ttaggaattt taaaatgtgg atcattgtac 420 

tactaataac tatntatttt atatttacta tctactaagt aatttacatg tattttcttg 480 

tactgactgt aaaccttctg ggtgtgggtg ttttaagtgc cattttactg ataaagaaac 540

tgangcttaa atagntgaaa tanntcaccc tgttagtgag tggcacaatg acaagtcann 600 

atcttanggt tgccnanntc caaaanncat ttaaanttnn agnatnattg annnttttnc 660 

cttatggcnt nnnaaatttg gggagccatt attgaaatcc nttacnacnt angaattgnc 720 

caaaaaaaat actttttggg gaaaactgga tttattaatt atccaaaata atttnantgg 780 

cttgnttggc ttntttccac tntnc 805 



<210> 784 
<211> 776
<212> DNA

<213> Homo sapiens 



<400> 784 



taatgctggt tactgccctt caaatccttg caatcccttg gnaancggnc cngcngaccc 60

atcgattcga attcggcacg aggttatatt aaattattct ttgntnttct ttgtctttta 120

ataaagcctg caagttacta aattgnagtt ncataaattc tgtagtnaag tatcatcttg 180

gcagngtgcc aaaggtgaaa angntgcttn ctctaacaga gaaattctta gngactccag 240

tcgtanaaaa acgtctttac aacctgaata agatnganga attgngaaca taccatggcc 300

tattggatga atcatttgcc ggnggctana ncagactgta gggtttgtga tggatntatg 360

gagtatgtgg gtatagaaat catgaatntn ccatttgnnn ncagagattc aagcntanac 420

ttaatgggta gatcataaat gacagaatga attcaaaacc tagcacgtgc attgtaaatg 480

tgtgcccaga tatgtnttgg aaatggcagn tccttggggt catgtntcta ctggcaaaat 540

ttgctatagn gnnactattg nantgtaatt ataaaattna tcannattat ncaccgattn 600

gccaagtaaa ctgtactgtn cataggaatt ttgggaattg tgcanaaatt ggatcaattg 660

aanttnagaa cngatgtctg ggcttaaaaa tttatcnggg accacnnatt angaaactna 720

catntttcgg ngctgaggtt cattgnccaa ggccangaag gtntttnccg aaaanc 776



<210> 785 
<211> 778 
<212> DNA 

<213> Homo sapiens 
<400> 785 

ttngaaaacn ccttngcttn gttnccccta cngaaaccct tttgaaaacc ntttgcnann 60 

tcctctttnt gnaggatccc atcgattcgt gaaagaggag atcggtgacc tgggctcctt 120 

atgtgcctga atgagtttga gtttcctgtt aactccaaat caacagtatt ttcaacaaga 180 

aatgtgcaat tgaaatcaag tgctgtttaa gtgcagctag gantccacag gaagacactt 240 

gcagtgaaca gagttatgga gcagcaaaaa cacagatcta tttggaaaaa gagaaaacat 300 

atgcgttgta ttttgcttca attataaaat accatcctct caaaggtggt tctaaattac 360 

aaaggacttt gatttctagg tagattctgg gtagagactt cctttcatat tgaggcatta 420 

atgacacctt ttaacctggg aagcaatatg actggagttg tactttgaga agattaatca 480 

ggtttggttg cagaatgaaa gagaagatga agtcaagaga ttggtttaga ggctctagca 540

gaagcttagt catatttcaa aatgatcaaa tatcaagaaa aattctgagc tgcataactt 600 

gtataaagta attttcagtg atttttttca tggttatgat aaaagaactg gattagcaga 660 

aacttttacc ctgaatcaag atttaatttt tctttgagct catcttaagg atatcggaac 720 

atagggagca aacgatggtg tggctgcctc antgcttgaa ttttaacngt tttgaaan 778 

<210> 786 
<211> 805 
<212> DNA 
<213> Homo sapiens 

<400> 786 

ngccccccct ttcccccctn ttgaaanccc ctttggnana nnccnntttc aaatcncttg 60 

naaatccttg gcnactcgtn ctntctgcag gatcccatcg attcgaattc ggacgaggag 120 

aggatcactt gagcttagga gttcaaatcc agcctgagcc aacataacaa gactttgtct 180 

ctaaacaaaa cagttattgt ttaaagaatc tgaaatcttc atctttaatt caggtagccg 240 

tgaatcgagc ccaagtttgt ttgatatcca gttccaagtc tggagagagg catctttatc 300 

ttattaaagt atcgagagac aaaatatcag acagcaatga ccaagagtca gcaaattgtg 360 

atgcaaaagg gctatcaaag ggaggctttt tacagagaac taaggaagag aaggaggttg 420 
ttaaagagac ttgagatcag aaaaagatca agaacaactt gaatctcaaa gtatgaattt 480

gaagtatttt gctgagcaaa catttgaatg cctgtatgta ccgtaatcct ctatcactgg 540 

ggtccccaac cccggtacca gcccgtggcc tgctagggac tgggccgcac agcaggaggt 600 

gagcagtggg tgggcaagcg accattccca cctgagcttc ccctcctgtc agatcagcag 660 

cagcgttaga ttctcatagg agtgcaaaac cctattgtaa actgcccatg ccaagggatc 720 

tangttgcaa cgcttcctta tgagaanttg aatgcctgan ngaactgtca ctgncttcca 780 

tnaaccccca gatgggtact ngttc 805 

<210> 787 
<211> 775 
<212> DNA 

<213> Homo sapiens 
<400> 787 

ccttggnnag nngccccctt naaanccttt gaaaaccctt ggcaaangcc ctnncngnnn 60 

gatcccatcg attcgaattc ggacgaggag aggatcactt gagcttagga gttcaaatcc 120 

agcctgagcc aacataacaa gactttgtct ctaaacaaaa cagttattgt ttaaagaatc 180 

tgaaatcttc atctttaatt caggtagcac cgactcgagc ccaagtttgt ttgatatcca 240

gttccaagtc tggagagagg catctntatc ttattaaagt atcgagagac aaaatatcag 300 

acagcaatga ccaagagtca gcaaattgtg atgcaaaagg gctatcaaag ggaggctttt 360

tacagagaac taaggaagag aaggaggttg ttaaagagac ttgagatcag aaaaagatca 420 

agaacaactt gaatctcaaa gtatgaattt gaagtatttt gctgagcaaa catttgaatg 480 

cctgtatgta ccgtaatcct ctatcactgg ggtccccaac cccggtacca gcccgtggcc 540 

tgctagggac tgggcccgca cagcaggagg tgagcagngg gtgggcaagc cgaccattcc 600 

cacctgagct tncccctcct gtcagatcag cancagcgtt agattctcat aggagtgcaa 660 

ccctattgta aactgccatg cnagggatct aggttgcacg ctccttatga ggaattgaat 720 

gccctgatga acttgncact gncttccatc acccccagaa ngganctggc taacc 775 

<210> 788 
<211> 774 
<212> DNA 

<213> Homo sapiens 
<400> 788 

gaaacccttt tgtnaanagc cncttcaacc cnttctaatg cttggcaatc gctctntctg 60 

cangacccat cgattcgaat tcggcacnag attatttcca aagcagccta cagtagaaaa 120 

tagtcattat ggcagcagct tctgatgttt ttgtttggta ggttttctga tttcaatata 180 

tagaatcata ttcatagagt atcttctntn ccgcctngca caaagtaccc atttaaaatt 240 

tacatgcaca gttcattgcc acctttctta ggcctatgca tagttaataa ggttataatc 300 

tactcaacat ggaaaatgga gcctatttgc aaacacacaa gtaattaaag taccaattct 360 

ctcttagttt ctttttttat agttggttta ttttgcaatt ataaatgtta aacatcccta 420 

gagatgaaag ttaaaatggt tgatcacaga tcagtagcaa aatacaaatt gacaattcaa 480 

aattataaat aaaactctgt tgaggatgtt taactttgag tctccaaatt taagagctaa 540 

gcttggaaga aacaaattta taggttatat ttccctctta aattaaanaa acaaacttcc 600 

tctggcagta gtttggtgaa ttcctttcat tgnaatgata ccatgattac aggatcaaaa 660 

atgcttaact tacttgccat tctgctcaca tcatcacagg ttgttntttt tttaaagcac 720 

tcnatgtagg cattttaaac cttcnggata accagagtat cttttgagaa annc 774 

<210> 789 
<211> 773 
<212> DNA 
<213> Homo sapiens



<400> 789 



ngcccctttg aanccnacng aaatcctttg gcnantcncn ctntctgtng gatcccatcg 60

attcgaattc ggcacgagag cagatttgng ataaacntnn tgnaggttna accnaagggg 120

aactnntggt gcaactatgn ngnttggaag atgctgcnta tgtttattga ggattgcann 180

anananatcc tgaatnctcg ccntttncaa aggcttggat aaagcactca agccagctac 240

atatgtatag aacggnttaa aatcnatgag gaagcctgga ctaaatatnc catnggactg 300

gngccnanaa ngctgncgat gaactttgna tctggnnaga agtntaaaga atggcaggat 360

nantnnctaa ngatgaattt cannacnggn nnnccaccan tcttnaatnc tttaagatca 420

ttatacgaag ncnangaaaa ggtggcaatc atngaanaat gngnatnatg ttangaaacc 480

tctctaaaaa gntgacggca ctttaacccc natgatgatg ggaaggaggn accaccaacc 540

acattanttt ngggtccagt actacttggc acanccttat nacgaaactg gncngtncnt 600

ctattgcttt gggagtaccn taaaatacng ccntngngag tncacctnca atgaatnnaa 660

nctctttntc anganagctn nngatccata ngacntgctg ganatnttta aggaancttc 720

nanggnggan tggattaggc ncaggccntt ggacacancc ntncttnatt tnc 773



<210> 790 
<211> 953 
<212> DNA 
<213> Homo sapiens 

<400> 790 

aanannnngg gnnnnnnnnn nnnnannnnn nnnnnnnnan nggngnnttn aaanccttnt 60 

aanngncnnt ncngcttnaa accttggnaa ncnccgcccn nttgcannaa angngaannn 120

atgcttngtg aagcctgann ccaaanctna aggnanggac ctggatcccc ttatatngaa 180

naancggtnt ggaggaanga gnntgtcngg gaggatgggg cagaaaatga ngnnggcaga 240 

ntggncccgg gggctctgca naccagcctt ggagcctgct cattctgggc ccttgctgcc 300

aagganccca gcctnaccta gcangaaang anatgaaagc ccttctccca ngaggtaggg 360 

tctaggctgc ccnaacttaa atgcattnag aaanctcnta gatgtggaaa natttttncg 420

aacctgaaaa tgcagctggt anaatntcaa tgggaagcat aaatncatgt aaaatataat 480 

tnagntngaa tatnanngta aaaatgcact tttnngcggt gtgacngatc ctgggnnccc 540

annatctgnn attnaagngn tttacnaang gaanggaaag gacctttncc taaactacct 600 

ttttgaacag ancattaaga angnncnttc ttttaagnaa aaaaaaatca aattttgang 660 

aaaantggna ttngaatgtn nagaaaaang gatananaan aaaanccaat nntaannacc 720 

nannctctct gganttcnac tatctccact acntacntnt acntatngcg ntaanatnna 780 

ctnttacntc nnnntantcn cacanacntc ntcnaacnta atnangcncn canaatcctc 840

tatannatnt antgtnnntc acannncnna cnggntaant ntnnncaacg ccatatcacc 900

nctnnnatcg ncnagntana taacacntat atcgncactc ncacananac tcc 953

<210> 791 
<211> 798 
<212> DNA 
<213> Homo sapiens 

<400> 791 

tggnancgcn ctntntgttt gatcccatcg attcgaattc ggcacgagga tcattgttaa 60

ttagtgacat agtaacatct gtagcagctg gttagtaaac ctcatgtggg ggtggggtgg 120 

gggtgtattc cttgggggat ggtttgggcc gaatggggag tggaatattt gcncttcncc 180

tgttttaaat tctaggatag attttaacat cctttgcggt cccagtccaa ggtangctgg 240

tgtcatagtc ttctcactcc taatccatga ccactgtttt tttcctattt atatcaccag 300 
gtagcctact gagttaatat ttaagttgtc aatagataag tgtccctgtt ttgtggcata 360

atataactga atttcatgag aagatttatt ccaccanggg tatttcannc tttgaaacca 420

aatctgtgta tctaatacta acccaatctg tttggatgtg gattttaaaa aaatgtttgc 480

taaacctacc caaagtnaga tttacctgna tttaaatggc ctttngggtc ttgaaaaagc 540

tttntnaacc tcttggcttt aaaatgcgtt ttattctnga taagatactt cnaaatancc 600

tnncaaaagg tgttngatnc naattacttt aaaataaaac ctgtaattgn ataatgncat 660

aatgntgntc catgcctnan tccccttcta gnnntanaaa cntnantaan aantatatca 720

atrmtcgatn aaatnntann actataaaaa ctncggccct cttananact tnatncttga 780

agttctcant ataaccnc 798



<210> 792 
<211> 788 
<212> DNA 
<213> Homo sapiens 

<400> 792 

ctnttgttct ttttgcagga tccatcgatt cgaattcggc acgaggcaga gctcacatcc 60 

tgtgcgcagc atcttctgtc ccctcatgtc cttccgccag ggggcctgcg tggtgacggg 120 

cagtgaggac atgtgcgtgc acttctttga tgtggagcgg gcggccaagg ctgctgtcaa 180 

caagctgcag ggccacagtg cacctgtgct tgatgtcagc ttcaactgcg acgagagcct 240 

actggcctcc agtgacgcca gcggcatggt catcgtctgg aggcgggagc agaagtaggg 300

tcctgtcngc cctgctgctg tcctccatcc cacccctctt actccacctc gtgttgtaaa 360 

taaagtttcg gtggtcatgc tganggccgg ctcccagctc tgccggggac ggacagggca 420 

gaaggcancg ggcaacttca ggaacacggt gaaaaaaaaa aaaaaaaaac tcgagcctct 480 

agaactatag tgagtcgtat tacgtagatc cagacatgat aagatacatt gatgagtttg 540

gacaaaccac aactagaatg cantgaaaaa aatgctttat tttggggaaa atttgggatg 600 

ctattgctta atttgnnaac cattntaaac ctgcaaatta aaccaagttt aacaaccaan 660 

caattggcan ttcattttta atggttttna aggttcaagg ggggaaggtt tttgggaagg 720

tttttttaaa attnncgggn ccnnnggngc ccaatgcatt tggggccccg ggnccccaaa 780 

nttttttt 788 

<210> 793 
<211> 806 
<212> DNA 

<213> Homo sapiens 
<400> 793 

gaatcccttt gcttctgtcc tttaagnnat cgttggaaca accatgnctt tttgtaggtg 60 

aagtgttctc tctgcatgca acagtaaaaa ttaatataat atttttncca caaaagaaac 120 

acttaacaga ggcnagtgcc aatttataaa atttatgatc taaaggggga aatcatggat 180 

tataaagtcc ttcagccctt tgggactcta aattggnggg ggattaaaaa gaatttaaaa 240 

taattttnga accgaattta ttttcccctc agtttttgag ggcattaaaa aggcattaaa 300 

tcaagacaaa tcatgtgctt gagaaaaata aaattaatga aaacncagca ctttatgttg 360 

gtttaacntg cancctnctt tggaggtaga atttatttat ttaaaattac tgggtgcatc 420 

angaacccat agggtgtaca aaangttcta ttaaaatctg cnttatagag acaaagaggc 480 

aggcaaatcc atgtnacaaa gggtaaagct tacagtttac aaactgngaa cgccanggtg 540 

taggatataa aaacgcactc ttgagaaaac anatggtcat cagggtgctg aaaacttgca 600 

tggtgctttt caacattagc ctttggtcca caaatttctt gtatttgaca ggatccatag 660 

tgtgccatgg ggcaaganac nattttgccc tctatggtnt tctttaaaaa ttttcanttt 720 

aaaaatacct cttttnncag gaatcctaat tttggcnccg aagcntattn ntggtnccac 780 

atttaccgtt gcccttgccn ttggan 806 
<210> 794
<211> 815 
<212> DNA 
<213> Homo sapiens 

<400> 794 

tttcaaatnc cttggcttta nccctttgtt tganntcctt gttcgaattc ggcacgaggc 60 

cttctctggc ctcaccaatt aggtcaaatg ttccttattt tgtgttgtgg ggcatggctc 120 

tncctgtgag gacctgtccc agcttggacc tccgccttcc tgcgactgta ttggtgtctn 180 

tccctctcaa gcctatgagc tcttgcaagg gcagggaccc tgtatgattt tgcctatcgt 240 

atgtcctcca gcccccagca cangcgcctg gtgtccagtg agagctcagc aaatactttg 300 

tgagttaaan gacangcggg cttggggtag atggatccgt ctgcctanac agggcangtt 360 

attcccgctt gtgagcaact cttaanagaa acttcatttt ttttcggcgc ctgcncgaac 420 

tttcaaagat gtttcccggc cangaacngt ggctcacacc tgtaatccca gcactttggg 480

aggcttgaag tgggtngatc accttgaggt cangantttn tagaccagnc tggccaacac 540 

cggtgaaacc ccgtcctctn ctaaaaatac aaaanttaac tgggtgtngt tggtngggcg 600 

ctttgnantc tcactacttn ggaangctga ngcnatgaan aatttgcttn aaccccngga 660 

nggcngaagt ttcaattgan gtcnanactt nancccattt gcgccttcan accctggggc 720 

aacangtatc annaacttna acnattaaaa aatnaanana nctcttatcc ctttannaac 780 

nattattgan gntacntatt ntcntagaaa tccct 815 

<210> 795 

<211> 1050 

<212> DNA 

<213> Homo sapiens 



<400> 795 



tttctaatgc ttggctttga gncctctntt taaaatcctt tggcnactac tctgcacgat 60

gcggcgctga cccggncggn cccacacccg ctctttnctc ttctttgccg cggactccct 120

ttcctgcctc caagacctgg gtgtctacaa ctgtgagccc agcttgnncc aaaggcagtc 180

cccatgggac ctagactcac cttncccttg cctctatgaa accttctgct tgggcccanc 240

ccctgttcca gctcccgacc tgcacttcct tgctgggact cangcctcca agctccctgc 300

ccagcnagcg gncttcagcc accgtcttcc cctttctttc gggccctgnt tgtnagcanc 360

tttgcagaaa cccananggg acctngtgcc ccttgcnaag nctgtcgcct tggtgcaaga 420

ctgncctgtn ctgcatcatt ttncatggtt gncgggggtg tggggntnnn cnngncgnnn 480

cntgntcaca atcaancatn tatncctnan ntngggtatn acnaatggcc tnaagantgc 540

tacntcntan nnnnganttn tcangnnntn ttactaacnt ncnatngnnc ntnganatag 600

ncatgnantn ttagtntntg atntanccnc nattgcagcc ncataattat cctacaccac 660

anannaancc ntccttnnag aanntgncnt ctatgnaana gnctnnnaat gtggcnncna 720

atataanntn ntntnctnnc atcntannnn nntcctacgt nannnnncat nnncnctntn 780

ggnnactatc ncatantaca tcnntnannn cacccatnct nntntnanat ntctcntggg 840

nantnnnntc tcctnnanat ncnctaatna ngatctctca nntacatgan ntanatnacn 900

natanngnnn anatcnannn ngtctctcnt atnnnttatn nanngntcan nttacnnnan 960

nannnaanng tatnntngtt cnaaanntat ntataaancn ncgtnnnttt nnannagatg 1020

tacnccnntn anntaannat ctangctccg 1050



<210> 796 
<211> 884 
<212> DNA 



<213> Homo sapiens 
<400> 796

ggnnntttng agctcggaaa tcncttnggt nnagcctttc nttgacccca ttgttcgaat 60 

tcggcacgag acggcctggt ggagcagctg tncgaccttt ncctggagtt cctgcacagc 120

caggcacact gcatcggctt cccggacctg gggctgcctg tggtcctgca gntgaagtcg 180 

ttcctccggg agtgcaaggt ggccaactac tgccggcagg tgcagcagct gcttgggaag 240

gttcaggaga actcggcata catntgcaag ccgccgccag agggttncct tnggcgtttc 300 

tgagcagcag gcagtggaag cctggganaa gctgacccgg gaagagggga cacccttgac 360 

cttgtcctac agccacttgg cgcaagcttg cgttgaccgg ggaagatcca acttgggaga 420 

tcaanngggc aaaagaaccg gcttggaaag acctggaact ttcccttgag atcaaaaccg 480 

aaanggaaga atgggcttga canggaangg atgaaggaca gggaagccaa ttttaaaaga 540

ccctctttga cctgnacaag ctcttgaaaa aggacgacac ccgaggggat tcttcggaga 600 

nnagggatac tgangccccc tgagcacctc ggcatggggg tngggaagac cattgnaaac 660 

aaggaccaag gaaggaaggg ccnaaggaag ggacaagcan ncaaactcgg aanggntgna 720 

atgggncctt ngggantngg aggaacccca naaccccaaa aaggccgggg ggcttgggcc 780 

cccttggggg gaancttnnc aacaaaatnt gggccccaag ggggccccgg aaaggaacga 840

aaccttggaa gggaatcttg ncaagcttct tanaaaaggg ancg 884 

<210> 797 
<211> 773 
<212> DNA 

<213> Homo sapiens 



<400> 797 

taatgcttgg ctctgtctnt tgttgacccn tngttcgttt gtgcctgagc acccacaatt 60 

tcaggattta gactgtgtgg gcacctcagc tttcctctgg ntgtaaccac tccttggtga 120 

nagagggaac tcctaaccan tcccatttgn caaaggctag gcaatcttca ttctgcttgg 180 

ctttagtcat tcttgtcatt gggctgcaga agaaaaacaa ctttgctggg tgatcccact 240 

gccttgattt cacctcggan cgaggctggg ccatgtccaa gtcttatgag gtcaccctga 300 

ctagaaaaaa ttgaactcac ctacaaatag tctgaaagag tggtgtatat caaatacgtg 360 

ggtagtgttg catttcaaat gangctcttc tgggttgaaa tgatatattt ataaaaccag 420 

aatatcaaaa atgggtgatg tataatgtct ctttagtttt tttggtattt ggcctctttt 480 

aaagcctgtc ngatgtatgg gagaaaaaca atgaaccgtg ctttgatttc ctatcaagtc 540

actcttaaga acatacatat tggttaaagt aactcggtct ttttttatct gattctttga 600

ggcactatgg gtagcaaaat aaccacttac aaatttaaat gtaatataca cttcttttct 660 

gngtgtcaag tccttatttt tangtgccta attggacatt ttaaaaggtt aaattattng 720 

gttggcatat taatntcaaa aaatctatta attnatttta atgcctggta ccg 773 

<210> 798 
<211> 812 
<212> DNA 

<213> Homo sapiens 



<400> 798 

gtcaatnctn ttcatgaccc tatcgattcg aattcggcac gaggctggag cacgctggag 60 

aggccatccc tgccctggca gcccgcggct gggggagact cctttgcccc attctttgcc 120 

ggtttcctgc cattattggt gtgcaagaca aaacagggct gcacagtggc agagaagtcc 180 

tttgcagtgg ggaccttggc agagactatt cagggcctgg gtgctgcctc agcccagttt 240 

gtgtctcggc tgctccctgt gctgttgagc accgcccaag aggcagaccc cgaggtgcga 300

agcaatgcca tcttcgggat gggcgtgctg gcagagcatg ggggccaccc tgcccaggaa 360 

cactttccca agctgctggg gctccttttt cccctctggc gcgggagcga catgatcgtg 420 

tccgtgacaa catctgtggg gcacttgccc gctgttgatg gccantccca ccaggaaacc 480

agaccccaag tgctggctgc ctactgcatg ccctgncact gaaaggagga acttgnaaga 540

atgggtcacc atttgggcgc ctttttaact ttctgtacca gancaacccc ttgacaaggt 600 

tataaaatgt nggctccccg aaccttnttg cgtattcttg cagncctcaa ttcttggctt 660 

gaccaaccaa aggattccca cccangaaaa cccnaanggg cccnaaactt gttnncttgn 720 

ttnccttgga ccgtttcctt ggggccaaaa acaggnanaa cccggacang gttttttnaa 780 

accagntttt tggggcttta aattggcctt gg 812 

<210> 799 
<211> 758 
<212> DNA 
<213> Homo sapiens 



<400> 799 

ctaatagctt ttcattcnaa tgcttgtgat ccctcgattc gaattccgtt gctgtcggac 60 

agattgccct agtacccacc cacctatcag ggttatgcaa tggaacatcc tcgcccaagc 120 

tcttggagaa ggcaaagaca actttgtaca gtgccctgtt gaagcactca aatgggaaga 180 

aaggaaatgt ctcatcctgg aagaaatcct ggcctaccag cctgatatat tgtgcctcca 240 

agaggtggac cactattttg acaccttcca gccactcctc agtagactag gctatcaagg 300 

cacgtttttc cccaaaccct ggtcaccttg tctagatgta gaacacaaca atggaccaga 360 

tggttgtgcc ttattttttc ttcaaaaccg attcaagcta gtcaacagtg ccaatattag 420 

gctgacagcc atgacattga aaaccaacca ggtggccatt gcacagaccc tggagtgcaa 480 

ggagtcaggc cgacagttct gcatcgctgt tacccatcta aaagcacgca ctggctggga 540 

agcggtttcg atcagcttaa ggcttgtgga ctcttcagaa cctgcaaaac atnacccaag 600 

gagcccaaga ttncccttat tgtgtgtggg gacttcaatg canaccaaca gaanaaggtc 660 

tncaaacact ttgcttcttn cagnctnaac cttganagnc ggcctacaag ntgctgaatg 720 

cttgatgggc aatttagaac ccccatacac ctacctgg 758 

<210> 800 
<211> 770 
<212> DNA 
<213> Homo sapiens 



<400> 800 

ttnaaancng cnttggactc cttgcaggat cccatcgatt cgtttaaact gagctccaaa 60 

tgacgttcaa acacccctct cgggtagagt tttcatggtg gaacggttgc gcccaccaaa 120 

cagaagctta tgtttttggc acagaagcct gggccatttt catggacacc tggctggacc 180 

tcggtggaag tgaactccgt aggttgttgc gttcactgca gcacctcaca tgataccgtc 240 

ccctctcatg gaacggagcc tcccccatgc agcccccact caaatggagt tttaaaggct 300 

gggttcaggt tacgggggcg tttctcaccg tctgaatgcg gaggacagag acnagctcca 360 

gggagcgtgg gcgggtgacg gcgctgagat gcgtgatgtc tcggaaacgt cctcgcatcc 420 

ctcancgcgg gcgctgactg ccgcggccct tgcctgtctt caggagcgct ccagcttcgc 480 

ccacacaccc cgggctgatg tcccctcgct ccggcggcct gcagacccca nagtgcctgt 540 

ctcgggaggg ctccccattc acacgaccct gagtttgggt ccaagttagc ttctgtccca 600 

aagtaccngt attcccaaag cgcacccggt aaagganccg ggccggncct tntttgcggg 660 

gccgggggcc ggggccggga actcgtnggg ggttgccngg aanggggtta accgtnccgg 720 

ttnttccgnc cttncgtgca aggcttnccc cgttaagngg cccaaaccnt 770 

<210> 801 
<211> 573 
<212> DNA 
<213> Homo sapiens 
<400> 801



ggagccctag agctccacaa caggactcag agcctctaac cagttccagc actccagact 60

ccagccacac tccaacacag caccatgatc ccagccaccc gctcgcttct ctgtgcagcg 120

ctgctgctgc tggccaccag ccgcctggcc acaggtaggt ctcgccactg ccactggggg 180

aggagggacc tctggtgagc gcagcctccc acagtcccgc tgaccaagag tcttctccca 240

tagggcgcct atcgccaatg agctgcgctg tcagtgcctg cagaccatgg ctgggattca 300

cctcaagaac atccagagct tgaaggtgtt gccctcaggg ccccactgca cccaaaccga 360

agtcatgtga gtatcttccc ggttagcttc tgccacttcc agactcgccc aaaccctccc 420

gcgcccccac acttctccta gtgggaatgc ctaacatgtg ggtctatcct tctctctgca 480

gagccacact caagaatggt cgcgaggctt gccttgaccc tgaagctccc ttggttcaga 540

aaattgtcca aaagatgcta aagtgagttg tga 573



<210> 802 
<211> 1390 
<212> DNA 

<213> Homo sapiens 
<400> 802 

tttttttttt cacaaggaat atcattttat tactgtaatc acaaaatcgt aatttctgta 60 

caggaatgta taagtgaaca ttattcaaag cattggtaat tcacttcata aagagggtaa 120

acatactaca gaacatattg taaagaaaaa atattgtaaa attttctggt cttgcagtgc 180 

actatttagt gcaagtattt aagacacaat agtgttcaat tcagcaaagt attgcagaat 240

gtcatgccac agtccactta attcaaagag ggtcaggaca tgcagcttgt aataaaatgt 300 

cagagtgtgt gtgtgtgtgt gtgtgtgtat ataaaaccac atgtaattca taaaatatat 360 

agtggtttat ttagatggtt ttaaatgatt tcactgtgga atccagcata actggaacaa 420 

catccaaggt cttcttaacg gcaacaatct tattgctagg caatggcytt ggcttcaggt 480 

argaatgcyt cccagtattt tatcagctgt tgttgtgttt gaactagtga ttctaagtac 540 

ttgatgataa cggttttaaa atccttcact cgttctttct caaatcttcc cacttctttt 600 

cgaatcgttt tagatatctg ttcaaaatct ctttcccctt gttgcacttt cgcctcccac 660 

tctcttattt catttttagc ttgctgtatt ttatctggtt tgttagcaac catcattttt 720 

gcttcagctt cacgtttttt gagcaaagta atttgagcat cttcccattt ctgccagcac 780 

ttcattcgat ggtcaaacac acctttcact gcagcaataa gacgaatgta gtcactaagt 840 

agttctgaaa acatataaaa gtcagmaaaa gcttgttctt gatgtaactg gtctatcttc 900 

tcctcaacct ctgcaagctg agacaaagct ctagataaag cagtatgatc ctcagaatta 960 

cctaacatgg cagcactttt agcaaaggca gctgtgttgg ctgaaagttc ttttctatga 1020

cagamcaagg cttcaacact gacatgaagt ttcctaagtt gctgatccag attctcaaat 1080 

tgctgctgct tttcttcaaa ccatgcatcc gattcattca tcttgattgt cattttgttg 1140 

acagcgtcgg cagccttgtt caccatcctc aatattcctg ctccactcag agcctgtgta 1200 

ttaactgctc taggcagctc tgaactttcc aagaactgcc ttaaatcagg atcctgtagt 1260 

aaagttggat gttttactgt tctttgaaga tacctttcaa gagctgctct ccgtttttct 1320 

acaaactcag tggatgatga gtcttcttta cccactttga ccttggtcat ccctactata 1380 

ctcttttctg 1390 

<210> 803 
<211> 947 
<212> DNA 

<213> Homo sapiens 
<400> 803 

ggaacttctg agtaattggt atcatttcct agtgactcgg ctcttgtact ccaatcccac 60 

agtaaaaccc attgatctgc actactatgc ccagtccagc ctggacmtky kkcwsgsagg 120 
ngagagcagc ccagaacccc tggacaacat cttgttggca gcctttgagt ttgacatcca 180

tcaagtaatc aaagagtgca gcatcgccct gagcaactgg tggtttgtgg cccacctgac 240

agacctgctg gaccactgca agctcctcca gtcacacaac ctctatttcg gttccaacat 300

gagagagttc ctcctgctgg agtacgcctc gggactgttt gctcatccca gcctgtggca 360

gctgggggtc gattactttg attactgccc cgagctgggc cgagtctccc tggagctgca 420

cattgagcgg atacctctga acaccgagca gaaagccctg aaggtgctgc ggatctgtga 480

gcagcggcag atgactgaac aagttcgcag catttgtaag atcttagcca tgaaagccgt 540

ccgcaacaat cgcctgggtt ctgccctctc ttggagcatc cgtgctaagg atgccgcctt 600

tgccacgctc gtgtcagaca ggttcctcag ggattactgt gagcgaggct gcttttctga 660

tttggatctc attgacaacc tggggccagc catgatgctc agtgaccgac tgacattcct 720

gggaaagtat cgcgagttcc accgtatgta cggggagaag cgttttgccg acgcagcttc 780

tctccttctg tccttgatga cgtctcggat tgcccctcgg tctttctgga tgactctgct 840

gacagacgcc ttgccccttt tggaacagaa acaggtgatt ttctcagcag aacagactta 900

tgagttgatg cggtgtctgg aggacttgac gtcaagaaga cctgtgc 947



<210> 804 
<211> 532 
<212> DNA 

<213> Homo sapiens 
<400> 804 

cctctgccct cccaggttca agccattttc ctgcctcagc ctcccgagnt agactgggac 60 
tgcaggtgcg catcaccacg cctggntaat ttttgtattt tgagtagaga tggggtttca 120 
ccatgttggc caggctggtn tcgaactcct ggccctcaag tgatccaccc acctcagcct 180 
cccaaagtac agggnttata ggcgtgcgcc antntgcccg gccgagaaca atttntcaca 240 
agnttacttt tctagttttg ccaatgcatg gtgaaagtga acccaagcct gggaactgca 300 
ggcctagaca atgcaggrmm ykksttsamm cwsrsmsrmr smsstysmar ywmrsssagm 360 
cttggaaagg agaagtgtga ggcaggtgtg ggtaggacct ctttttagta cctagaaaaa 420 
ggctaagaaa gtggcctgga gatgtttaga aggttaaaac caacgaagaa aaaaatcaat 480 
gacaacctat caggaacgtg attgactctc agaatggaga actggcgaat cg 532

<210> 805 
<211> 552 
<212> DNA 
<213> Homo sapiens 

<400> 805 

aatgcattnt tgatttttta ttgcagatga tgaaaaagtt ttagatatag acagtgccga 60

tggttacaca atgttgtaaa tgtatttaat cccacttacg aatgattaaa atgataaatc 120 

ttatgtttat ttcatcacta ccaaaaggct gtgggtgcag gggtgctggt ttctggtcct 180 

agcctaagag actggcagtt tccaccttct atctcttggg acagtagctc tgggagccct 240

gagctgtcat gcaggaagtc cagctaccct gagaccacca tgctggaaag gccacaggga 300

ggagctctgt ggacagtccc agctgaacct tgccttccag ctgtccctgt caagatgcca 360

ggsatgtgag taaagccatc atggacccty tagaccagac tgcccaccag cagggtaccw 420

tctggcagcc acatggagca gaagaaccgc ccagctgagc cacttccaaa ctcttgaccc 480

actaagtcat gatccacaat gaacccatca tagggatggt tggctttgca gtgtggataa 540

tgaggatgtc at 552 

<210> 806 
<211> 1646 
<212> DNA 
<213> Homo sapiens
<400> 806 

aactagtata tttacaacat cagaaacttc aatatggaga tttgttgttc ctatatcatg 60 

atctttagca gcaactacac cataggcact gcacaacctg ggtcctagat caggacgtac 120

aaaaaatcct ggcaaatgag aggccaaatt gaattttcct tctggattac aatattctgg 180 

caatggcaga ctttttaaaa gatcttcgta tcttgctggc atcatagtct tgaagtcttc 240

tcctgaaggc caatctttca attttaaaac aactgtttct ccactcttgt ttttctgccg 300 

ttttgaaact tcttcaaaac catcccagaa ttccttaaca ttggcatttg aaatgatgct 360 

atctttgcag ttcaggagat cagcttggtg gtctccaaaa tcaagactaa ttgattccgc 420 

cttccatagg ctaatgttca ttttcttatg cacaccagaa accactgcag gctgtccttg 480 

tttccaacat tctttgaaaa gcttccaatt actgctattc ttataatcct taagccataa 540 

aatatgcttc tcacagatcc aagaatgtgg tatatcactg tataatttat tattttcatc 600 

cactgcagat attatgcttt cttcaggctc ttctttaagc tctggtttta catttatctt 660 

ggaggtttta cttggtggaa ttttgttttc aacaactgaa gcaattatgt catcaagaat 720 

gttaggcata gtccgtccac ttttgctact tggggctccc attgaatata ctggggcaaa 780 

ggcaatgcca gcatctgtas accccacacg tagctttcca gctgttgtag tcagcaaatc 840

ccgtaaggtt gagccttgtt cattattctg ggacacaaga ggtgatgttc tgccatttgg 900 

agattcagag ttgtcttgtt ctctttcttc tttaatttgg ttttcaaggg taagttcttt 960 

gttttctttt ttttcctctc tggctttttg ctctgcaaga tctgctaacc agtgcagtgg 1020 

tgactgggat tctggaggag ttaacttgtt atctgtgcct acatcactct ctgggctgct 1080 

gccaccattt ttctcagact tcggaggagt attttgctgc tgagactcag gcatgcacag 1140 

agaaatttta ttactgtgat taagaacatt ctgtaaaact tgagatacac cattcattgt 1200 

aggaaaattt ccaacttgta aattctgttt gttagtacaa tgacaatggg atttaatacc 1260 

atatttttcc ctaagagtgt gcatggcatc tagaagatct gtcaaaacag aaccaggtat 1320 

aatttgggtt ggcattaaat gtttgtgatc atgaggctgt ccyttcacac acttcatcca 1380 

agcatattag ttctttatcc ctagramyyc tycctttcct ttngccttgt aacaatctaa 1440

gcaganccac aawkccacat tttkggcaga cccagtnraw kktaancawk gntgcttcac 1500 

atgcatcaca catctcccgg actcctctca ctgctctttt ccaggcaatt ttggcatcct 1560 

ttttcaccca ggacaaagct gttttttcag atgttactaa ttgacagaac ttatcaccta 1620 

ttatatccaa gatatattta gaagtc 1646 

<210> 807 

<211> 1029 

<212> DNA 

<213> Homo sapiens 

<400> 807 

tggggctgtg actgtattta cttcattctt gaatcccgcg tccccgtggc tgggggctga 60 

cacatccctg ggcaccactg tgacttcctg tgggtccctt cccttctgtc cctgactctg 120 

tagacccccc acaggaaggg tcctaggtag ggggaggttc ctcctccctt gaaaccctgg 180 

gccactctgt caaggcaaag ctctgggccc agcaccttgt aaaggctttg atgagaggag 240

ctctggcttt tgctcagggc ctttggaccc caccctccag cccccaggaa tgcaggcgct 300 

caaagcctgt ggtnaggctg cccgaagcac gtgccgcagt tcttctggag tgggagcagg 360 

gggacagagc tttgggtaga ggagggtcac ctgcaaagct ggaatgccag gggagtgggc 420 

ggtgcctcca gctcctgggg gccagggtgt ctccatacct catgggcctg agcctgggca 480 

ggggtctgga gtgcacatag cccccaggca gggagagggc agtgacagga cagagccact 540 

catctgtccc aaagctgcac caaggggtgt cagcaacccc aacctactga cctactttgg 600 

gaccacaggc ccatctagtg caaatgaggc ccagaaagga gaaatgcttt gctcaacagc 660 

cacagtaggc tgacgtaacc tatgtaatgt agggtcaggg tgggcctgag ggatgancca 720 

ggtggtgggc aggtganaca ccaggtcccc tcctggcctc tgccccaccc agccctctcc 780 
tgcacggcta ccagaagatg tccgggaaga acanactagc cctgagtagg gagtgtggtc 840

aggtgcagag gagggcaggg gcccggatcc tggcccagaa acactctaaa acagaatccg 900 

atcctgagat gatccaaatc aaacagaata cttgacggaa atagtagagt ctgaaaatga 960 

tgcactctgc gcacacatat acaagacaca cacacacaca cgaatccacg cacacgaggc 1020 

acaccccac 1029 

<210> 808 
<211> 836 
<212> DNA 

<213> Homo sapiens 
<400> 808 

aaaaccgggt ataacacttt aatatagatt tgtggaactc tggcccttgc agccagaata 60 

cacatttata agccataaat aaagcacgca gaaaccataa attaatcgga cccgagacct 120 

ggatttcacc gtgtcaagat tgggaatgct ttttttttct ttttcttggt catttacaac 180 

agacccttac attatttttt ttcctgtttt taaacaatag tacaaccctc tggttctgtt 240 

aaaactacat ggttttacac cgagtcactc acaaaatttt tttttttttt taagtaagac 300 

ttccctgcaa caacagcaat ggaggagaac aacaacaaca aaaaaatcag aatctgcagg 360 

tgcttgaaga agcaggagtc tacacagtag tggaaaccgg aggctttttt ttaactttat 420 

attctttccc gttttcctcc ttatatagaa cgtggggtat ctgtgtggcc ctctgtttgg 480

gacggaacrg ctgcagcggg tgaaggaaga ctgctgtctt gggggtgttg gggtgggggt 540

gttatggatt tcttctccct tgcgtctctg caacaccgtc tccccaaagt ctcgaccccc 600 

acttgctctc tcacttrtcc tcgatccggg gtgccagagt tagccnggcc tgaagccgtc 660 

gtcttcttaa gaggagttca taatgggccg ggagtacacc ccctggtagt aggaggtatc 720

tgcggccagg ggcgaggcgt ccaggcccgt tttgttcgtg accgggccca tggccaagct 780

gccaggcatg ggggaaccgt agccggggta gtgcatcacc tgttcgtagg ccttga 836 

<210> 809 

<211> 1844 

<212> DNA 

<213> Homo sapiens 

<400> 809 

atcaggtgtt cctcccatgg caggagggaa gaaacccagc aaacggccag cctgggactt 60 

aaagggtcag ttatgtgacc taaatgcaga actaaaacgg tgccgtgaga ggactcaaac 120

gttggaccaa gagaaccagc agcttcagga ccagctcaga gatgcccagc agcaggtcaa 180 

ggccctgggg acagagcgca caacactgga ggggcattta gccaaggtac aggcccaggc 240 

tgagcagggc caacaggagc tgaagaactt gcgtgcttgt gtcctggagc tggaagagcg 300 

gctgagcacg ccaggagggc ttggtgcaag agcttcagaa aaaacaggtg gaattgcagg 360 

aagaacggag gggactgatg tcccaactag aggagaagga gaggaggctg caacatcaga 420 

agcagccctg tcaagcagcc aagcagaagt ggcatctctg cggcaggaga ctgtggccca 480 

ggcagcctta ctgactgagc gggaagaacg tcttcatggg ctagaaatgg agcgccggcg 540 

actgcacaac cagctgcagg aactcaaggg caacatccgt gtattctgcc gggtccgccc 600 

tgtcctgccg ggggagccca ctccaccccc tggcctcctc ctgtttccct ctggccctgg 660 

tgggccctct gatcctccaa cccgccttag cctctcccgg tctgacgagc ggcgtgggac 720 

cctgagtggg gcaccagctc ccccaactcg ccatgatttt tcctttgacc gggtattccc 780 

accaggaagt ggacaggatg aagtgtttga agagattgcc atgcttgtcc agtcagccct 840 

ggatggctat ccagtatgca tctttgccta tggccagaca ggcagtggca agaccttcac 900 

aatggagggt gggcctgggg gagaccccca gttggagggg ctgatccctc gggccctgcg 960 

gcacctcttc tctgtggctc aggagctgag tggtcagggc tggacctaca gctttgtagc 1020 

aagctacgta gagatctaca atgagactgt ccgggacctg ctggccactg gaacccggaa 1080 
gggtcaaggg ggcgagtgtg agattcgccg tgcagggcca gggagtgagg agctcactgt 1140

caccaatgct cgatatgtcc ctgtctcctg tgagaaagaa gtggacgccc tgcttcatct 1200

ggcccgccag aatcgggctg tggcccgcac agcccagaat gaacggtcat cacgcagcca 1260 

cagtgtattc cagctacaga tttctgggga gcactccagc cgaggcctgc agtgtggggc 1320 

ccccctcagt cttgtggacc tggccgggag tgagcgactt gaccccggct tagccctcgg 1380 

ccccggggag cgggaacgct tcgggaaaca caggccatta acagcagcct gtccacgctg 1440 

gggctggtta tcatggccct gagcaacaag gagtcccacg tgccttaccg gaacagcaaa 1500 

ctgacctacc tgctgcagaa ctctctgggt ggtagtgyta agatgctcat gtttgtgaac 1560 

atttytccay tggaagagaa cgtytccgag tccctcaact ctctacgctt tgcctccaag 1620 

gtgaaccagt gtgttattgg tactgctcag gccaacagga agtgaagacg gatccagatc 1680 

tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtcct atgtctatgt atcgggtgag 1740 

gggtgggagg gttgctggag ggtgctttat tgggtggagg gcaccatgtc ccagggctat 1800 

caaataaaga atagtttggt ttttttttta aataaaggtt ttat 1844 



<210> 810 

<211> 489 

<212> DNA 

<213> Homo sapiens 



<400> 810 

gccccgctcc atgagcagtg actccccagc tcctcctggc accagtcccc agggctctcc 60 

tgttggtakw wmmwgctwyw ywtsyysswm mywmmycgkg racctcraga tctyyaccct 120 

aaaatarttc tgttgaattt caccctggcr atgtaaaytg akagcttatc ttcacagatg 180 

ysrganaakr gmcmayycmy cwkcaswcct swgncwmays tswrwcwrat ksmtkycykw 240 

kccctattta tgtaaaaata cagggtccct gagccagcct aaggcataag tgacttatcc 300 

ctcctccctg ctcacatata aattgtgtat ttagtgaaag gctgatcaaa grttcaaagr 360

atgttatttg ttatctacct gtggacccag naggtcccca attccagtta tttccacctt 420 

tccaggaccg ggaccaatgt atatatgtaa ctggattggc tgttctcgtg tgtttgttaa 480 

aatgtgtgg 489 

<210> 811 
<211> 471 
<212> DNA 
<213> Homo sapiens 



<400> 811 

gccctcagcc acccccatcc ctgccccttc tgagactcac agcacccctt tccttcctct 60 

cctcccacct cctccctcag cccctcattc tccttgggaa tctgcagagg gctctgggac 120 

tcactgccgg atgtgaaatc caggcgtcag ctgtttccta ggcaagggca ggaaagtggt 180 

ctccagccct tgctccactc atgcctgggg gnctgggsyy gagtggtatc cctacctggc 240 

ctccccctgg cctctggcct ccagcgctgg gtttgtcgag tgagagagag agaggagctt 300 

gggttgcttc cctgtccccg ccccctctgt ggcattgtcc ctcccactct tatttttcta 360 

ccaattgcta tttttccgaa caatccttgt agagtatgta ccatccaaag gcaggagggc 420 

cctcggtggc cggctctggt tggagatggt acagttttat tgtacaggtg c 471 

<210> 812 
<211> 579 
<212> DNA 
<213> Homo sapiens 



<400> 812 
cccaatgaat caacatactt tattagaccc actaagtgcc aggggagggg cctgtgccta 60

ngagccaggt tacagggctc acccgtagat tcagtctggt ctctccccat catgcctctc 120 

acttccagtc tgggcttcta ataggagggc cccgacttct tccctcccag tcattctctc 180 

gaatggagaa tctttcctca ttccagggac accaaggctc aggaaggggc ctatccatca 240

tcagtagagc cagacaagct ctcccatcgg acgtcctgtg gctgggccca gaaatgggtg 300 

ccgctgcctg tgggactgcc cttccgggaa ggaccagggt gtcttcagtg ctcttggcct 360 

gcacgtggna ggagagtagg cagatgtctg gtgctcttta agctcaaagg catcatggcc 420 

ctctckgnwg sarcrrrsrs akamragkym sssatcncag scagcscwnk arskstsgca 480

nwsmwcatts casmtgcasc mmcmggrrrs mkcsksywcm kmagnsktnm scmtsgsrgy 540 

cagcgcagcg tagggtggca tcctcattgc agatgcagc 579 



<210> 813 
<211> 562 
<212> DNA 
<213> Homo sapiens 



<400> 813 

tttttttttt tccagatgta actcttgtct tttattccag catctcccag agctccaata 60 

tgtacagact ttatttatac acatataata tacaccatat atacttattt atagatattc 120 

acacaccagc ccacacactc gcacacactc acacgcacac acccttccag gaggggcgtg 180 

tggctgcctt ggagtcccgc tagscccaaa caagtgatac tgggcttgcc aggcagttgt 240 

gaggttttgt gttttttgct tttaaaaaga aggccatttc ctccagatgt gtcctccctc 300 

tccccaagcc ctaaaactcc tccccaaaac actctgaaaa aaattttttt aaaacaagrg 360 

gnttttcctt tgctytggsc caagtagttt ctngganagn tccrggscca tccacaagny 420 

ccgtgcaggt cctagagcac gagagccggg cgtggccttg gtcaggcctg cagctgtgcc 480 

ctctgagggg agaggggagg cgctatagca tcaagggcac ctgccagatg aggagggtgc 540 

tgtccgtctc cccacacggg gc 562 

<210> 814 
<211> 594 
<212> DNA 
<213> Homo sapiens 



<400> 814 

agcctcgcct gggccggcct gtggctccca ttttcctttc agcgggacaa aggggacttg 60 

ttaccaggcc attttctgga tggcctgtga gatctctgcc cctccaagac cckccaaryc 120 

tsmsyckgwc scmswgytsk smsmmwgmmt ycwgcmsygs srarccntgss rryktswrkc 180 

tggcaccagg ctgnagnctc cccaatccca gcccactttg ctgtgtctct ggcgggctgt 240 

cctccttggt gggagctgtc ctgcacactg taggatgctt aaaggtatcc ctkgcctcca 300 

cccaccccta gccagcagct cccagtcaga caacagccag awatgtctcc agactctgcc 360 

cagcctcccc aggtagccac cctcgagaca cgacctcaga gtctctgtgt ctcctagaag 420 

cctgacagag acccccaggg cagtgggtgg gtngcgggct agagaccctt gcctgtntcc 480 

gggaccctgg cgccgctctc ccctcctgtg gatccctccg gactaacagt gttcttagtn 540 

ggcagangct ggggcacccc ttnggccctg ncaggcatng ccattggcgc angc 594 



<210> 815 
<211> 812 
<212> DNA 
<213> Homo sapiens 



<400> 815 
gacggaacrg ataacacttt aatatagatt tgtggaactc tggcccttgc agccagaata 60

gttatggatt agccataaat aaagcacgca gaaaccataa attaatcgga cccgagacct 120

acttgctctc gtgtcaagat tgggaatgct ttttttttct ttttcttggt catttacaac 180

gtcttcttaa attatttttt ttcctgtttt taaacaatag tacaaccctc tggttctgtt 240

tgcggccagg ggttttacac cgagtcactc acaaaatttt tttttttttt taagtaagac 300

gccaggcatg caacagcaat ggaggagaac aacaacaaca aaaaaatcag aatctgcagg 360

aaaaccgggt agcaggagtc tacacagtag tggaaaccgg aggctttttt ttaactttat 420

cacatttata gttttcctcc ttatatagaa cgtggggtat ctgtgtggcc ctctgtttgg 480

ggatttcacc ctgcagcggg tgaaggaaga ctgctgtctt gggggtgttg gggtgggggt 540

agacccttac tcttctccct tgcgtctctg caacaccgtc tccccaaagt ctcgaccccc 600

aaaactacat tcacttrtcc tcgatccggg gtgccagagt tagccnggcc tgaagccgtc 660

ttccctgcaa gaggagttca taatgggccg ggagtacacc ccctggtagt aggaggtatc 720

tgcttgaaga ggcgaggcgt ccaggcccgt tttgttcgtg accgggccca tggccaagct 780

attctttccc ggggaaccgt agccggggta gt 812



<210> 816 
<211> 999 
<212> DNA 
<213> Homo sapiens 

<400> 816 

aagccgcctt ctgagccttt ngcctctgtt gttcctcctg ctgcctgtga gttttcatgt 60 

gtgcatttcg gcttttgatc ttgaagaaga ctttgccnca ctccttgcag gggaagatgg 120 

tggtggggtc tgtctcgccg ctggtggtgc tgtgagaggg tgancncttt accncnacag 180 

tacccactct gggtgccncc aggcttctgc ttcccagags gkrtrrmmmc kmgggccttg 240 

ctttgccccc tgnaaaagct gccccctanc catagtatct cccaggcaaa gatgccatgc 300 

tcactgcaaa ctatggaatg aggtcagaac agaatcaaag taacgcttga tgggaaaagt 360 

tggccccaag accccagtac taagagggtc gcctgcgtct cacacacaca cactcacagc 420 

aagctttggg ataaaaggca accgggatgg ttgacatctg aatgcaatgg aacatgaagg 480 

tcagcttcag tccctactgg gaatgatttc atgagaaggt agcccagatg aaacacctct 540 

taaagatagt tgtgccaatt atttattccc ccaacccccc acaaaaacaa atttttttaa 600 

ataaaaggaa aagaaatagg attttttttt ctaaacctga ataaaatgac cacttttaaa 660 

acagrtagtt taaaagggtt acaaaacaag caggcagtcc aggtttcctg attaatgaag 720 

atggaggccg tgggttttca ctgtctctaa gtgacacaca gggctttata gttctgcgtc 780 

accctgaagc aagactgaat cttgatcatc caagagaaga tcggtgtcca caacttcagc 840 

ctcttccatg acacctccca actgctggac gacgtcgtcg tcgaggatgt ccacatcctt 900 

gtatgggttt gatcagactc agctggtcca ggggcagcag cmcgrcagca ccccacgggc 960 

ccgtagtcct ctcaatcgtg gctgccatct cagctgcaa 999 

<210> 817 
<211> 653 
<212> DNA 
<213> Homo sapiens 

<400> 817 

atttttaywt ttaaaacatt ttatgaggga taaaatatag tctttttcta tcagtatgtt 60 

cacacttcct ggcctctcat tgggaagctg taagatgtcc ttcaataaga tcctgaacac 120 

gcgacagaat aatctcatta gagctgctgc aattttctgg accatatggt gggtctatag 180 

tcaggacccc agccacacag agagtccttg gagcgtctcc ctgttcagtg atggggatgt 240

ggttcttctc aagccatttc tttaggctgt tctttctctc ttccagatcc tctgggctgt 300 

atgctttgca gtctccagac gtgaacaaat gcatcagctt ctccctcact ctatggtccc 360 
cttcattcat agtttcaaca gtckgcacag catgtcccat aattccggtc acagacatgc 420

tgccatcttc aaggaagttc acaaggacaa tattggcaga gactgggtct gkagttaaam 480

cccatccttt atactcattc ttctcactgg ctgtcactcg gacctctttg taaatgtaat 540

cttgccattc taaggggcct ttcttcatcc attcactcat gattgccacc tggctaaatc 600

agttaaaaaa ctcctcgcaa ctctgggtac tcagcaacca tgctttgagg aag 653



<210> 818 

<211> 1225 

<212> DNA 

<213> Homo sapiens 

<400> 818 

ggattctttc actgagcaca aagagttgtt ggggctttag catctgactg attttgttac 60 

ggggttgatt ctgaccatag gaagtatgca atgtgaatca ctatttacag agaaacctac 120 

aacagatgct tgatgttgta gaaactggga catatagata ccaagcaaaa ttataagaaa 180 

cctataaggt gttcaatacg cttgtgtttc caaaattcac tgtacatgat cagtttggtg 240 

ttcttgtacc acagttttta actgaaggaa ccagttgtaa cagtctcaat tttaactaaa 300 

acttgaagaa ctaaaacaac aatgcaaacc tttcagcatt gtttggccaa acttgttaaa 360 

actgtaatgc aagaaccaaa tgcacbgtga tgtggcacca actaattagc aagcatgaat 420 

ttttcaccca agagtgaaaa aaggaaaatc taccatggct tgaagttaaa gagcagaact 480 

cctgactacc attctatgac tgatcaaaag actaatagtt aaaaacctca gcaggccttg 540 

ttcacgatat gcagaaaaaa aagtgctgca gtttagatac ctctggaatt tttccacagt 600 

gtcacaggtt tgtaatactt gaagccctac atttctaaga atatatttct tgctcagttg 660 

tttcakgcaa gcccaagact ttgtaatttt taaagggccc aagatttttt tttttttttt 720 

tttttcaaat aacagaccag cttctttttc ttgcagttac agatgtaatt tcctttttgt 780 

tgtcaaacat aaggtaccaa atatgatgca ataaattgtt ttgaaaaaca gttgtgtgaa 840 

tatttcaact aatctgtgtt gggcttctgt gaaatacaca ggtggaaaca gaggtgcaag 900 

ccagagcaat ngtaatatgc tgtaaggcta gtgcagatgg gagcttttta gaaggggcta 960 

agtgctggtg tcagggaaat tccataatga agtagaatgc tgctcctgca ttaagatttc 1020 

attgagggca aggctggtgg caggtactat gaatgtaatt cataatttaa aaggaaaact 1080 

aaaaactatt ttgatttggg aaaatgagcc ttaatttgtt aaacctatac actgaggaac 1140

tagcctcagg ctttaatatt ctcattggca tttgccaagg tcctgaggcc aaataaggtt 1200 

taagttaaaa caaatccaat tgtnt 1225 

<210> 819 

<211> 1024 

<212> DNA 

<213> Homo sapiens 



<400> 819 



gacaccccag atgcagccac caccagcaga agcgatcagc tgaccccaca agggcacgtg 60

gctgtggccg tgggctcagg tggcagctat ggagccgagg atgaggtgga ggaggagagt 120

gacarggccg cgctcctgca ggagcagcag cagcagcagc agccgggatt ctggaccttc 180

agctactatc agagcttctt tgacgtggac acctcacagg tcctggaccg gatcaaaggc 240

tcactgctgc cccggcctgg ccacaacttt gtgcggcacc atctgcggaa tcggccggat 300

ctgtatggcc ccttctggat ctgtgccacg ttggcctttg tcctggccgt cactggcaac 360

ctgacgmtgg tgctggccca gaggagggac ccctccatcc actacagccc ccagttccac 420

aaggtgaccg tggcaggcat cagcatctac tgctatrcgt ggctggtgcc cctggccctg 480

tggggcttcc tgcggtggcg caagggtgtc caggagcgca tggggcccta caccttcctg 540

gagactgtgt gcatctacgg ctactccctc tttgtcttca tccccatggt ggtcctgtgg 600

ctcattccct gtgcctntgg ctacagtggc tctttggggg cgctggccct gggcctgtnc 660

aaccaccggg ctggtaatca ccctctggcc cgtggtccgt gaggacacca ggctggtggc 720

cacagtgctg ctgtccgtgg tcgtgctgcn ccacgccctc ctggccatgg gctgtaagtt 780 

gtacttcttc cagtcgctgc ctcnggagna cgtggctcct ccaccccaaa tcanatctct 840 

gccctcaaac atcgcgctgt cccctacctt gccgcagtcc ctggccccct cctaggaagg 900 

nccgggtccc acaggcaaca cctaagtgga ccaacccctc tgcctgtcct gccccccaga 960 

cgatgactga aggctccttt gacaccttga gatgantctg ctactttcca gacttttctt 1020

acaa 1024 

<210> 820 
<211> 631 
<212> DNA 

<213> Homo sapiens 
<400> 820 

atttttaywt ttaaaacatt ttatgaggga taaaatatag tctttttcta tcagtatgtt 60 

cacacttcct ggcctctcat tgggaagctg taagatgtcc ttcaataaga tcctgaacac 120 

gcgacagaat aatctcatta gagctgctgc aattttctgg accatatggt gggtctatag 180 

tcaggacccc agccacacag agagtccttg gagcgtctcc ctgttcagtg atggggatgt 240 

ggttcttctc aagccatttc tttaggctgt tctttctctc ttccagatcc tctgggctgt 300 

atgctttgca gtctccagac gtgaacaaat gcatcagctt ctccctcact ctatggtccc 360 

cttcattcat agtttcaaca gtckgcacag catgtcccat aattccggtc acagacatgc 420 

tgccatcttc aaggaagttc acaaggacaa tattggcaga gactgggtct gkagttaaam 480 

cccatccttt atactcattc ttctcactgg ctgtcactcg gacctctttg taaatgtaat 540 

cttgccattc taaggggcct ttcttcatcc attcactcat gattgccacc tggctaaatc 600 

agttaaaaaa ctcctcgcaa ctctgggtac t 631 

<210> 821 
<211> 635 
<212> DNA 

<213> Homo sapiens 
<400> 821 

aggttgctca cctgaaggag cacaggaggg ttttccaggc catgtggctc aggttcctca 60 

agcacaagct gcccctcagc ctctacaaga aggtgctgct gattgtgcat gacgccatcc 120 

tgccgcagct ggcgcagccc acgctcatga tcgacttcct cacccgcgcc tssgacctcg 180 

ggggggccct cagcctcttg gccttgaacg ggctgttcat cttgattcac aaacacaacc 240

tggagtaccc tgacttctac cggaagctct acggcctctt ggacccctct gtctttcacg 300 

tcaagtaccg cgcccgcttc ttccacctgg ctgacctctt cctgtcctcc tcccacctcc 360 

ccgcctacct ggtggccgcc ttcgccaagc ggctggcccg cctggccctg acggctcccc 420 

ctgaggccct gctcatggtc ctgcctttca tctgtaacct gctgcgccgg caccctgcct 480 

gccgggtcct cgtgcaccgt ccacacggcc ctcgagttgg aacgccgacc cttacgaacc 540 

ctgggagagg aggacccagc ccagagccgg gctttgggag agttccttgt tggatttttc 600 

agggccttnc agcggcatta ccaacttgag gtttt 635 

<210> 822 
<211> 752 
<212> DNA 

<213> Homo sapiens 
<400> 822 

tgcttttatc ttgaatgtag ccttcaactt tgtgtaattc cttaccaaaa aggccacatg 60 
gcttaaaatt caacacacat ttgtccccag tcttgtggtt tataatttcc acattgccat 120

actgttcgat ccacagttta cccacaatga tattatgcac acagcaggtg ggatttgtcc 180 

atgtatatgc ctcattgtgt tcaaggagct ccaaggtgat ggttcctttg ggttctgctt 240 

ctacactctt cccccagaat ttcagtttgg gatagataga gccatgaaag atgaagtcat 300 

tgtttaatcc ttcagcatga aatgcactga ttggtgggtg atggctgacc tgttcggaga 360 

tgagtctaaa tccaaggtca tctcgcacta attcataagt ctctcccagc agtgggttga 420 

aaggttttcc agtccgttcc cactgagaag caacagcaga tacagcaaac gcagctacac 480 

actgcatcct ttccacagga tcagagagtg aactggcctt gtggaygagg taagtatgct 540 

ccatgtattc agttaggcgc tgtaggaagc tcagaggctc attaaatata actggcatcg 600 

tgatcttgga tagttccatt ccaatacatt ttctgaggat gctccagata ctgaagtcat 660 

ttctggaaaa cataggagaa ggcaaacttg ttctgtgttt cttgatgcca ttggagagag 720 

catctccgcc accacagtct ttttcttcgg ac 752 

<210> 823 
<211> 899 
<212> DNA 
<213> Homo sapiens 



<400> 823 

tttgccacag ggtaaacttt tattttagaa tccaatcttt tccccacaca tacacaataa 60 

attaaacaga atccacagta aatgtacatt ttttaacata aaaagtcagt tactgttact 120

tcatgatcac atgaggatcg tcacagctcc gtgtccatta gcacattacc ctccttgtcc 180 

ttaactctta tccgaccgga tctgtacttc gtttcttgat gaccgtttgc atatacggtt 240 

ttaacagtgc catctgggta ttcccgtctc ttgaactggg cagtatgtag ttctctttgg 300 

ccattattaa actctatgag tttgttgcca tcacgttgta ctctgacaat tgtaccatct 360 

gggaaaatgc tttcttcttg tccatcagga aataagtttt taacagtctg gtcaggaaac 420 

gtgatttctt ttcttccatc tgggtaatgt ttttctrttt aaaaagttgt tacagtaaat 480 

attttttgaa ggaagggaag aatttaatga gagggtggag caagtttgta cctatttgtc 540 

cacttgagaa atgtaagact tccagtcctc cgggtatgtc gtgtgagtgg tctgggcagc 600 

tgcatagtag tagatctgta aagacacaca gtcagtctgc cttttctcca gagatggtta 660 

aactatggag gagaacactt ctggaaacat accactcttt ggtctggcat gacctgcttc 720 

acgtcaccat taaagaaagt gacagtgatg gtcttcccat ctgcactcac ttcctttcga 780 

gttccattgg gaaacagtat aacacggcac ccattcttat aaaccttttc cacctttcca 840 

tcaggatgac tgatttctcc ctgtatgtct tggtcttcct cctcctcttt atattcagg 899 



<210> 824 

<211> 1980 

<212> DNA 

<213> Homo sapiens 



<400> 824 

acccgtccgg ggccggccaa tttgcatatt tggaatgcgc cgctataaac ccggctgggg 60 

ttttgcagcg atttcttaga tgtaaaaatg agatctcaat agcagcgggc tgggcacatc 120 

ctcksmwytc ysskwswskm tstgcccrga gctggtttcc gtctctcggc tcggggctgg 180 

aactccggcc caacctaggc gcgcanccgc sacgagatgg cgcacttccg atcaatgtca 240 

aagccgccgg ggagccggga accccagcat gattcttggc ctttgttcgc ttctgatact 300 

aagagcagca cggtacatta tttcacttgt cccgctcccc ttcataacag aaaaagggga 360 

ctcaccctca agaagtgatt ggtatggtaa tttaaagcaa cgcgcattcg ctaggcctcg 420 

cgagcgtcgc cgcgcggaga agccagctgt cccttggcag tgatttcgga aatgtgtcaa 480 

ggcaattcca aaggtgaaaa cgcagccaac tggctcacgg caaagagtgg tcggaagaag 540 

cgctgcccct acacgaagca ccagacactg gagctggaga aggagtttct gttcaatatg 600 
taccttactc gagagcggcg cctagagatt agccgcagcg tccacctcac ggacagacaa 660

gtgaaaatct ggtttcagaa ccgcagatng aaactgaaga aaatgaatcg agaaaaccgg 720 

atccgggagc tcacagccaa ctttaatttt tcctgatgaa tctccaggcg acgcggtttt 780 

ttcacttccc gagcgctggt cccctccctc tgtcttcagg ctctgccagg aactcgcacc 840 

tgtgctggag ccctgttcct ccctcccaca ctcgccatct cctgggccgt tacatctgtg 900 

cagggctggt ttgttctgac tttttgtttc tttgtgtttg cttggtgctg gttwatttgt 960 

tgttttctgg gggaaaaagc catatcatgc taaaattcta tagagataga tattgtccta 1020 

agtgtcaagt cctgactggg ctgggtttgc tgtcttgggg tcccactgct cgaaatggcc 1080 

cctgtcttcg gccgagcntg gtttcctgcc cagcctgggg caaacctagc cgaaggccga 1140 

ggtcccattg ttggcgctga ggtgtctggc ctgaggtcaa tggtgcaaag gagccgccac 1200

cggcatgtct gcctggagtg ctgtgctgtg tttaatcagg ggatacaggc ccctgggttt 1260 

cttttttctt tcttcctttc ttccttggcc aagagaaggg cttacaggca tggacatgca 1320

ggttggcaaa cgggcttgac tttggctgat ttaaaaagtg agaaagaaag taaaaaaggt 1380 

taatttttcc ttcctctgta agatatccca gctttaaaaa gaaaaaaaaa aagaattacc 1440

aagagaaggg gacttctctt ccagtttctg taaggtctta cattgcctga ctaaaatgtt 1500 

tcatttacct ctaaatttcc atatccttct ggctgtagat aaataatgta gttttgttta 1560 

tgcatttgga attagtggat ttttttgtca ttaaaattgt taccactggt aacatgtgac 1620 

aagcacacca caattctccc tatcttgtga agttgttttt ttaaatcgcc ttgaacaaaa 1680 

agtttttttt tttgtttgtt tttgctttct gaaattcaca gaagcctagg aggactgggg 1740 

taagcggaat aaactagaga agggagacat tgtttggatt tccttcctat aaatacaaat 1800 

ctgtataaat gtctattatt atgaagaatt gccaatcttg ttttaagcaa atgcattcta 1860 

tcgttattat aaatgttagt tctagctcta tttacttcta atcttaaatc agaataaatt 1920 

aatattgtat tgctgctgtg cgtggaaaaa gacgatgttt atgttcttat agaataaaag 1980 

<210> 825 
<211> 333 
<212> DNA 
<213> Homo sapiens 



<400> 825 

tctagatatt gcccaatcgc tgcccacagt gcacatacct ttccaccagt cacatgtgag 60 

agggcagatt ttccaaatgc tcatcaccac ttggcactgt gtggactata attttggcca 120 

gttaggaaat ggcatctcat tgttttcatc ttaatttgcg tcagcctgat tactcattga 180 

aacttgtgag gttgagaaac ttttcttaag cttattggcc attcaagttt cctcctttat 240 

gaaatggttg ttcatgtcat ttkctcattt ttatattaga ttgkwtttmt wttttccagc 300 

tgacttgtag gaactctaca tcttatcaat att 333 



<210> 826 

<211> 658 

<212> DNA 

<213> Homo sapiens 



<400> 826 

tttttttttt tttttttttt ttttgaaggc ttcatgaata atttattcca tttgaagttt 60 

tgttttttgt ttttgttttt ttttttttaa aaagtataaa ccttttcatt tcctcaatca 120 

caatttgtac aactcagtgt tatggcattc ggcagcaata gtgtttgttc cttattctct 180 

ttttgtcacg ttaaaaanaa agcaattgga ccatattaaa tgtcactgct aaacaacaac 240 

tttaaaacgc cccttcataa agtgaccaag ctattttgag agggttgatg ctgacatgtc 300 

cagtaatgac gttacaattt gtagcttaaa ctcaataact ttaaggtcca catatccagt 360 

ttactttgaa aactaaagat gttttaaaac ttcatgaata catcaacctg aggagtattt 420 

taggkcccaa atccagtttt taaatttata ctccacnaaa aangaaaata catacataaa 480 
awtttaaacc mcngttytgg gcccattwaa acaccmaaaa agaccccccn aaaagttaag 540

anttccagct tanttctgga ngggtgggnc aaaatarraw kktwtawwma wwwymytwwt 600 
ccnkmattca gacaaactaa aatcttaaga ggaaacccag accaaaatat cactcatg 658 



<210> 827 

<211> 453 

<212> DNA 

<213> Homo sapiens 



<400> 827 

attatagaga ttaatctcct ttgctcgaag tctntttaaa tattagtcac atctaaaaca 60 

tacttttaca gcaacatcta gactggtgtt tgaccaaaca actgggcatc atagctgaca 120 

cataaaatta accatcacaa ccatgttcta ggcactgttc ctcactgcct gagaagacac 180 

cgttatgttt attagggttt ttgagtttta tccacagctt ttggttatct gcaaccatgt 240 

ctcccaccat taacatagtt cacactgaga tgaggattcc ctatttaaca cttggtccca 300 

acttcttcac agtccatctg gttttgtaga gggaacataa ctggacattc tggtcaggtt 360 

aggtgaggtc aggccttcag gacgctattt tcactgagtt gctttataag gcacattatg 420 

caaaattcca tcagctcttc tgttcactac att 453 

<210> 828 
<211> 657 
<212> DNA 

<213> Homo sapiens 



<400> 828 

aagagaagga cctagagatt gagaggctta agacgaagca aaaagaactg gaggccaaga 60 

tgttggccca gaaggctgag gaaaaggaga accattgtcc cacaatgctc cggccccttt 120 

cacatcgcac agtcacaggg gcaaagcccc tgaaaaaggc tgtggtgatg cccctacagc 180 

taattcagga gcaggcagca tccccaaatg ccgagatcca catcctgaag aataaaggcc 240 

ggaagagaaa gctggagtcc ctggatgccc tagagcctga ggagaaggct gaggactgct 300 

gggagctaca gatcagcccg gagctactgg ctcatgggcg ccaaaaaata ctggatctgc 360 

tgaacgaagg ctcagcccga gatctccgca gtcttcagcg cattggcccg aagaaggccc 420 

agctaatcgt gggctggcgg gagctccacg gccccttcag ccaggtggag gacctggaac 480 

gcgtggaggg cataacgggg aaacagatgg agtccttcct gaaggcaaac atcctgggtc 540 

tcgccgccgg ccagcgctgt ggcgcctcct gaccgtcgtc tcctcactcc gccttttcaa 600 

atttttgtat aaccccgtgt tgtgtaaata cagtttttgc tccggtaaaa aaaaaaa 657 



<210> 829 
<211> 775 
<212> DNA 

<213> Homo sapiens 



<400> 829 

ggtttgagaa aatcaattca aatctgnccc ttctgattgc anctctaacc aggttctgan 60 

cggtgtcaga gacttcccaa tacatttccc ttctagnatg cctcataaat ccactcaaaa 120 

gtaagacacc aaacacacac ctcatttcct gaactgtgac ttccaagctg acatttttct 180 

gagaagcata attattggtt tcattgacaa ttaagttgaa tgtttcatca tcaaaaaata 240

attcaaaaag ctctactggg ttcaactttt cgctcttgag attcaaaagt ccagaatcca 300 

gtgctgacca gcttggaaaa ttgggtttaa tgtctctttt ggtccaactc ttttctggga 360 

aacatgatac ctttaacttc ttttgagcag gctggatctc aggctcatta tctttttcca 420 

catctgagtc accagagaat gagaggcctt ggagcagttc actcactcga gctttgtctt 480 
tttttctccc ttttcgggta atgtctcctg cagcatattc cagggatgag atgtgcatgc 540

gggcccacaa atcacctggg tgacggtcct tcagagtgtt caaatgtgca actgtccttt 600 

cagtagcaat aggagtacta caaggaatct ggggtgcaca ctctctgttg ggctttcctg 660 

aggcttctcc actttgttcc atttcttcag aagtttcttg ctttgcttta aacaatctat 720 

ctttagttac aatttcttca gctgggtgta gccccagctt tttagaaggc tgagg 775 

<210> 830 
<211> 413 
<212> DNA 
<213> Homo sapiens 

<400> 830 

agagcctgca agtgacaaag gaagtgaggc agaggcccac atgcccccac cgttcacacc 60 

ctacgtgcct cggattctga acggcttggc ctcggagagg acagcactgt ctccgcagca 120 

gcagcagcag cagacctatg gtgccatcca caacatcagc gggactatcc ctggacagtg 180 

cttggcgcak agcsmcasgk gcagtgtggc ntgctgcccc ccaggaggcc tgaggctggg 240 

tctcactgct ctgaaaagac acaaccagaa tggcctgggg ctcaggccct tggctgagtg 300

ggaatgcgtt gggactgccc agctgagcta tcaggtgccc atcttttctg gtmccagcag 360 

tggtgaggag agcacaggca ggcctcgccc ctcccttgct canccagttt ccc 413 

<210> 831 
<211> 876 
<212> DNA 
<213> Homo sapiens 

<400> 831 

gctgacctac agcagaagct gctggatgca gaaagtgaag acagaccaaa acaacgctgg 60 

gagaatattg ccaccattct ggaagccaag tgtgccctga aatatttgat tggagagctg 120 

gtctcctcca aaatacaggt cagcaaactt gaaagcagcc tgaaacagag caagaccagc 180 

tgtgnykaca tgcakaagat gctgtttgag gaacgaaatc attttgccga gatagagaca 240 

gagttacaag ctgagctggt cagaatggag caacagcacc aagagaaggt gctgtacctt 300 

ctcagccagc tgcagcaaag ccaaatggca gagaagcagt tagaggaatc agtcagtgaa 360 

aaggaacagc agctgctgag cacactgaag tgtcaggatg aagaacttga gaaaatgcga 420 

gaagtgtgtg agcaaaatca gcagcttctc cgagagaatg aaatcatcaa gcagaaactg 480 

accctcctcc aggtagccag cagacagaaa catcttccta aggataccct tctatctcca 540

gactcttctt ttgaatatgt cccacctaag ccaaaacctt ctcgtgttaa agaaaagttc 600 

ctggagcaaa gcatggacat cgaggatcta aaatattgtt cagagcattc tgtgaatgag 660 

catgaggatg gtgatggtga tgatgatgag ggggatgacg aggaatggaa gccaacaaaa 720

ttagttaagg tgtccaggga agaacatcca agggtgttcc tgcaagggct ggtgtgggaa 780 

ccangccagt gtggggttcc aggnaagcca aaagtncaga ctggtggtgt tgactgtttg 840 

ctgtgacccc cacaaagttt ncggaaccgc ccacca 876 

<210> 832 
<211> 768 
<212> DNA 
<213> Homo sapiens 

<400> 832 

tagacataga aaacatacag taagaatatg gtattataat cttacggsam mamygysrmm 60 

trnsckkknw rwmktkgwaa agykgyrmyr sgrcsyanra mtanmmmtas ctrgytrrky 120 

mrywtwwmma tycctksccm gggagtttga aatttnatac tatagaaata actttaggtt 180 
ttaggtagag ttaaagaggt aaagcacatg ttgnccacaa ncccaggaaa gtatttttaa 240

gaaagattgg attttcctac ctttagagat ctaaaaaaaa tttaatataa aaaatcattt 300

tgtgttggtg tttattacta gttcagatga gtggctgctg aaggggcccc cttgtcattt 360

tcattataac ccaatttcca cttatttgaa ctcttaagtc ataaatgtat aatgacttat 420

gaattagcac agttaagttg acactagaaa ctgcccattt ctgtattaca ctatcaaata 480

ggaaacattg gaaagatggg gaaaaaaatc ttattttaaa atggcttaga aagttttcag 540

attactttga aaattctaaa cttctttctg tttccaaaac ttgaaaatat gtagatggac 600

tcatgcatta agactgtttt caaagctttc ctcacatttt taaagtgtga ttttcctttt 660

aatatacata tttattttcy ttaaagcagc tatatcccaa cccatgactt tgggrgatat 720

acccataaaa ccmatataac agcaggggta ttggagcagc tttctcaa 768



<210> 833 
<211> 1604 
<212> DNA 

<213> Homo sapiens 
<400> 833 

aactagtata tttacaacat cagaaacttc aatatggaga tttgttgttc ctatatcatg 60 

atctttagca gcaactacac cataggcact gcacaacctg ggtcctagat caggacgtac 120 

aaaaaatcct ggcaaatgag aggccaaatt gaattttcct tctggattac aatattctgg 180 

caatggcaga ctttttaaaa gatcttcgta tcttgctggc atcatagtct tgaagtcttc 240 

tcctgaaggc caatctttca attttaaaac aactgtttct ccactcttgt ttttctgccg 300

ttttgaaact tcttcaaaac catcccagaa ttccttaaca ttggcatttg aaatgatgct 360 

atctttgcag ttcaggagat cagcttggtg gtctccaaaa tcaagactaa ttgattccgc 420 

cttccatagg ctaatgttca ttttcttatg cacaccagaa accactgcag gctgtccttg 480 

tttccaacat tctttgaaaa gcttccaatt actgctattc ttataatcct taagccataa 540 

aatatgcttc tcacagatcc aagaatgtgg tatatcactg tataatttat tattttcatc 600 

cactgcagat attatgcttt cttcaggctc ttctttaagc tctggtttta catttatctt 660 

ggaggtttta cttggtggaa ttttgttttc aacaactgaa gcaattatgt catcaagaat 720 

gttaggcata gtccgtccac ttttgctact tggggctccc attgaatata ctggggcaaa 780 

ggcaatgcca gcatctgtas accccacacg tagctttcca gctgttgtag tcagcaaatc 840 

ccgtaaggtt gagccttgtt cattattctg ggacacaaga ggtgatgttc tgccatttgg 900 

agattcagag ttgtcttgtt ctctttcttc tttaatttgg ttttcaaggg taagttcttt 960 

gttttctttt ttttcctctc tggctttttg ctctgcaaga tctgctaacc agtgcagtgg 1020 

tgactgggat tctggaggag ttaacttgtt atctgtgcct acatcactct ctgggctgct 1080 

gccaccattt ttctcagact tcggaggagt attttgctgc tgagactcag gcatgcacag 1140 

agaaatttta ttactgtgat taagaacatt ctgtaaaact tgagatacac cattcattgt 1200

aggaaaattt ccaacttgta aattctgttt gttagtacaa tgacaatggg atttaatacc 1260 

atatttttcc ctaagagtgt gcatggcatc tagaagatct gtcaaaacag aaccaggtat 1320 

aatttgggtt ggcattaaat gtttgtgatc atgaggctgt ccyttcacac acttcatcca 1380 

agcatantag ttctttatcy ctagaactnc tycctttcct ttngccttgt aacaatctaa 1440

gcaganccac aawkccacat tttkggcaga cccagtnraw kktaancawk gntgcttcac 1500

atgcatcaca catctcccgg actcctctca ctgctctttt ccaggcaatt ttggcatcct 1560

ttttcaccca ggacaaagct gttttttcag atgttactaa ttga 1604 

<210> 834 
<211> 617 
<212> DNA 

<213> Homo sapiens 
<400> 834 
gtccgtcagc tggtagcttt cattcgtaaa agagataaaa gagtgcaggc gcatcgaaaa 60

cttgtggaag aacagaatgc agagaaggcg aggaaagccg aagagatgag gcggcagcag 120

aagctaaagc aggccaaact ggtggagcag tacagagaac agagctggat gactatggcc 180

aatttggaga aagagctcya ssangrtgrm srcrsgkkac gagaaggagt ttggagatgg 240

atcggatgaa aatgaaatgg aagaacatga actcaaagat gaggaggatg gtaaagacag 300

tgatgaggcc gaggacgctg agctctatga tgacctttac tgcccagcat gtgacaaatc 360

gttcaagaca gaaaaggcca tgaagaatca cgagaagtca aagaagcatc gggaaatggt 420

ggccttgcta aaacaacagc tggaggagga agaagaaaat ttttcaagac ctcaaattga 480

tgaaaatcca ttagatgaca attctgagga agaaatggaa gatgcaccaa aacaaaagct 540

ttctaaaaaa cagargaaaa agaaacagaa accagcacag gatgtacctg gcaaagattc 600

atatctgcct gcagctc 617



<210> 835 
<211> 542 
<212> DNA 
<213> Homo sapiens 

<400> 835 

tttttttttt agaccaacat tctttaatca caaaggcact tgaggacccc tacaaaccca 60 

aagtctctgc caagagtggc cctgcagacg ccccacctgc caccctccat ccacccatcc 120 

atccacacac tcagagttca tcgtgacctg cagagggctc cacactaggc ttgatgaaga 180

tgccttccat ggccttccac gtattgtgcg tgttggcact gggcatgccg tggacctcat 240 

gctgcccacg gatggggctt ccatactgct cacccgtgac tgacaggaac acagaggtgc 300 

ccacatgctn grarsgcaca gcagcctcac gctcccagnn gctgntccag agcagcgcac 360 

tgtccatann gktccaggtc gtcgccctcg ccgtcttccc caaaggcact cacctcctgg 420 

ttgttggaca gcggcgangg gaagtggtgc gtgtgcaggt tcnttgnccg taagcacatg 480 

cgtgagcctc accgcctgcc cgcagcgcac cgcaagggcc caggcggagc cgacgctcgc 540 

gc 542 

<210> 836 
<211> 542 
<212> DNA 
<213> Homo sapiens 

<400> 836 

tttttttttt agaccaacat tctttaatca caaaggcact tgaggacccc tacaaaccca 60 

aagtctctgc caagagtggc cctgcagacg ccccacctgc caccctccat ccacccatcc 120 

atccacacac tcagagttca tcgtgacctg cagagggctc cacactaggc ttgatgaaga 180 

tgccttccat ggccttccac gtattgtgcg tgttggcact gggcatgccg tggacctcat 240 

gctgcccacg gatggggctt ccatactgct cacccgtgac tgacaggaac acagaggtgc 300 

ccacatgctn grarsgcaca gcagcctcac gctcccagnn gctgntccag agcagcgcac 360 

tgtccatann gktccaggtc gtcgccctcg ccgtcttccc caaaggcact cacctcctgg 420 

ttgttggaca gcggcgangg gaagtggtgc gtgtgcaggt tcnttgnccg taagcacatg 480 

cgtgagcctc accgcctgcc cgcagcgcac cgcaagggcc caggcggagc cgacgctcgc 540 

gc 542 

<210> 837 
<211> 719 
<212> DNA 
<213> Homo sapiens 
<400> 837

aaaaggtccc ccttctggga aagaccgagt gaagaaaggt ggatcctaca tgtgccatag 60 

gtcttattgt tacaggtatc gctgtgctgc tcggagccag aacacacctg atagctctgc 120 

ttcgaatctg grnttccgct gtncagccga ccgnctgccc actatngact gacaaccaag 180 

gaaagtcttc cccantccaa ggagcagtcg tgtctgacct acattgggct tttctcagaa 240 

ctttgaacga tcccatgcaa agaattccca ccctgaggtg tttnacatac ctgcccaatg 300 

ncaaaggaac cgccttgtga gaccaaattg ctgacctggg tcagtgcatg tgctttatgg 360 

tgtggtgcat ctttggagat catcgccata ttttactttt gagagtcttt aaagaggaag 420 

gggagtggag ggaaccctga gctaggcttc aggaggcccg cgtcctacgc aggctctgca 480 

caggggttag accccaggtc cgacgcttga ccttcctggg cctcaagtgc cctcccctat 540 

caaatgacag ggatggacag catgacctct gggtgtctct ccaactcacc agttctaaaa 600 

agggtatcag attctattgt gacttcataa gtgagaattt atgatagatt attttttagc 660

tattttttcc atgtgtgaac cttgagtgat actaatcatg taaagtaaga gttccctta 719 

<210> 838 
<211> 579 
<212> DNA 

<213> Homo sapiens 



<400> 838 

aagatatgca gagatattcc aggatctttt agctttggtg cggtctcctg gagacagtgt 60 

tattcgccaa cagtgtgttg aatatgtcac atccattttg cagtctctct gtgatcagga 120

cattgcactt atcttaccaa gctcttctga aggttctatt tctganctgg agcagctctc 180 

caattctcta ccaaataaag aattgatgac ctcaatctgt gactgtctgt tggctacgct 240

agctaactct gagagcagtt acaactgttt actgacatgt gtcagaacaa tgatgtttct 300 

tgcagagatg attatggatt atttcattta aaaagttctt taaggaaaaa cagtagtgct 360 

ctgcatagtt tactgaaacg agtggtcagc acatttagta aggacacagg agagcttgca 420 

tcttcatttt tagaatttat gagacaaatt cttaactctg acacaattgg gatgctgtgg 480 

gagatgataa tgggtctcat gggaagtagg aggggagctc atacatcacg gacgatgagt 540 

attaatgctg cagagttaaa ccagcttctt ccaaggcaa 579 

<210> 839 

<211> 1172 

<212> DNA 

<213> Homo sapiens 



<400> 839 

aaccaaacct cccaacttag tgaaaacaag gcattcaatg acagaccagc agcagaaact 60 

gcntattacc tcctaatcat tttatgaaga aatacctata taaaaacaaa cactaaagag 120 

nacaaataga tttaactaaa gtgacaagca taattataaa taaataccag attatcagat 180 

tttaaacaat aatctataac agttttacta tctaaggatt ttcactccaa gaagaaaaaa 240 

tacatagtaa cgccaagctt gcaggacgat gacttaacag atacattttc tcttaatgga 300 

aacttatcta gcttcagtaa tatttctgga tgtagcatca agttgctgtt gcacattttt 360 

aaaagactgg tccagcagtg tttcctcttc atttaaagta ttggcaatag catcattaca 420 

tggattgtcc agaatgtctt cgtttaatcc atttgactcc tccttttgat cctcatcagt 480 

attaacctct tcaaccgtgt gtgccctggg tgtattcatt aacatatcat ttccyagggt 540 

ctgactatta ctcagcagct tkgcctgcct tctttccarg gccagttggt twatttcycy 600 

caattctttg ttgttgctct tctgttaggc ttctacttaa ctcagaagca aacatctcac 660 

tttcagataa gtttgtcaga aagggatcta attcagtaga agtgacatca tgttcattat 720 

tctccgcaac ttcatcatta ttgctaacaa aatcttcatg taaaataggg agatcaagtc 780 

gaattcgttt taaacaggtc tgaacttcct ttttacttcc caggtattca actctgtcaa 840 
taaaatcctc aaactgcagt ttagggaata gcctatgtgc ccagtgctcc atgtgtctga 900

ttagcatctt caagtcttca gcctcatgac ctttaccttt gaattttgcc ttatcaaata 960

catgccttaa ggctggaagt cctctctctg aaattaatct ctgagcatcc agcttgggta 1020

tatttctttt aactgttctc tttggaggta caggaacagg tgctccattt cctgactctt 1080

catcaggctc agttccttca ccatcttgtc tctctggaga ggctggaggt gggaaaggag 1140

gaaaagtttc atcttctaca tgctcataat ct 1172



<210> 840 

<211> 1145 

<212> DNA 

<213> Homo sapiens 

<400> 840 

cctcctactc ccaaacaaat ctttggggaa aaaaaaacta ccaactgtca gccatgggcc 60 

tgacggcgct aagctctggg gctccgtgca ctgacgtggg gccagccaca gggaggcggg 120 

gatsmrgymg cgngassscm ggakywkgrs cwscwscsrs gymrgkwgca gnrgcrgygg 180 

crhcrsganc mrmagcagcn tgmwgcagct cawgcacctg gagtcctttt aygaaaaamc 240 

yyctcctggg cttatcaagg aagatgagac taagccagaa gattgcatac cagatgtacc 300 

aggcaatgaa cacgccaggg aatttctggc tcatgcacca actaaaggac tttggatgcc 360 

actggggaaa gaagtcaaag ttatgcagtg ttggcgttgc aaacgctatg gtcaccgaac 420

gggtgacaaa gaatgccctt tctttatcaa aggcaaccaa aagttagagc agttcagagt 480 

ggcacatgaa gatcccatgt atgacatcat acgagacaat aaacgacatg aaaaggacgt 540

aaggatacag cagttaaaac agttactgga ggattctacc tcagatgaag ataggagcag 600 

ctccmgttcc tctgaaggta aagagaaaca caagaaaaag aagaagaaag aaaagcataa 660 

gaaaaggaag aaagaaaaga aaaagaagaa aaaacggaag cacaaatctt ccaagtcaaa 720

tgagggttct gactcagagt gacaaggatg tgacttgttc aacattctct tctcaaacac 780 

tgaccaagga acagaggaag atgcagtcag agaaagcagc aggatagaga cgccgagaga 840

ggagtatatg tgggtcacag cagtgagctc ccacccgcct tgcagtgaag atgtgacccc 900 

aggagaggga gtgtctcctt ccaggtgcta gctctggaca gcagctgatt ttaggcagga 960 

aagtttcttc atcgttgtcc tccctgctgg tcacatgagt ttacgattcc tttgaagtgt 1020 

ctcccacagg gtggcaggac tgggagaatc tctgaggcgt gtcttccagg ccctcccaca 1080 

gcttgtgccc tccacagtgt ggactcaggt cccatagaca tcaggctgga gtcttctctg 1140 

ttgtt 1145 

<210> 841 
<211> 642 
<212> DNA 
<213> Homo sapiens 

<400> 841 

ttttttataa aaataaatat ttattgccat ttgaagcttt atgtacacct ttaaaagcac 60 

atgtacaaat gtgggaaatt acaaaaatca acctaaaacc ctttttctca aagtatacat 120

aaatgtacat ccaagatcag tggtgctacc atcattagaa taaaaaataa gtctgtctgg 180 

acataaacaa gcaatcattt taagtgtcat tcagatattc tcctttatat ttaaaactcc 240 

aaaaaatact aagaggccca atatatccag aaaattgtgt tttcacttta ccctaactta 300 

tgaatagtgg tatacaaata tatttccatc tttttgtcca gccagcaaat gagagtctgt 360

acccgaccat ttcacaaaag accaatgttg gtcagagaca gskskgagrr ksgymktasr 420 

stkamysasa akkarstsmm amayrgsrmt tnykcmasra stcamkmtyk ytgsyrcasr 480 

gwkrwctyws rmswmwmmwk msargmmcca tttcagaata ggctttgtga cagactgaag 540 

cttggtaaga atcatcaatg tgcatctttt tcaggagttg accagttttt aaattccaaa 600 

taacaatgtt gttcataata gtagtaccaa gcagagcttc tt 642 
<210> 842
<211> 452 
<212> DNA 
<213> Homo sapiens 

<400> 842 

acggcctggt ggagcagctg tacgacctca ccctggagta cctgcacagc caggcacact 60 

gcatcggctt cccggagctg gtgctgcctg tggtcctgca gctgaagtcg ttcctccggg 120 

agtgcaaggt ggccaactac tgccggcagg tgcagcagct gcttgggaag gttcaggaga 180 

actcggcata catctrcagc cgccgccaga gggtttcctt cggcgtctct gagcagcagg 240 

cagtggaagc ctgggagaag ctgacccggg aagaggggac acccytgacc ttgtactaca 300 

gccactggcg caantgcgtg accgggagat ccagctggag atcagtggca aagagcggct 360 

ggaagacctg wacttccctg agatcaaacg aaggaagatg gctgacagga aggatgagga 420

caggwagcaa tttaaagacc tcttttgacc tg 452 

<210> 843 
<211> 805 
<212> DNA 

<213> Homo sapiens 
<400> 843 

ggcttataca acatagtggg gaacgcatgg gaatggactt cagactggtg gactgttcat 60 

cattctgttg aagaaacgct taacccaaaa ggtccccctt ctgggaaaga ccgagtgaag 120 

aaaggtggat cctacatgtg ccataggtct tattgttaca ggtatcgctg tgctgctcgg 180 

agccagaaca cacctgatag ctctgcttcg aatctggrnt tccgctgtnc agccgaccgn 240 

ctgcccacta tngactgaca accaaggaaa gtcttcccca ntccaaggag cagtcgtgtc 300 

tgacctacat tgggcttttc tcagaacttt gaacgatccc atgcaaagaa ttcccaccct 360 

gaggtgtttn acatacctgc ccaatgncaa aggaaccgcc ttgtgagacc aaattgctga 420

cctgggtcag tgcatgtgct ttatggtgtg gtgcatcttt ggagatcatc gccatatttt 480 

acttttgaga gtctttaaag aggaagggga gtggagggaa ccctgagcta ggcttcagga 540 

ggcccgcgtc ctacgcaggc tctgcacagg ggttagaccc caggtccgac gcttgacctt 600 

cctgggcctc aagtgccctc ccctatcaaa tgacagggat ggacagcatg acctctgggt 660 

gtctctccaa ctcaccagtt ctaaaaaggg tatcagattc tattgtgact tcataagtga 720

gaatttatga tagattattt tttagctatt ttttccatgt gtgaaccttg agtgatacta 780 

atcatgtaaa gtaagagttc cctta 805 

<210> 844 
<211> 702 
<212> DNA 
<213> Homo sapiens 

<400> 844 

tttttttttt tttttttgca ggtgcatttg tttctttatt taaaaaaatc atctgggggc 60 

atggtctgag gaggacaccc ctcccatggc tttggggagg acgcaggttc caggagtcac 120 

agggcagaaa cacgcggggt gggtgggggc gtggccggag tggggagggg ctgtsccagg 180 

cacctggggg tggctcccac ggcaccaggt gggctagggc aacagtatgt acaggcgagc 240 

agtgctcctg gacccggtcg gggccggctg gggcccattt ctgcggcagg ggagctctgg 300 

ggcacagggt ctgagtccca tcttgggctk cagggaccgc gaggscgtcc agggaggctg 360 

gacagcgggg gcctttatct gggcccatca ggtggatgag aacggacact gcaaaccgct 420

caccacctgg gccagggcta ggctatccgg cagggcctcc ccmmctgaat cctgcgtgcg 480 

cagaactcaa gccggcatnc aggcagtkgg aacgncccgc angctgggct tggktgsyck 540

crsgcacgtg acaggtgggg cccgtgtcct gataaacgga caggaacaaa aggaacgcaa 600

ggtctgggac ccacggctct gggagcagcg ccacccaggc tggctcctag cagagaaatg 660

ggaatcgcaa atgcattgca atgtgcagtg aagagacgcg ag 702
because they relate to parts of the International Application that do not comply with the prescribed requirements to such
an extent that no meaningful International Search can be carried out, specifically:
3. Claims Nos.:

— because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 



Box II Observations where unity of invention is lacking (Continuation of item 2 of first sheet) 



This International Searching Authority found multiple inventions in this international application, as follows:

SEE ADDITIONAL SHEET 



1. [ ] As all required additional search fees were timely paid by the applicant, this International Search Report covers all
searchable claims.
FURTHER INFORMATION CONTINUED FROM PCT/ISA/210



1. Claims: Invention 1: claims 1-7 

A library of polynucleotides comprising the sequence 
information of at least one of the sequences 1-844. 



2. Claims: Invention 2: claims 8,13-19,21 all partially 

The isolated nucleic acid with seq.ID 1, sequences with at 
least 90% sequence identity therewith and degenerate
variants thereof, host comprising said nucleic acid, peptide 
encoded by said nucleic acid, antibody against said protein, 
vector comprising said nucleic acid, and a method for 
detecting the differential expression of said nucleic acid. 



3. Claims: Inventions 3-845: claims 8-22, all partially, 
as far as applicable 



As invention 2, but limited respectively to the seq.ID's 
2-844 

For the sake of conciseness, the second subject matter is 
explicitly defined, the subject matters of inventions 3-845 
are defined by analogy thereto. 
In view of the large number of libraries, which are defined by the
general definition in the independent claim 1, the search had to be
restricted for economic reasons. The search was limited to the libraries
for which data was given in the description, or libraries derived from
cell lines mentioned in table 4 of the description, and to the general
idea underlying the application (see Guidelines, Part B, Chapter III,
paragraph 3.6). 



